Post recenti
Pag. 1 di 85 · 1,011 post
Pubblicato 24 giorni fa
AI telegram bot. — @aigram
Pubblicato 17 feb
Hugging Face (Twitter) RT @evijit: Today, @evaluatingevals is introducing Every Eval Ever, a unified, open data format and public dataset for AI evaluation results.
Pubblicato 17 feb
Hugging Face (Twitter) RT @Cohere_Labs: Introducing ✨Tiny Aya✨, a family of massively multilingual small language models built to run where people actually are. Tiny Aya delivers strong multilingual performance in 70+ global languages in a 3.35B parameter model, efficient enough to run locally, even on a phone.
Pubblicato 17 feb
Hugging Face (Twitter) RT @Cohere_Labs: Very special to work with our @huggingface friends to bring Tiny Aya, the most capable multilingual open-weight model at its scale to the world! 🚀 Big thanks to @ngxson for the huge help merging the changes into llama.cpp.
Pubblicato 16 feb
Hugging Face (Twitter) RT @AdinaYakup: Happy Spring Festival🧧🐎 Here’s to another year of building and sharing! 新的一年,继续开源同行 🤗
Pubblicato 16 feb
Hugging Face (Twitter) RT @NielsRogge: Dots.ocr is known for being among the SOTA for OCR Looks like the new 1.5 model is SOTA on OlmOCRBench! https://huggingface.co/rednote-hilab/dots.ocr-1.5
Pubblicato 16 feb
Hugging Face (Twitter) RT @Alibaba_Qwen: 🚀 Qwen3.5-397B-A17B is here: The first open-weight model in the Qwen3.5 series. 🖼️Native multimodal. Trained for real-world agents. ✨Powered by hybrid linear attention + sparse MoE and large-scale RL environment scaling. ⚡8.6x–19.0x decoding throughput vs Qwen3-Max 🌍201 languages & dialects 📜Apache2.0 licensed 🔗Dive in: GitHub: github.com/QwenLM/Qwen3.5 Chat: chat.qwen.ai API:https://modelstudio.console.alibabacloud.com/ap-southeast-1/?tab=doc#/doc/?type=model&url=2840914_2&modelId=group-qwen3.5-plus Qwen Code: github.com/QwenLM/qwen-code Hugging Face: https://huggingface.co/collections/Qwen/qwen35 ModelScope: https://modelscope.cn/collections/Qwen/Qwen35 blog: qwen.ai/blog?id=qwen3.5
Pubblicato 16 feb
Hugging Face (Twitter) RT @RisingSayak: Will be there at the @OfficialINDIAai Impact Summit in Delhi from 17-19. Will also present ReflectionFlow at the symposium on the 18th. Kinda surprised there’s no real discussion about open science and open source, given the stellar speakers. Anyway, looking forward to it!
Pubblicato 15 feb
Hugging Face (Twitter) RT @j_dekoninck: Introducing QED-Nano: a 4B model for mathematical proof writing, competitive with larger models like GPT-OSS-120B. We open-source our entire pipeline, including data, code, and a blog post, hoping that the community can build on these artifacts to create more specialized models.
Pubblicato 15 feb
Hugging Face (Twitter) RT @_lewtun: We trained a tiny 4B model to reason for millions of tokens through IMO-level problems. Heaps excited to share our new blog post covering the full pipeline, from distilling the 🐳 to augmenting RL with a reasoning cache that unlocks extreme inference-time scaling for theorem proving. https://huggingface.co/spaces/lm-provers/qed-nano-blogpost
Pubblicato 14 feb
Hugging Face (Twitter) RT @RisingSayak: An ambitious project today 🔥 We got an agent to write custom kernels that actually work for a given model, hardware instruction set, and other relevant model-dependent constraints. Benchmarks are our rewards here 🤪 We got these kernels to work with Diffusers and `torch.compile` and they delivered ACTUAL SPEEDUP without messing up the quality 🏆 Despite the competitive landscape, we don't like to keep things private. Read all of it in the blog post below: https://huggingface.co/blog/custom-cuda-kernels-agent-skills
Pubblicato 12 feb
Hugging Face (Twitter) 💚💚💚https://twitter.com/NVIDIAAIDev/status/2022023402047799389#m