Post recenti
Pag. 67 di 85 · 1,011 post
Pubblicato 8 ott
Hugging Face (Twitter) RT @AntLingAGI: 🚀 Ling-1T — Trillion-Scale Efficient Reasoner Introducing Ling-1T, the first flagship non-thinking model in the Ling 2.0 series — 1 Trillion total parameters with ≈ 50 B active per token, trained on 20 T+ reasoning-dense tokens. Highlights → Evo-CoT curriculum + Linguistics-Unit RL for scalable reasoning → Strong efficiency–accuracy balance on complex reasoning tasks → Advanced visual understanding + front-end code generation via Syntax–Function–Aesthetics reward → Emergent tool-use ability (≈ 70 %) with minimal instruction tuning → FP8 mixed-precision + Ling Scaling Law → efficient trillion-scale training Efficient Thinking · Precise Reasoning Ling-1T extends the Pareto frontier of reasoning accuracy vs. cost — a new milestone in open-source trillion-scale intelligence.
Pubblicato 8 ott
Hugging Face (Twitter) RT @ClementDelangue: The @LeRobotHF team is studying the @UnitreeRobotics G1 today in case you have any questions or fun stuff you want us to try!
Pubblicato 8 ott
Hugging Face (Twitter) RT @wjb_mattingly: I'm a huge fan of hf jobs. Working on a way to do the same thing on Yale's HPC. Should work for most HPCs using slurm. It handles dataset creation, job creation, job submission, ssh, etc. This is in no small part thanks to @vanstriendaniel 's great work and the team at @huggingface .
Pubblicato 8 ott
Hugging Face (Twitter) RT @ClementDelangue: Very cool paper! You can discuss with the author here: https://huggingface.co/papers/2510.04871
Pubblicato 8 ott
Hugging Face (Twitter) RT @jm_alexia: TRM is now the #1 trending paper on the Daily Papers
Pubblicato 8 ott
Hugging Face (Twitter) RT @xenovacom: Introducing Granite Docling WebGPU 🐣 State-of-the-art document parsing 100% locally in your browser! 🤯 🔐 No data sent to a server (private & secure) 💰 Completely free... forever! 🔂 Docling ecosystem enables conversion to HTML, Markdown, JSON, and more! Try out the demo! 👇
Pubblicato 7 ott
Hugging Face (Twitter) RT @ClementDelangue: The community added 1 million new repos (models, datasets, spaces) on @huggingface in the past 90 days! For context, it took six years to get to the first million repositories. That's now a new repositories created on HF every 8 seconds. What's cool is that: - 100% are now powered by Xet, our technology for faster, cheaper, more efficient data transfer. Lots of exciting features to come unlocked by this like in-browser GGUF editing we just announced - 40% are private repositories which shows that people are increasingly using the hub internally within their organizations to share weights, datasets and demos. Enterprise hub subscriptions are our fastest growing line of revenue. Next milestone is to reach 10 million total repositories! Ultimately there will be more AI repositories than code repositories for all to build AI thanks to open-source. Let's go!
Pubblicato 7 ott
Hugging Face (Twitter) RT @vanstriendaniel: DoTS.ocr from @xiaohongshu just got native @vllm_project support! I built a UV script so you can run SOTA multilingual OCR in seconds with zero setup using @huggingface Jobs Tested on 1800s library cards - works great ✨
Pubblicato 7 ott
Hugging Face (Twitter) RT @maximelabonne: LFM2-8B-A1B just dropped on @huggingface! 8.3B params with only 1.5B active/token 🚀 > Quality ≈ 3–4B dense, yet faster than Qwen3-1.7B > MoE designed to run on phones/laptops (llama.cpp / vLLM) > Pre-trained on 12T tokens → strong math/code/IF
Pubblicato 7 ott
Hugging Face (Twitter) RT @TheZachMueller: Smol MoE's are here https://huggingface.co/LiquidAI/LFM2-8B-A1B
Pubblicato 7 ott
Hugging Face (Twitter) RT @ArtificialAnlys: Recent open weights releases are reducing the gap to proprietary frontier models on agentic workflows On the Terminal-Bench Hard evaluation for agentic coding and terminal use, open-weights models such as DeepSeek V3.2 Exp, Kimi K2 0905, and GLM-4.6 have made large strides, with DeepSeek surpassing Gemini 2.5 Pro. These advances reflect significantly higher capability for use in coding and other agent use cases, and developers have a wider range of model options than ever for these applications. See below for our analysis of the price and performance of providers to help you make use of these leading models 👇
Pubblicato 7 ott
Hugging Face (Twitter) RT @_lewtun: Just smuggled these trade secrets into Canada