Post recenti
Pag. 77 di 85 · 1,011 post
Pubblicato 16 set
Hugging Face (Twitter) RT @MaziyarPanahi: Introducing 90+ open-source, state‑of‑the‑art biomedical and clinical zero‑shot NER models on @HuggingFace by @OpenMed_AI Apache‑2.0 licensed and ready to use Built on GLiNER and covering 12+ biomedical datasets 🧵 (1/6)
Pubblicato 16 set
Hugging Face (Twitter) RT @nathanhabib1011: 🚀 Just updated lighteval’s readme—can’t believe we’ve grown to cover ~7,000 tasks 😳 with top-tier multilingual support 🌍 llm as judge 🤖 multiturn evals 🗣️ coding benchmarks 🧑💻
Pubblicato 16 set
Hugging Face (Twitter) RT @Ali_TongyiLab: 1/7 We're launching Tongyi DeepResearch, the first fully open-source Web Agent to achieve performance on par with OpenAI's Deep Research with only 30B (Activated 3B) parameters! Tongyi DeepResearch agent demonstrates state-of-the-art results, scoring 32.9 on Humanity's Last Exam, 45.3 on BrowseComp, and 75.0 on the xbench-DeepSearch benchmark.
Pubblicato 16 set
Hugging Face (Twitter) RT @reach_vb: Talking about the state of Open Source LLMs at @aiDotEngineer next week! 🔥 Quite excited for the talk and meeting everyone - let's goo! 🤗
Pubblicato 16 set
Hugging Face (Twitter) RT @laurentsifre: We’ve been cooking this summer: Holo1.5 is here! SOTA UI localization + QA, 3× gains vs Qwen-2.5 VL 🍳 Now up to 72B 💥 — a strong base for computer-use agents like Surfer. • Open weights on HuggingFace 🤗https://huggingface.co/Hcompany/Holo1.5-7B • Blog post 📝hcompany.ai/blog/holo-1-5 (1/n 🧵)
Pubblicato 13 set
Hugging Face (Twitter) RT @LucSGeorges: we've been pushing commits to transformers discretely, time to talk about we've been cooking the last few months: ⚡️ Continuous Batching is in transformers ⚡️ this will simplify, most notably, evaluation and your training loop: no need for extra dependencies or infra to get fast inference, and no need for convoluted code to update your weights note that speed is currently not on par with the best inference frameworks and servers out there and probably never will be the goal is *not* to become as fast: we want to complement the existing landscape with features like these, aiming for transformers to be the toolbox for tinkering with and building models
Pubblicato 13 set
Hugging Face (Twitter) RT @art_zucker: 🚀 Big news: we’re moving towards the v5 release of transformers! After months of teasing, it’s finally happening 🎉 What to expect in v5: ✨ Cutting-edge stack — fast models, with fast kernels ✨ Smarter defaults — better out-of-the-box experience ✨ Cleaner codebase — warnings & legacy bits removed The goal? To make transformers the most robust, modern, and developer-friendly ML library out there. Stay tuned — it’s going to be huge. 🔥
Pubblicato 12 set
Hugging Face (Twitter) RT @GroqInc: You can now access Groq models directly in VS @code with @huggingface. Just BYOK. 🔑
Pubblicato 12 set
Hugging Face (Twitter) RT @hanouticelina: Starting today, you can use Hugging Face Inference Providers directly in GitHub Copilot Chat on @code! 🔥 which means you can access frontier open-source LLMs like Qwen3-Coder, gpt-oss and GLM-4.5 directly in VS Code, powered by our world-class inference partners - @CerebrasSystems, @Cohere_Labs, @FireworksAI_HQ, @GroqInc, @novita_labs, @togethercompute & more! give it a try today! 🧵👇
Pubblicato 12 set
Hugging Face (Twitter) RT @reach_vb: BOOM! Starting today you can use open source frontier LLMs in @code with HF Inference Providers! 🔥 Use your inference credits on SoTA llms like GLM 4.5, Qwen3 Coder, DeepSeek 3.1 and more All of it packaged in one simple extension - try it out today 🤗
Pubblicato 12 set
Hugging Face (Twitter) RT @reach_vb: You DO NOT want to miss this - All the tricks and optimisations used to make gpt-oss blazingly fast, all of it - in a blogpost (with benchmarks)! 🔥 We cover details ranging from MXFP4 quantisation to, pre-built kernels, Tensor/ Expert Parallelism, Continuous Batching and much more Bonus: We add extensive benchmarks (along with reproducible scripts)! ⚡
Pubblicato 12 set
Hugging Face (Twitter) RT @_akhaliq: Qwen3-Next-80B-A3B is out 80B params, but only 3B activated per token → 10x cheaper training, 10x faster inference than Qwen3-32B.(esp. @ 32K+ context!) Qwen3-Next-80B-A3B-Instruct approaches our 235B flagship. Qwen3-Next-80B-A3B-Thinking outperforms Gemini-2.5-Flash-Thinking both now available in anycoder for vibe coding