Post recenti
Pag. 44 di 85 · 1,011 post
Pubblicato 20 nov
Hugging Face (Twitter) RT @kylelostat: we released Olmo 3! lot of exciting stuff but wanna focus on: 🐟Olmo 3 32B Base, the best fully-open base model to-date, near Qwen 2.5 & Gemma 3 on diverse evals 🐠Olmo 3 32B Think, first fully-open reasoning model approaching Qwen 3 levels 🐡12 training datasets corresp to different staged training recipes, all open & accessible since I'm a pretraining person, I'll share some of my fav Base model ideas:
Pubblicato 20 nov
Hugging Face (Twitter) RT @natolambert: We present Olmo 3, our next family of fully open, leading language models. This family of 7B and 32B models represents: 1. The best 32B base model. 2. The best 7B Western thinking & instruct models. 3. The first 32B (or larger) fully open reasoning model. This is a big milestone for Ai2 and the Olmo project. These aren’t huge models (more on that later), but it’s crucial for the viability of fully open-source models that they are competitive on performance – not just replications of models that came out 6 to 12 months ago. As always, all of our models come with full training data, code, intermediate checkpoints, training logs, and a detailed technical report. All are available today, with some more additions coming before the end of the year. As with OLMo 2 32B at its release, OLMo 3 32B is the best open-source language model ever released. It’s an awesome privilege to get to provide these models to the broader community researching... Перейти на оригинальный пост
Pubblicato 20 nov
Hugging Face (Twitter) RT @askOkara: this is all you need to run the best open-source models
Pubblicato 20 nov
Hugging Face (Twitter) RT @multimodalart: PRO for PROs Nano Banana PRO is available at no cost for @huggingface PRO subscribers on Spaces, go bananas 🍌
Pubblicato 20 nov
Hugging Face (Twitter) RT @ClementDelangue: Received mine last week. Expect ~90 days for shipping if you’re ordering now given the volume of orders we got. I know it’s long but it’s always going to be first come for serve though so earlier you order, faster it will come! https://twitter.com/CjTruheart/status/1991538265632244214#m
Pubblicato 20 nov
Hugging Face (Twitter) BOOM! Olmo 3 has just landed, join us in this livestream to learn more about the release 🤗💗https://twitter.com/allen_ai/status/1991525367887327407#m
Pubblicato 20 nov
Hugging Face (Twitter) RT @andimarafioti: Just dropped: 🎉 NVIDIA Nemotron-Parse v1.1 Next-gen OCR for parsing PDFs & PPTs into structured, machine-ready output (text + bounding boxes + semantic classes). Ready for commercial use and to generate datasets🚀 Check the examples on Hugging face! https://huggingface.co/nvidia/NVIDIA-Nemotron-Parse-v1.1
Pubblicato 20 nov
Hugging Face (Twitter) RT @eliebakouch: happy olmo day for those who celebrate!!! https://twitter.com/allen_ai/status/1991507983881379896#m
Pubblicato 20 nov
Hugging Face (Twitter) RT @allen_ai: Announcing Olmo 3, a leading fully open LM suite built for reasoning, chat, & tool use, and an open model flow—not just the final weights, but the entire training journey. Best fully open 32B reasoning model & best 32B base model. 🧵
Pubblicato 20 nov
Hugging Face (Twitter) RT @ClementDelangue: My interview with @danprimack for the @axios dealmaker summit yesterday 0:44 We are in an LLM bubble 🫧🫧 2:13 Does it create risks for AI in general 2:57 Why is open-source important? 4:33 Why haven't we raised recently? 6:45 Is China winning not only in open-source but also hardware? 11:57 In 3 years, most of the cloud revenue will be AI-related 12:41 If I was in charge of US policy, what would I do? 13:31 will we see the first major protest against AI in 2026 17:57 One prediction for 2026 🌶️🌶️🌶️ 18:40 Why we picked the 🤗 emoji (and what other terrible names @julien_c wanted to pick) It was fun, we should do a long-form interview in Miami @danprimack!
Pubblicato 20 nov
Hugging Face (Twitter) RT @ClementDelangue: New research from @MIT! The following is going to change in my opinion as more people and companies realize the advantages of open models: "Closed models dominate, with on average 80% of monthly LLM tokens using closed models despite much higher prices - on average 6x the price of open models - and only modest performance advantages. Frontier open models typically reach performance parity with frontier closed models within months, suggesting relatively fast convergence. Nevertheless, users continue to select closed models even when open alternatives are cheaper and offer superior performance. This systematic underutilization is economically significant: reallocating demand from observably dominated closed models to superior open models would reduce average prices by over 70% and, when extrapolated to the total market, generate an estimated $24.8 billion in additional consumer savings across 2025. These results suggest that closed... Перейти на оригинальный пост
Pubblicato 20 nov
Hugging Face (Twitter) RT @MaziyarPanahi: wow! this tiny 1.5B model is now trending #1 on @huggingface! 😱https://twitter.com/MaziyarPanahi/status/1990724701488824552#m