Post #1736

@huggingface

Hugging Face

Visualizzazioni16Numero di visualizzazioni

Pubblicato19 nov19/11/2025, 03:01

Contenuto del post

Contenuto

Hugging Face (Twitter) RT @RulinShao: 🔥Thrilled to introduce DR Tulu-8B, an open long-form Deep Research model that matches OpenAI DR 💪Yes, just 8B! 🚀 The secret? We present Reinforcement Learning with Evolving Rubrics (RLER) for long-form non-verifiable DR tasks! Our rubrics: - co-evolve with the policy model - are grounded on search knowledge 🧵