Post #2306

@huggingface

Hugging Face

Visualizzazioni143Numero di visualizzazioni

Pubblicato15 feb15/02/2026, 11:34

Contenuto del post

Contenuto

Hugging Face (Twitter) RT @_lewtun: We trained a tiny 4B model to reason for millions of tokens through IMO-level problems. Heaps excited to share our new blog post covering the full pipeline, from distilling the 🐳 to augmenting RL with a reasoning cache that unlocks extreme inference-time scaling for theorem proving. https://huggingface.co/spaces/lm-provers/qed-nano-blogpost