Post #1626

@huggingface

Hugging Face

Visualizzazioni13Numero di visualizzazioni

Pubblicato30 ott30/10/2025, 18:15

Contenuto del post

Contenuto

Hugging Face (Twitter) RT @_lewtun: We've just published the Smol Training Playbook: a distillation of hard earned knowledge to share exactly what it takes to train SOTA LLMs ⚡️ Featuring our protagonist SmolLM3, we cover: 🧭 Strategy on whether to train your own LLM and burn all your VC money 🪨 Pretraining, aka turning a mountain of text into a fancy auto-completer 🗿How to sculpt base models with post-training alchemy 🛠️ The underlying infra and how to debug your way out of NCCL purgatory Highlights from the post-training chapter in the thread 👇