TGTGInsightintelligence telegramLIVE / telegram public index
← Hugging Face
Hugging Face avatar

TGINSIGHT POST

Post #1532

@huggingface

Hugging Face

Visualizzazioni23Numero di visualizzazioni
Pubblicato18 ott18/10/2025, 16:35
Contenuto del post

Contenuto

Hugging Face (Twitter) RT @ClementDelangue: I love the diversity of trending open datasets these days. There’s no excuse anymore not to train your own models! - Fineweb and a shuffle of it by @karpathy - Webscale-RL, a large-scale reinforcement learning dataset from @salesforce - SVQ, an audio dataset from @Google - Awesome chatgpt prompts with almost 10,000 likes by @fkadev - A subset of the Math dataset by @DanHendrycks - Nemotron personas by @nvidia - An arabic dataset by @rightnowai_co - A curated dataset of 1.5M+ @github repositories - Toucan-1.5M, the largest fully synthetic tool-agent dataset - A scientific paper dataset from @arxiv - A cybersecurity dataset from @NIST by @ethanolivertroy These are just the current trending amongst over half a million public datasets on @huggingface! hf.co/datasets