TGTGInsightintelligence telegramLIVE / telegram public index
Contenuto del post
Contenuto
Hugging Face (Twitter) RT @lvwerra: Public benchmarks lag behind what frontier labs are using internally to test and develop LLMs, yet they are the key driver of progress for LLMs. This needs to change! Excited to work with @SnorkelAI who are investing $3M do build out the evaluation ecosystem with the community.