Contenuto
Hugging Face (Twitter) RT @HuggingPapers: Here's your recap of the hottest AI papers on @huggingface for September 1-7! This week, we dive into LLM comprehension, hallucination, robotics, and more: - Drivel-ology: Challenging LLMs with Interpreting Nonsense with Depth - From Editor to Dense Geometry Estimator - Open Data Synthesis For Deep Research (mentioning @Google Gemini) - Towards a Unified View of Large Language Model Post-Training - ELV-Halluc: Benchmarking Semantic Aggregation Hallucinations in Long Video Understanding - Why Language Models Hallucinate - Robix: A Unified Model for Robot Interaction, Reasoning and Planning (outperforming @OpenAI GPT-4o & @Google Gemini 2.5 Pro) - DeepResearch Arena: The First Exam of LLMs' Research Abilities