TGTGInsightintelligence telegramLIVE / telegram public index
Contenuto del post
Contenuto
Hugging Face (Twitter) RT @Baidu_Inc: Qianfan-VL, Baidu AI Cloud's vision-language model series, is now open source! Designed for enterprise-level applications, these multimodal models combine robust general capabilities with advanced performance in OCR and math problem-solving. Key features: > Three model sizes (3B, 8B, 70B) with 32K context length for diverse needs > Chain-of-thought reasoning in 8B/70B for strong performance in chart understanding, math, and visual logic > Four-stage progressive training pipeline for improved cross-modal alignment and domain enhancement > High-precision data synthesis pipeline across documents, math, charts, tables, formulas, and OCR tasks Discover more about Qianfan-VL ↓