Post #1375

@huggingface

Hugging Face

Views: 33

Published: 22 Sep 2025, 15:54

Post content

Hugging Face (Twitter)

RT @Baidu_Inc: Qianfan-VL, Baidu AI Cloud's vision-language model series, is now open source! Designed for enterprise-level applications, these multimodal models combine robust general capabilities with advanced performance in OCR and math problem-solving.

Key features:
> Three model sizes (3B, 8B, 70B) with 32K context length for diverse needs
> Chain-of-thought reasoning in 8B/70B for strong performance in chart understanding, math, and visual logic
> Four-stage progressive training pipeline for improved cross-modal alignment and domain enhancement
> High-precision data synthesis pipeline across documents, math, charts, tables, formulas, and OCR tasks

Discover more about Qianfan-VL ↓