Content
Hugging Face (Twitter) RT @ClementDelangue: Granite Docling by @IBM is #3 trending on @huggingface.

This is a multimodal Image-Text-to-Text model engineered for efficient document conversion. It preserves the core features of Docling while maintaining seamless integration with DoclingDocuments to ensure full compatibility. It builds upon the IDEFICS3 architecture but introduces two key modifications: it replaces the vision encoder with siglip2-base-patch16-512 and substitutes the language model with a Granite 165M LLM.

Try out our Granite-Docling-258M demo today.

License: Apache 2.0

Granite-docling-258M is fully integrated into the Docling pipelines, carrying over existing features while adding several new ones, including:

🔢 Enhanced Equation Recognition: More accurate detection and formatting of mathematical formulas
🧩 Flexible Inference Modes: Choose between full-page inference or bbox-guided region inference
🧘 Improved Stability: Tends to avoid...