TGTGInsighttelegram intelligenceLIVE / telegram public index
Back to channels
GitHub Trends avatar

TGINSIGHT CHAT

GitHub Trends

@githubtrending

Technologies

See what the GitHub community is most excited about today. A bot automatically fetches new repositories from https://github.com/trending and sends them to the channel. Author and maintainer: https://github.com/katursis

Subscribers1.0万Current channel subscribers
Tracked posts1,000Indexed post count
Recent reach3,917Sum of recent post views
Recent posts

Recent posts

Tag: #ocr · 7 posts

当前筛选 #ocr清除筛选

Posted Apr 2

#python#glm#image2text#ocr GLM-OCR is a top 0.9B-parameter model for accurate OCR on complex documents like tables, code, formulas, seals, and receipts, scoring 94.62 on OmniDocBench V1.5. Install via `pip install glmocr`, use cloud API (no GPU needed) or self-host with vLLM/SGLang for fast, low-cost inference, and get JSON/Markdown outputs easily via CLI or Python. You benefit from quick, robust document parsing that saves time, cuts compute costs, and integrates simply into your apps for real-world tasks. https://github.com/zai-org/GLM-OCR

643 views

Posted Mar 26

#python#ai#ocr Chandra OCR 2 is a top OCR model that turns images and PDFs into structured Markdown, HTML, or JSON, keeping layout, tables, math, handwriting, and 90+ languages accurate—it leads benchmarks like olmOCR (85.9% overall) and multilingual tests (77.8% average). Install easily with `pip install chandra-ocr` for CLI use, local HuggingFace, or fast vLLM server; try the free playground first. You benefit by quickly digitizing complex docs with high precision, saving time on extraction and enabling easy editing or analysis without manual fixes. https://github.com/datalab-to/chandra

695 views

Posted Mar 19

#java#a11y#accessibility#ai#bounding_box#document_parsing#eaa#html#json#markdown#ocr#ocr_recognition#pdf#pdf_accessibility#pdf_converter#pdf_extraction#pdf_parser#pdf_ua#rag#tables#tagged_pdf OpenDataLoader PDF is a free, open-source tool (Apache 2.0) that tops benchmarks with 0.90 accuracy for extracting structured data like Markdown, JSON (with bounding boxes), and HTML from any PDF—digital, scanned, or complex with tables, formulas, charts, and OCR in 80+ languages. It runs locally on CPU (0.05s/page fast mode), filters AI prompt injections for safety, integrates with LangChain/RAG, and automates accessibility tagging to Tagged PDF. You save time and costs on parsing for AI pipelines or compliance (vs. $50–200/manual doc), getting precise, private results for better LLM apps and legal standards. https://github.com/opendataloader-project/opendataloader-pdf

639 views

Posted Feb 24

#rust#ai#ai_ocr#attention_mechanism#gnn#gnn_model#gnns#graph#graph_neural_networks#llm_inference#low_latency#mincut#neo4j#ocr#onnx#rust#vector#wasm RuVector is a free, open-source vector database that gets smarter with every query. Unlike static databases, it learns from usage via GNN layers, runs LLMs locally with no cloud costs, supports graph queries like Neo4j, scales freely across nodes, and deploys as a single self-booting file (125ms startup). Run with `npx ruvector`. You benefit from faster, more accurate AI search that improves automatically, zero operating costs, full offline/privacy control, and easy scaling—perfect for RAG, agents, or edge apps without vendor lock-in. https://github.com/ruvnet/ruvector

636 views

Posted Sep 24

#python#document_analysis#layout_analysis#ocr#parser#pdf#pdf_converter#pdf_parser#python#vlm_ocr Dolphin is a smart AI tool that can analyze and understand complex document images, like pages with text, tables, formulas, and pictures. It works in two steps: first, it figures out the layout and reading order of the page; then, it quickly parses each element using special prompts. This makes it fast and accurate for turning document images into structured data like JSON or Markdown. You can use pre-trained models and easy code to process single pages, PDFs, or specific elements. This helps you save time and effort when extracting information from complicated documents efficiently. https://github.com/bytedance/Dolphin

466 views

Posted Jun 28

#javascript#linux#macos#ocr#pot#pot_app#recognize#tauri#translate#translation#tts#windows Pot is a cross-platform translation tool that lets you quickly translate text by selecting it and using a shortcut, typing text to translate, or using OCR to translate text from screenshots. It supports many translation engines like OpenAI, Google, DeepL, and more, plus offline options. You can also add plugins to extend its features and use it on Windows, macOS, and Linux. Pot offers an API for integration with other software and works well even on Wayland systems. This makes translating easier, faster, and more flexible, helping you understand and work with multiple languages efficiently. https://github.com/pot-app/pot-desktop

343 views

Posted Jun 12

#typescript#anki#chatgpt#deepseek#electron#evernote#knowledge_base#local_first#markdown#note_taking#notes_app#notion#obsidian#ocr#ollama#openai#pdf#s3#self_hosted#webdav SiYuan is a privacy-first personal knowledge management tool. It allows you to organize your thoughts and notes in a secure way, even offline. You can use features like block-level references, Markdown editing, and mathematical formulas. It also supports AI tools and has apps for Android, iOS, and HarmonyOS. SiYuan is open source and free for most features, making it a great choice for managing your personal knowledge securely. https://github.com/siyuan-note/siyuan

495 views