#python#ai#ocr
Chandra OCR 2 is a top OCR model that turns images and PDFs into structured Markdown, HTML, or JSON, keeping layout, tables, math, handwriting, and 90+ languages accurate—it leads benchmarks like olmOCR (85.9% overall) and multilingual tests (77.8% average). Install easily with `pip install chandra-ocr` for CLI use, local HuggingFace, or fast vLLM server; try the free playground first. You benefit by quickly digitizing complex docs with high precision, saving time on extraction and enabling easy editing or analysis without manual fixes.
https://github.com/datalab-to/chandra
Image to Text OCR is a utility website made by Alejandro Akbal for extracting text from any image using #OCR.
This tool was made for those moments where you take a photo of some text and wish you could have it digitally.
https://github.com/AlejandroAkbal/Image-to-Text-OCR
Online: https://image-to-text-ocr.netlify.app/
https://github.com/tesseract-ocr/tesseract
This package contains an #OCR (Optical character recognition) engine - libtesseract and a command line program - tesseract.
The lead developer is Ray Smith. The maintainer is Zdenko Podobny. For a list of contributors see AUTHORS and github's log of contributors.
#Tesseract has unicode (UTF-8) support, and can recognize more than 100 languages "out of the box". It can be trained to recognize other languages. See Tesseract Training for more information.
Tesseract supports various output formats: plain-text, hocr(html), pdf.
This project does not include a GUI application. If you need one, please see the 3rdParty wiki page.
You should note that in many cases, in order to get better OCR results, you'll need to improve the quality of the image you are giving Tesseract.
Maid - Mobile Artificial Intelligence Distribution
Maid is a cross-platform free and an open-source application for interfacing with llama.cpp models locally, and remotely with Ollama, Mistral, Google Gemini and OpenAI models remotely.
-Choose from A wide range of models that runs LOCALLY and access remote models via api key!
-Text based output
-Image Generation (Selected Models only)
-No video or short clips generation yet
-Voice generation on selected models (Not tested)
-Setting model parameters
-Setting system prompt (Making the model behave/generate output in a certain way).
-And more.
Get it on
Github - https://github.com/Mobile-Artificial-Intelligence/maid/releases/latest
Fdroid - https://f-droid.org/packages/com.danemadsen.maid/
Spystore - https://play.google.com/store/apps/details?id=com.danemadsen.maid
*Don't clear CACHE OF THE APP AND EXCLUDE IT FROM SYSTEM'S AUTO CACHE CLEANING as app stores everything in device cache*
Follow @nogoolag and @libreware for more
#ai
Cherry Studio
Cherry Studio is a desktop client for Windows, Mac and Linux, which supports many LLM providers, including large cloud services and local models.
Among its main functions is the ability to work with more than 300 pre -designed #AI assistants, the creation of custom assistants, as well as support for various formats of documents, including text, images and office files.
The application offers tools for global search, top management and translating, which significantly improves interaction with the user thanks to the cross -platform and many settings options.
https://github.com/cherryhq/cherry-studio
LibreChat AI
Open-source platform that allows users to chat and interact with various #AI models through a unified interface. You can use OpenAI, Gemini, Anthropic and other AI models using their API. You may also use Ollama as an endpoint and use LibreChat to interact with local LLMs. It can be installed locally or deployed on a server.
LibreChat is designed to be highly customizable and supports a wide range of AI providers and services. Let me summarize its main features:
Free and Open Source: Accessible to everyone without any costs.
Customization: Offers extensive options to tailor the platform to individual preferences.
Multi-AI Support: Integrates with numerous AI models and services.
Unified Interface: Provides a consistent experience for interacting with different AI models.
https://www.librechat.ai
https://itsfoss.com/librechat-linux/
Jan.ai
https://jan.ai
A platform that enables you to run self-hosted local #AI. Jan provides an OpenAI-equivalent API server at localhost:1337 that can be used as a drop-in replacement with compatible apps.
With Jan, you can:
-Run open-source LLMs locally or connect to cloud AIs like ChatGPT or Google.
-Search the web and databases.
Integrate AI with everyday tools to work on your behalf (with permission).
-Customize and add features with Extensions.
Jan is opinionated software about what AI should be.