Find similar content

@libreware · Post #1171 · 09/01/2023, 01:00 PM

Image to Text OCR is a utility website made by Alejandro Akbal for extracting text from any image using #OCR. This tool was made for those moments where you take a photo of some text and wish you could have it digitally. https://github.com/AlejandroAkbal/Image-to-Text-OCR Online: https://image-to-text-ocr.netlify.app/

Hashtags

#ocr

JJ.ai (NFA)🪽

@jsmjsmxyz · Post #1108 · 12/22/2020, 10:01 AM

#OCR#Tools Newlearner 的 OCR 使用分享（离线篇） 🔌Offline OCR🔌 离线的 OCR 工具主要依赖离线库，处理精度上可能比不上在线接口，但优点是可以进行大批量的 OCR 工作，且处理速度较快。 🔍OwlOCR - 支持对 PDF, PNG, JPEG, GIF 文件进行 OCR - 支持在 iOS 设备上拍照，OwlOCR 上立即进行 OCR 处理 - 离线 OCR 多语言支持，包括简体中文和繁体中文，但 - 免费版保留了大部分功能，付费版可以提高 OCR 处理速度 🔍TextSniper - 小巧轻量，使用方便 - 支持 OCR 结果叠加至剪切板 - 离线多语言支持 - 买断制 app，包含在 Setapp 订阅中 👀 以上提到的几款 OCR 工具都是在 Win/Mac 端使用的，至于移动端我比较推荐的是「白描」。我对 OCR 识别精度要求不高，因此使用的是 Bob 的免费接口；OCRmyPDF则是我扫描大型 PDF 文档时采取的方案。 🎗「天若 OCR」与「白描」即将迎来优惠促销活动，有需要的朋友们可以考虑入手。 📘 关联阅读： 1⃣️OCRmyPDF·给你的PDF文档添加文字层 2⃣️alfred-ocr：macOS 上的多接口 Alfred OCR / 翻译插件频道：@NewlearnerChannel

Hashtags

#ocr #tools

JJ.ai (NFA)🪽

@jsmjsmxyz · Post #1107 · 12/22/2020, 07:13 AM

#OCR#Tools Newlearner 的 OCR 使用分享（在线篇）通常在图片、PDF文档中提取文字，我们都会使用 OCR(Optical Character Recognition) 技术，今天就和大家分享一下几款比较优秀的 OCR 工具 ☁️Online OCR ☁️ 在线 OCR 大多是调用云 OCR 引擎进行处理，对得到的结果进行优化后再输出，所以精确度、还原度会更高。因为大多数 OCR 接口都需要付费，所以有一定的使用成本。 🔍iText - 使用 Google & 百度 & 腾讯 OCR 接口，识别精准度高 - 独创算法，优化识别结果 - 支持识别后翻译 - 每月免费体验20次，Pro 版支持月/年付订阅 🔍天若OCR - 一款 Windows 平台上的 OCR 工具 - 支持表格识别、竖排识别、LaTex 公式识别、翻译功能 - 支持自定义文本接口 - 提供免费版与付费版，付费版采取买断制 🔍Bob - 本质是一款翻译工具，但其附带的 OCR 功能可以满足日常使用 - 支持自定义文本接口，默认使用百度智能云 OCR 接口 - 半开源，免费 - Bob 的作者十分贴心，在使用文档中给出了各大 OCR 接口（百度、腾讯、搜狗、有道）的申请方式：教程地址频道：@NewlearnerChannel

Hashtags

#ocr #tools

djangoproject

@djangoproject · Post #245 · 01/28/2017, 01:04 PM

https://github.com/tesseract-ocr/tesseract This package contains an #OCR (Optical character recognition) engine - libtesseract and a command line program - tesseract. The lead developer is Ray Smith. The maintainer is Zdenko Podobny. For a list of contributors see AUTHORS and github's log of contributors. #Tesseract has unicode (UTF-8) support, and can recognize more than 100 languages "out of the box". It can be trained to recognize other languages. See Tesseract Training for more information. Tesseract supports various output formats: plain-text, hocr(html), pdf. This project does not include a GUI application. If you need one, please see the 3rdParty wiki page. You should note that in many cases, in order to get better OCR results, you'll need to improve the quality of the image you are giving Tesseract.

Hashtags

#ocr #tesseract

@libreware · Post #1484 · 09/05/2025, 10:34 AM

Maid - Mobile Artificial Intelligence Distribution Maid is a cross-platform free and an open-source application for interfacing with llama.cpp models locally, and remotely with Ollama, Mistral, Google Gemini and OpenAI models remotely. -Choose from A wide range of models that runs LOCALLY and access remote models via api key! -Text based output -Image Generation (Selected Models only) -No video or short clips generation yet -Voice generation on selected models (Not tested) -Setting model parameters -Setting system prompt (Making the model behave/generate output in a certain way). -And more. Get it on Github - https://github.com/Mobile-Artificial-Intelligence/maid/releases/latest Fdroid - https://f-droid.org/packages/com.danemadsen.maid/ Spystore - https://play.google.com/store/apps/details?id=com.danemadsen.maid *Don't clear CACHE OF THE APP AND EXCLUDE IT FROM SYSTEM'S AUTO CACHE CLEANING as app stores everything in device cache* Follow @nogoolag and @libreware for more #ai

Hashtags

@libreware · Post #1396 · 01/31/2025, 04:51 PM

Cherry Studio Cherry Studio is a desktop client for Windows, Mac and Linux, which supports many LLM providers, including large cloud services and local models. Among its main functions is the ability to work with more than 300 pre -designed #AI assistants, the creation of custom assistants, as well as support for various formats of documents, including text, images and office files. The application offers tools for global search, top management and translating, which significantly improves interaction with the user thanks to the cross -platform and many settings options. https://github.com/cherryhq/cherry-studio

Hashtags

@libreware · Post #1307 · 07/16/2024, 01:26 PM

LibreChat AI Open-source platform that allows users to chat and interact with various #AI models through a unified interface. You can use OpenAI, Gemini, Anthropic and other AI models using their API. You may also use Ollama as an endpoint and use LibreChat to interact with local LLMs. It can be installed locally or deployed on a server. LibreChat is designed to be highly customizable and supports a wide range of AI providers and services. Let me summarize its main features: Free and Open Source: Accessible to everyone without any costs. Customization: Offers extensive options to tailor the platform to individual preferences. Multi-AI Support: Integrates with numerous AI models and services. Unified Interface: Provides a consistent experience for interacting with different AI models. https://www.librechat.ai https://itsfoss.com/librechat-linux/

Hashtags

@libreware · Post #1280 · 04/09/2024, 12:34 PM

Jan.ai https://jan.ai A platform that enables you to run self-hosted local #AI. Jan provides an OpenAI-equivalent API server at localhost:1337 that can be used as a drop-in replacement with compatible apps. With Jan, you can: -Run open-source LLMs locally or connect to cloud AIs like ChatGPT or Google. -Search the web and databases. Integrate AI with everyday tools to work on your behalf (with permission). -Customize and add features with Extensions. Jan is opinionated software about what AI should be.

Hashtags

JJ.ai (NFA)🪽

@jsmjsmxyz · Post #995 · 04/09/2020, 08:07 AM

#Github情报#OCR OCRmyPDF 给你的PDF文档添加文字层 Github | WiKi OCRmyPDF将OCR文本层添加到扫描的PDF文件中，从而可以对其进行搜索或复制粘贴。 ✨ 特点 - 使用强大的开源 Tesseract OCR引擎识别，支持100多种语言 - 调用全部可用CPU资源进行OCR（耗电警告⚠️ - 从常规PDF生成可搜索的PDF文件 - 优化PDF尺寸，生成比输入文件小的文件 - 在执行OCR之前对图像进行歪斜校正和/或清洁 🔍部署 - 支持多种操作系统 Linux, Win, macOS … - 支持 brew install ocrmypdf 但需要自己安装语言库 - macOS 一键安装脚本（努力更新中 - 可配合 Alfred / Launchabr 制作成 Workflow 使用 👀 没有文字层的PDF文献/文档真的难受，OCRmyPDF的扫描精准度虽然说不是特别高，但有了文字层，我们就可以方便的在文档里做标注了～频道：@NewlearnerChannel

Hashtags

#github情报 #ocr

DPS Build

@dps_build · Post #83 · 03/17/2023, 07:24 AM

朱老师用一系列 AI 工具创作了一本童书，总共花了二十小时。当然，他也坦言，因为自己是设计师，所以懂排版；因为之前出过书，所以了解整个出版流程。如果没有这些经验，恐怕远远不止二十小时。他用到的工具： ChatGPT3.5, New Bing, Midjourney V4, Figma, Blurb. https://www.douban.com/note/846359765/ #ai

Hashtags