ocrmypdf / OCRmyPDFLinks
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
☆29,669Updated this week
Alternatives and similar repositories for OCRmyPDF
Users that are interested in OCRmyPDF are comparing it to the libraries listed below
Sorting:
- OCR, layout analysis, reading order, table recognition in 90+ languages☆17,751Updated this week
- A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。☆38,345Updated this week
- Toolkit for linearizing PDFs for LLM datasets/training☆13,124Updated this week
- Convert PDF to markdown + JSON quickly with high accuracy☆26,360Updated this week
- Open Source Continuous File Synchronization☆73,399Updated this week
- OCR & Document Extraction using vision models☆11,514Updated last month
- An open-source remote desktop application designed for self-hosting, as an alternative to TeamViewer.☆92,034Updated this week
- 🔥 No Code Web Data Extraction Platform • Open-Source Alternative To Octoparse, ParseHub 🔥☆13,183Updated this week
- Python tool for converting files and office documents to Markdown.☆59,876Updated last month
- 🍒 Cherry Studio is a desktop client that supports for multiple LLM providers.☆29,658Updated this week
- PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.☆7,523Updated this week
- Comfortably monitor your Internet traffic 🕵️♂️☆25,578Updated this week
- #1 Locally hosted web application that allows you to perform various operations on PDF files☆63,046Updated this week
- A community-supported supercharged document management system: scan, index and archive all your documents☆28,963Updated this week
- Tesseract Open Source OCR Engine (main repository)☆67,926Updated last month
- Awesome multilingual OCR and Document Parsing toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languag…☆51,249Updated this week
- Integrate the DeepSeek API into popular softwares☆33,114Updated last month
- Fair-code workflow automation platform with native AI capabilities. Combine visual building with custom code, self-host or cloud, 400+ in…☆114,863Updated this week
- A video translation and dubbing tool powered by LLMs, offering professional-grade translations and one-click full-process deployment. It…☆8,016Updated this week
- User-friendly AI Interface (Supports Ollama, OpenAI API, ...)☆101,564Updated this week
- PDF scientific paper translation with preserved formats - 基于 AI 完整保留排版的 PDF 文档全文双语翻译,支持 Google/DeepL/Ollama/OpenAI 等服务,提供 CLI/GUI/MCP/Doc…☆25,533Updated last week
- Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your p…☆47,109Updated this week
- Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI☆23,015Updated this week
- Elegant reading of real-time and hottest news☆11,824Updated last week
- Web Extension for saving a faithful copy of a complete web page in a single HTML file☆18,608Updated 2 months ago
- User-friendly Desktop Client App for AI Models/LLMs (GPT, Claude, Gemini, Ollama...)☆35,668Updated last week
- Yet Another Document Translator☆4,503Updated this week
- Open source Python library for converting PDF to DOCX.☆3,009Updated last month
- The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, MCP compatibility, and more.☆46,116Updated this week
- AI-Powered Photos App for the Decentralized Web 🌈💎✨☆37,833Updated this week