ocrmypdf / OCRmyPDF
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
☆20,609 · Updated 2 weeks ago
Alternatives and similar repositories for OCRmyPDF:
Users who are interested in OCRmyPDF compare it with the repositories listed below.
- Convert PDF to markdown + JSON quickly with high accuracy ☆22,647 · Updated this week
- OCR, layout analysis, reading order, table recognition in 90+ languages ☆16,637 · Updated this week
- The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on… ☆30,951 · Updated this week
- The best and simplest free open source web page change detection, website watcher, restock monitor and notification service. Restock Mon… ☆22,552 · Updated this week
- PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents. ☆6,676 · Updated this week
- Open Source Continuous File Synchronization ☆68,180 · Updated this week
- "rsync for cloud storage" - Google Drive, S3, Dropbox, Backblaze B2, One Drive, Swift, Hubic, Wasabi, Google Cloud Storage, Azure Blob, A… ☆49,273 · Updated this week
- User-friendly Desktop Client App for AI Models/LLMs (GPT, Claude, Gemini, Ollama...) ☆33,155 · Updated last week
- Tesseract Open Source OCR Engine (main repository) ☆65,217 · Updated last month
- The OS for your personal finances ☆42,108 · Updated this week
- A community-supported supercharged version of paperless: scan, index and archive all your physical documents ☆25,635 · Updated this week
- Get your documents ready for gen AI ☆23,643 · Updated this week
- 🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and mor… ☆23,392 · Updated last month
- OCR & Document Extraction using vision models ☆10,249 · Updated this week
- #1 Locally hosted web application that allows you to perform various operations on PDF files ☆53,862 · Updated this week
- Port of OpenAI's Whisper model in C/C++ ☆38,362 · Updated this week
- GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use. ☆72,760 · Updated 2 weeks ago
- Convert ebooks to audiobooks with chapters and metadata using dynamic AI models and voice cloning. Supports 1,107+ languages! ☆9,114 · Updated this week
- D2 is a modern diagram scripting language that turns text to diagrams. ☆19,959 · Updated last week
- Windows inside a Docker container. ☆33,517 · Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆41,035 · Updated this week
- The easiest, most secure way to use WireGuard and 2FA. ☆21,433 · Updated this week
- Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and … ☆25,847 · Updated 5 months ago
- Web Extension for saving a faithful copy of a complete web page in a single HTML file ☆17,026 · Updated this week
- Simple bookmark manager built with Go ☆10,017 · Updated this week
- ⬛️ CLI tool and library for saving complete web pages as a single HTML file ☆13,062 · Updated this week
- Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, m… ☆81,099 · Updated this week
- Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections. ☆2,540 · Updated 2 weeks ago
- Self-hosted AI coding assistant ☆30,380 · Updated this week