ocrmypdf / OCRmyPDFLinks
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
☆32,471Updated last week
Alternatives and similar repositories for OCRmyPDF
Users that are interested in OCRmyPDF are comparing it to the libraries listed below
Sorting:
- Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.☆53,776Updated last week
- OCR, layout analysis, reading order, table recognition in 90+ languages☆19,228Updated this week
- OCR & Document Extraction using vision models☆12,070Updated 8 months ago
- PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.☆9,019Updated this week
- The ultimate space for work and life — to find, build, and collaborate with agent teammates that grow with you. We are taking agent harne…☆71,913Updated this week
- Access your entire server infrastructure from your local desktop☆13,735Updated this week
- Convert PDF to markdown + JSON quickly with high accuracy☆31,421Updated last week
- 所有小初高、大学PDF教材。☆64,834Updated 3 months ago
- Toolkit for linearizing PDFs for LLM datasets/training☆16,860Updated this week
- [EMNLP 2025 Demo] PDF scientific paper translation with preserved formats - 基于 AI 完整保留排版的 PDF 文档全文双语翻译,支持 Google/DeepL/Ollama/OpenAI 等服务,…☆31,695Updated 2 months ago
- 🤱🏻 Turn any webpage into a desktop app with one command.☆45,580Updated last week
- An open-source remote desktop application designed for self-hosting, as an alternative to TeamViewer.☆107,101Updated this week
- best way to save what you love☆38,448Updated 2 weeks ago
- ✨ Turn websites into structured APIs & clean data pipelines in minutes ✨☆14,208Updated this week
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model☆8,073Updated 11 months ago
- Powerful AI Client☆38,420Updated 3 weeks ago
- A simple screen parsing tool towards pure vision based GUI agent☆24,344Updated 4 months ago
- A privacy-first, self-hosted, fully open source personal knowledge management software, written in typescript and golang.☆41,122Updated this week
- #1 PDF Application on GitHub that lets you edit PDFs on any device anywhere☆73,727Updated this week
- 🧡 Folo is the AI Reader☆36,897Updated last week
- A self-hosted dashboard that puts all your feeds in one place☆31,733Updated last month
- SOTA Open Source TTS☆24,782Updated last week
- Tesseract Open Source OCR Engine (main repository)☆72,268Updated last month
- The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infra☆27,325Updated 3 weeks ago
- AI Agent + Coding Agent + 300+ assistants: agentic AI desktop with autonomous coding, intelligent automation, and unified access to front…☆39,323Updated this week
- Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/…☆70,442Updated this week
- Best and simplest tool for website change detection, web page monitoring, and website change alerts. Perfect for tracking content changes…☆30,168Updated this week
- 🆙 Upscayl - #1 Free and Open Source AI Image Upscaler for Linux, MacOS and Windows.☆42,931Updated 3 weeks ago
- pix2tex: Using a ViT to convert images of equations into LaTeX code.☆16,164Updated last year
- Animation engine for explanatory math videos☆84,253Updated 3 months ago