OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
☆33,889Jun 12, 2026Updated this week
Alternatives and similar repositories for OCRmyPDF
Users that are interested in OCRmyPDF are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- #1 PDF Application on GitHub that lets you edit PDFs on any device anywhere☆80,957Updated this week
- Transforms complex documents like PDFs and Office docs into LLM-ready markdown/JSON for your Agentic workflows.☆67,596Updated this week
- Toolkit for linearizing PDFs for LLM datasets/training☆17,387Mar 25, 2026Updated 2 months ago
- Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/…☆82,075Jun 12, 2026Updated last week
- OCR, layout analysis, reading order, table recognition in 90+ languages☆20,840Updated this week
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Tesseract Open Source OCR Engine (main repository)☆74,774Updated this week
- Convert PDF to markdown + JSON quickly with high accuracy☆36,101Jun 6, 2026Updated last week
- Python tool for converting files and office documents to Markdown.☆152,866May 26, 2026Updated 3 weeks ago
- Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and …☆29,615Dec 5, 2025Updated 6 months ago
- An open-source remote desktop application designed for self-hosting, as an alternative to TeamViewer.☆116,088Jun 11, 2026Updated last week
- OCR software, free and offline. 开源、免费的离线OCR软件。支持截屏/批量导入图片,PDF文档识别,排除水印/页眉页脚,扫描/生成二维码。内置多国语言库。☆45,221Nov 20, 2025Updated 6 months ago
- A community-supported supercharged document management system: scan, index and archive all your documents☆42,111Updated this week
- Production-ready platform for agentic workflow development.☆145,133Updated this week
- Get up and running with Kimi-K2.6, GLM-5.1, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.☆173,937Jun 12, 2026Updated last week
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- The API to search, scrape, and interact with the web at scale. 🔥☆132,865Updated this week
- Stop renting your intelligence. Own it with AnythingLLM. Everything you need for a powerful local-first agent experience☆61,557Updated this week
- 🌐 Make websites accessible for AI agents. Automate tasks online with ease.☆99,362Updated this week
- OCR & Document Extraction using vision models☆12,238May 20, 2025Updated last year
- Fair-code workflow automation platform with native AI capabilities. Combine visual building with custom code, self-host or cloud, 400+ in…☆192,138Jun 12, 2026Updated last week
- [EMNLP 2025 Demo] PDF scientific paper translation with preserved formats - 基于 AI 完整保留排版的 PDF 文档全文双语翻译,支持 Google/DeepL/Ollama/OpenAI 等服务,…☆34,882May 25, 2026Updated 3 weeks ago
- RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to creat…☆82,621Updated this week
- 🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN☆68,704Jun 4, 2026Updated 2 weeks ago
- 🤯 LobeHub is your Chief Agent Operator, organizing your agents into 7×24 operations by hiring, scheduling, and reporting on your entire …☆78,678Updated this week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Get your documents ready for gen AI☆61,672Updated this week
- Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. …☆35,119Mar 26, 2026Updated 2 months ago
- User-friendly AI Interface (Supports Ollama, OpenAI API, ...)☆141,711Updated this week
- An open-source cross-platform alternative to AirDrop☆83,531Jun 5, 2026Updated 2 weeks ago
- A feature-rich command-line audio/video downloader☆170,676Updated this week
- Robust Speech Recognition via Large-Scale Weak Supervision☆102,585Apr 15, 2026Updated 2 months ago
- 🔥 The open-source no-code platform for web scraping, crawling, search and AI data extraction • Turn websites into structured APIs in min…☆15,866Jun 11, 2026Updated last week
- There can be more than Notion and Miro. AFFiNE(pronounced [ə‘fain]) is a next-gen knowledge base that brings planning, sorting and creati…☆69,478Updated this week
- A privacy-first, self-hosted, fully open source personal knowledge management software, written in typescript and golang.☆44,488Updated this week
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Free, simple, and intuitive online database diagram editor and SQL generator.☆37,400Updated this week
- An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.☆28,370Sep 30, 2025Updated 8 months ago
- real time face swap and one-click video deepfake with only a single image☆93,898Updated this week
- Comfortably monitor your Internet traffic 🕵️♂️☆39,338Updated this week
- Open Source Continuous File Synchronization☆85,419Updated this week
- A simple screen parsing tool towards pure vision based GUI agent☆24,915Apr 13, 2026Updated 2 months ago
- Open-source, self-hosted note-taking tool built for quick capture. Markdown-native, lightweight, and fully yours.☆60,888Updated this week