ocrmypdf / OCRmyPDFLinks
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
☆31,541Updated 2 weeks ago
Alternatives and similar repositories for OCRmyPDF
Users that are interested in OCRmyPDF are comparing it to the libraries listed below
Sorting:
- Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/…☆61,694Updated this week
- Open source Python library for converting PDF to DOCX.☆3,142Updated 5 months ago
- Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.☆47,318Updated last week
- best way to save what you love☆36,992Updated 2 weeks ago
- #1 Locally hosted web application that allows you to perform various operations on PDF files☆69,030Updated last week
- 🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and mor…☆25,282Updated 5 months ago
- Convert PDF to markdown + JSON quickly with high accuracy☆29,367Updated last week
- Netflix-level subtitle cutting, translation, alignment, and even dubbing - one-click fully automated AI video subtitle team | Netflix级字幕切…☆15,163Updated 5 months ago
- Toolkit for linearizing PDFs for LLM datasets/training☆14,689Updated this week
- OCR & Document Extraction using vision models☆11,895Updated 5 months ago
- A community-supported supercharged document management system: scan, index and archive all your documents☆33,834Updated this week
- OCR, layout analysis, reading order, table recognition in 90+ languages☆18,764Updated last week
- Elegant reading of real-time and hottest news☆13,397Updated last month
- [EMNLP 2025 Demo] PDF scientific paper translation with preserved formats - 基于 AI 完整保留排版的 PDF 文档全文双语翻译,支持 Google/DeepL/Ollama/OpenAI 等服务,…☆29,242Updated last week
- Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and …☆28,232Updated last year
- 🍒 Cherry Studio is a desktop client that supports for multiple LLM providers.☆34,620Updated this week
- An open-source cross-platform alternative to AirDrop☆69,477Updated this week
- NocoBase is the most extensible AI-powered no-code/low-code platform for building business applications and enterprise solutions.☆17,027Updated this week
- OCR software, free and offline. 开源、免费的离线OCR软件。支持截屏/批量导入图片,PDF文档识别,排除水印/页眉页脚,扫描/生成二维码。内置多国语言库。☆38,976Updated 4 months ago
- An open-source remote desktop application designed for self-hosting, as an alternative to TeamViewer.☆101,523Updated this week
- Jan is an open source alternative to ChatGPT that runs 100% offline on your computer.☆38,350Updated this week
- A new bootable USB solution.☆71,442Updated 2 months ago
- SOTA Open Source TTS☆23,558Updated last week
- Integrate the DeepSeek API into popular softwares☆34,210Updated last month
- There can be more than Notion and Miro. AFFiNE(pronounced [ə‘fain]) is a next-gen knowledge base that brings planning, sorting and creati…☆56,262Updated last week
- 🤱🏻 Turn any webpage into a desktop app with one command. 一键打包网页生成轻量桌面应用☆43,046Updated last week
- Free, simple, and intuitive online database diagram editor and SQL generator.☆34,387Updated this week
- A browser extension for automating your browser by connecting blocks☆20,370Updated last week
- 🔥 The Web Data API for AI - Turn entire websites into LLM-ready markdown or structured data☆64,319Updated last week
- 一款提示词优化器,助力于编写高质量的提示词☆16,376Updated last week