ocrmypdf / OCRmyPDFLinks
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
☆30,792Updated this week
Alternatives and similar repositories for OCRmyPDF
Users that are interested in OCRmyPDF are comparing it to the libraries listed below
Sorting:
- Toolkit for linearizing PDFs for LLM datasets/training☆13,781Updated this week
- A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。☆42,085Updated this week
- Convert PDF to markdown + JSON quickly with high accuracy☆27,942Updated this week
- User-friendly AI Interface (Supports Ollama, OpenAI API, ...)☆107,003Updated this week
- OCR, layout analysis, reading order, table recognition in 90+ languages☆18,192Updated last week
- Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and …☆27,536Updated 10 months ago
- A privacy-first, self-hosted, fully open source personal knowledge management software, written in typescript and golang.☆36,545Updated this week
- The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, MCP compatibility, and more.☆47,987Updated this week
- A Comprehensive Toolkit for High-Quality PDF Content Extraction☆8,344Updated 7 months ago
- #1 Locally hosted web application that allows you to perform various operations on PDF files☆64,387Updated this week
- OCR & Document Extraction using vision models☆11,738Updated 2 months ago
- Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models.☆150,223Updated this week
- 🍒 Cherry Studio is a desktop client that supports for multiple LLM providers.☆31,389Updated this week
- Integrate the DeepSeek API into popular softwares☆33,371Updated 3 months ago
- A simple screen parsing tool towards pure vision based GUI agent☆23,265Updated 4 months ago
- Easiest no code web data extraction platform. Instantly turn any website into API or spreadsheet.☆13,479Updated last week
- ⚡️HivisionIDPhotos: a lightweight and efficient AI ID photos tools. 一个轻量级的AI证件照制作算法。☆18,966Updated last month
- 🔥 MaxKB is an open-source platform for building enterprise-grade agents. MaxKB 是强大易用的开源企业级智能体平台。☆17,665Updated this week
- 🌐 Make websites accessible for AI agents. Automate tasks online with ease.☆67,575Updated this week
- RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.☆62,444Updated this week
- screen sharing for developers https://screego.net/☆9,102Updated last month
- Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.☆15,014Updated last week
- User-friendly Desktop Client App for AI Models/LLMs (GPT, Claude, Gemini, Ollama...)☆36,196Updated last week
- Self-hosted AI coding assistant☆31,956Updated this week
- 📄 Awesome OCR multiple programing languages toolkits based on ONNXRuntime, OpenVINO, PaddlePaddle and PyTorch.☆4,809Updated last week
- 📂 Web File Browser☆30,803Updated last week
- There can be more than Notion and Miro. AFFiNE(pronounced [ə‘fain]) is a next-gen knowledge base that brings planning, sorting and creati…☆54,261Updated this week
- AI app store powered by 24/7 desktop history. open source | 100% local | dev friendly | 24/7 screen, mic recording☆15,441Updated this week
- Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your p…☆49,367Updated this week
- Convert PDF to HTML without losing text or format.☆5,133Updated last month