ocrmypdf / OCRmyPDF
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
☆27,405 · Updated 2 weeks ago
Alternatives and similar repositories for OCRmyPDF:
Users who are interested in OCRmyPDF are comparing it to the libraries listed below.
- #1 Locally hosted web application that allows you to perform various operations on PDF files ☆55,989 · Updated this week
- OCR & Document Extraction using vision models ☆10,962 · Updated last week
- User-friendly AI Interface (Supports Ollama, OpenAI API, ...) ☆89,708 · Updated this week
- 🤯 Lobe Chat - an open-source, modern-design AI chat framework. Supports Multi AI Providers (OpenAI / Claude 3 / Gemini / Ollama / DeepSe… ☆58,897 · Updated this week
- PDF scientific paper translation with preserved formats - AI-based full-text bilingual translation of PDF documents with the layout fully preserved; supports Google/DeepL/Ollama/OpenAI and other services; provides CLI/GUI/Docker/… ☆20,355 · Updated 3 weeks ago
- Toolkit for linearizing PDFs for LLM datasets/training ☆11,187 · Updated this week
- A high-quality tool for converting PDF to Markdown and JSON. A one-stop, open-source, high-quality data extraction tool that converts PDF to Markdown and JSON formats. ☆30,946 · Updated this week
- OCR, layout analysis, reading order, table recognition in 90+ languages ☆17,155 · Updated this week
- User-friendly Desktop Client App for AI Models/LLMs (GPT, Claude, Gemini, Ollama...) ☆34,183 · Updated last month
- Robust Speech Recognition via Large-Scale Weak Supervision ☆80,270 · Updated 3 months ago
- Enhanced ChatGPT Clone: Features Agents, DeepSeek, Anthropic, AWS, OpenAI, Assistants API, Azure, Groq, o1, GPT-4o, Mistral, OpenRouter, … ☆24,704 · Updated this week
- A browser extension for automating your browser by connecting blocks ☆16,690 · Updated last week
- Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, Mistral Small 3.1 and other large language models. ☆137,814 · Updated this week
- Bring projects, wikis, and teams together with AI. AppFlowy is the AI collaborative workspace where you achieve more without losing contr… ☆62,164 · Updated this week
- Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper. ☆14,214 · Updated last week
- Tesseract Open Source OCR Engine (main repository) ☆66,229 · Updated 3 weeks ago
- 🔥 🔥 🔥 Open Source Airtable Alternative ☆53,763 · Updated this week
- Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and … ☆26,348 · Updated 6 months ago
- Convert PDF to markdown + JSON quickly with high accuracy ☆24,187 · Updated last week
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model ☆7,461 · Updated 2 months ago
- Fair-code workflow automation platform with native AI capabilities. Combine visual building with custom code, self-host or cloud, 400+ in… ☆82,954 · Updated this week
- real time face swap and one-click video deepfake with only a single image ☆50,311 · Updated this week
- Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, m… ☆92,934 · Updated this week
- Focalboard is an open source, self-hosted alternative to Trello, Notion, and Asana. ☆23,203 · Updated 6 months ago
- AI app store powered by 24/7 desktop history. open source | 100% local | dev friendly | 24/7 screen, mic recording ☆13,374 · Updated this week
- SearXNG is a free internet metasearch engine which aggregates results from various search services and databases. Users are neither track… ☆18,459 · Updated this week
- Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provi… ☆48,469 · Updated this week
- Comfortably monitor your Internet traffic 🕵️‍♂️ ☆23,369 · Updated this week
- Instant voice cloning by MIT and MyShell. Audio foundation model. ☆31,877 · Updated this week
- 🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API. ☆35,829 · Updated this week