ocrmypdf / OCRmyPDF
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
☆29,506Updated this week
Alternatives and similar repositories for OCRmyPDF
Users that are interested in OCRmyPDF are comparing it to the libraries listed below
- Toolkit for linearizing PDFs for LLM datasets/training☆13,006Updated this week
- A high-quality tool for converting PDF to Markdown and JSON. A one-stop, open-source, high-quality data extraction tool that converts PDF to Markdown and JSON formats.☆36,799Updated this week
- Tesseract Open Source OCR Engine (main repository)☆67,674Updated 3 weeks ago
- OCR & Document Extraction using vision models☆11,468Updated last month
- Awesome multilingual OCR and Document Parsing toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languag…☆50,836Updated this week
- User-friendly AI Interface (Supports Ollama, OpenAI API, ...)☆100,205Updated this week
- 🍒 Cherry Studio is a desktop client that supports multiple LLM providers.☆29,040Updated this week
- A browser extension for automating your browser by connecting blocks☆18,783Updated 3 weeks ago
- A simple screen parsing tool towards pure vision based GUI agent☆22,487Updated 3 months ago
- An open-source remote desktop application designed for self-hosting, as an alternative to TeamViewer.☆91,354Updated this week
- Fair-code workflow automation platform with native AI capabilities. Combine visual building with custom code, self-host or cloud, 400+ in…☆110,835Updated this week
- Open-Source No Code Web Data Extraction Platform • Turn Websites To APIs & Spreadsheets In Minutes!☆13,102Updated this week
- The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on…☆33,330Updated this week
- Convert PDF to markdown + JSON quickly with high accuracy☆26,105Updated last week
- 🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.☆40,551Updated this week
- best way to save what you love☆34,492Updated this week
- An AI Hedge Fund Team☆36,948Updated this week
- The best and simplest free open source web page change detection, website watcher, restock monitor and notification service. Restock Mon…☆24,553Updated this week
- Production-ready platform for agentic workflow development.☆104,441Updated this week
- Comfortably monitor your Internet traffic 🕵️‍♂️☆24,505Updated this week
- Python tool for converting files and office documents to Markdown.☆59,444Updated 3 weeks ago
- SearXNG is a free internet metasearch engine which aggregates results from various search services and databases. Users are neither track…☆19,854Updated this week
- LLM inference in C/C++☆82,170Updated this week
- A video translation and dubbing tool powered by LLMs, offering professional-grade translations and one-click full-process deployment. It…☆7,907Updated this week
- A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity…☆11,236Updated this week
- A modern, open-source, self-hosted knowledge management and note-taking platform designed for privacy-conscious users and organizations.☆42,009Updated this week
- PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.☆7,449Updated this week
- Netflix-level subtitle cutting, translation, alignment, and even dubbing - one-click fully automated AI video subtitle team | Netflix-level subtitle…☆13,296Updated last month
- User-friendly Desktop Client App for AI Models/LLMs (GPT, Claude, Gemini, Ollama...)☆35,459Updated this week
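- LLM API management and distribution system supporting mainstream models including OpenAI, Azure, Anthropic Claude, Google Gemini, DeepSeek, ByteDance Doubao, ChatGLM, ERNIE Bot, iFlytek Spark, Tongyi Qianwen, 360 Zhinao, and Tencent Hunyuan, with unified API adaptation; can be used for key …☆25,797Updated 4 months ago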