ocrmypdf / OCRmyPDFLinks
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
☆31,146Updated this week
Alternatives and similar repositories for OCRmyPDF
Users that are interested in OCRmyPDF are comparing it to the libraries listed below
Sorting:
- The Web Data API for AI - Turn entire websites into LLM-ready markdown or structured data 🔥☆56,708Updated this week
- A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。☆43,723Updated last week
- #1 Locally hosted web application that allows you to perform various operations on PDF files☆66,977Updated this week
- Toolkit for linearizing PDFs for LLM datasets/training☆14,091Updated this week
- OCR & Document Extraction using vision models☆11,828Updated 3 months ago
- Tesseract Open Source OCR Engine (main repository)☆69,615Updated last month
- ⚡ Easiest no code web data extraction platform • Instantly turn any website into API or spreadsheet ⚡☆13,621Updated this week
- OCR, layout analysis, reading order, table recognition in 90+ languages☆18,557Updated last week
- An open-source remote desktop application designed for self-hosting, as an alternative to TeamViewer.☆98,352Updated this week
- Awesome multilingual OCR and Document Parsing toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languag…☆53,563Updated last week
- Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and …☆27,902Updated 11 months ago
- Python tool for converting files and office documents to Markdown.☆73,124Updated last week
- A Comprehensive Toolkit for High-Quality PDF Content Extraction☆8,613Updated 8 months ago
- 🧡 Follow everything in one place☆33,885Updated this week
- Integrate the DeepSeek API into popular softwares☆33,809Updated last week
- A browser extension for automating your browser by connecting blocks☆19,979Updated 3 weeks ago
- PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.☆8,007Updated last week
- Convert PDF to markdown + JSON quickly with high accuracy☆28,654Updated last week
- A collection of MCP servers.☆69,974Updated last week
- A free and open source, self hosted Ai based live meeting note taker and minutes summary generator that can completely run in your Local …☆7,464Updated this week
- SearXNG is a free internet metasearch engine which aggregates results from various search services and databases. Users are neither track…☆21,896Updated this week
- A modern, open-source, self-hosted knowledge management and note-taking platform designed for privacy-conscious users and organizations.☆44,304Updated this week
- Netflix-level subtitle cutting, translation, alignment, and even dubbing - one-click fully automated AI video subtitle team | Netflix级字幕切…☆14,958Updated 3 months ago
- No fortress, purely open ground. OpenManus is Coming.☆49,803Updated this week
- AI app store powered by 24/7 desktop history. open source | 100% local | dev friendly | 24/7 screen, mic recording☆15,593Updated 2 weeks ago
- 🚀 One-stop solution for creating your digital avatar from chat history 💡 Fine-tune LLMs with your chat logs to capture your unique styl…☆15,487Updated 2 weeks ago
- Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. …☆30,923Updated 2 weeks ago
- A simple screen parsing tool towards pure vision based GUI agent☆23,512Updated this week
- Collection of publicly available IPTV channels from all over the world☆97,096Updated this week
- Yet Another Document Translator☆5,212Updated this week