Multi-modal OCR pipeline optimized for ML training (text, figure, math, tables, diagrams)
☆682May 13, 2026Updated last week
Alternatives and similar repositories for Versatile-OCR-Program
Users who are interested in Versatile-OCR-Program are comparing it to the libraries listed below.
- 📚 discover story relationships☆352Apr 28, 2026Updated 3 weeks ago
- pingcap/autoflow is a Graph RAG based and conversational knowledge base tool built with TiDB Serverless Vector Storage. Demo: https://tid…☆2,780Apr 27, 2026Updated 3 weeks ago
- The most accurate document search and store for building AI apps☆3,598May 11, 2026Updated last week
- Enhances Tesseract OCR output using LLMs (local or API) for error correction, smart chunking, and markdown formatting of scanned PDFs☆2,924Mar 22, 2026Updated last month
- Fully neural approach for text chunking☆409Oct 23, 2025Updated 6 months ago
- Toolkit for linearizing PDFs for LLM datasets/training☆17,319Mar 25, 2026Updated last month
- Run larger LLMs with longer contexts on Apple Silicon by using differentiated precision for KV cache quantization. KVSplit enables 8-bit …☆363May 21, 2025Updated 11 months ago
- PDF to markdown using vision LLMs — tables, layouts, and structure preserved☆891Feb 21, 2026Updated 2 months ago
- Fully open-source command-line AI assistant inspired by OpenAI Codex, supporting local language models.☆683Jul 7, 2025Updated 10 months ago
- Omni SenseVoice: High-Speed Speech Recognition with word timestamps 🗣️🎯☆892Dec 10, 2025Updated 5 months ago
- Fast and efficient unstructured data extraction. Written in Rust with bindings for many languages.☆1,751Dec 21, 2024Updated last year
- The SOTA Open-Source Browser Agent for autonomously performing complex tasks on the web☆2,348Jun 9, 2025Updated 11 months ago
- OCR & Document Extraction using vision models☆12,230May 20, 2025Updated last year
- Radient turns many data types (not just text) into vectors for similarity search, RAG, regression analysis, and more.☆283Mar 2, 2026Updated 2 months ago
- Detect and extract tables to markdown and csv☆755Jan 24, 2025Updated last year
- ☆898May 13, 2025Updated last year
- A cache for AI agents to learn and replay complex behaviors.☆761Jun 15, 2025Updated 11 months ago
- Vision infrastructure to turn complex documents into RAG/LLM-ready data☆2,942Apr 9, 2026Updated last month
- Speech to Text but with all the bells and whistles and most importantly AI! AI will clean up your filler words, edit and will refine what…☆332Feb 9, 2025Updated last year
- Secretary is an AI-powered tool that analyzes social media content from specified accounts and delivers results via WeChat. It supports c…☆358Aug 4, 2025Updated 9 months ago
- RAG Logger is an open-source logging tool designed specifically for Retrieval-Augmented Generation (RAG) applications. It serves as a lig…☆227Dec 24, 2024Updated last year
- Great Claude skills of everyone.☆37Nov 11, 2025Updated 6 months ago
- ☆10Feb 14, 2025Updated last year
- Transcribe PDFs with local LLMs☆819Jan 27, 2026Updated 3 months ago
- Have a natural, spoken conversation with AI!☆3,686Jul 11, 2025Updated 10 months ago
- NeMo Retriever Library is a scalable, performance-oriented document content and metadata extraction microservice. NeMo Retriever extracti…☆2,923May 13, 2026Updated last week
- OCR, layout analysis, reading order, table recognition in 90+ languages☆19,730May 6, 2026Updated 2 weeks ago
- A powerful document AI question-answering tool that connects to your local Ollama models. Create, manage, and interact with RAG systems f…☆1,095Aug 9, 2025Updated 9 months ago
- ☆273Nov 15, 2024Updated last year
- A self-hosted API that takes a URL and returns a file with browser screenshots.☆1,183Mar 9, 2025Updated last year
- Animating R1's thoughts.☆381Feb 17, 2025Updated last year
- Convert PDF to markdown + JSON quickly with high accuracy☆35,144May 5, 2026Updated 2 weeks ago
- Your toolkit for autonomous, evolving agent ecosystems. Create, execute, govern, and evolve agents that learn from experience, collaborat…☆451Nov 24, 2025Updated 5 months ago
- ☆104Apr 1, 2025Updated last year
- AI reads books: Page-by-Page PDF Knowledge Extractor & Summarizer. The script performs an intelligent page-by-page analysis of PDF books, met…☆2,121Jan 20, 2025Updated last year
- A polyglot document intelligence framework with a Rust core. Extract text, metadata, images, and structured information from PDFs, Office…☆8,303Updated this week
- A Python library to inspect and modify the internal structure of a PDF file☆1,011Aug 17, 2025Updated 9 months ago
- Pragmatic framework to build LLM Copilots☆64Mar 11, 2025Updated last year
- Web scraper made with AI and simplicity in mind. It runs as a CLI that can be parallelized and outputs high-quality markdown content.☆541Nov 3, 2025Updated 6 months ago