PDF to markdown using vision LLMs — tables, layouts, and structure preserved
☆890Feb 21, 2026Updated last month
Alternatives and similar repositories for api-llm-ocr
Users that are interested in api-llm-ocr are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- OCR & Document Extraction using vision models☆12,193May 20, 2025Updated 10 months ago
- Document (PDF, Word, PPTX ...) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents…☆3,081Dec 8, 2025Updated 4 months ago
- Enhances Tesseract OCR output using LLMs (local or API) for error correction, smart chunking, and markdown formatting of scanned PDFs☆2,905Mar 22, 2026Updated 2 weeks ago
- Open-source platform for extracting structured data from documents using AI.☆1,470May 15, 2025Updated 10 months ago
- ai for jq☆249Sep 20, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Open-Source Grammarly Alternative☆1,654Jun 17, 2025Updated 9 months ago
- pingcap/autoflow is a Graph RAG based and conversational knowledge base tool built with TiDB Serverless Vector Storage. Demo: https://tid…☆2,759Updated this week
- Vision infrastructure to turn complex documents into RAG/LLM-ready data☆2,939Sep 24, 2025Updated 6 months ago
- Web scraper made for AI and simplicity in mind. It runs as a CLI that can be parallelized and outputs high-quality markdown content.☆540Nov 3, 2025Updated 5 months ago
- ☆449Sep 18, 2024Updated last year
- Create mind maps to learn new things using AI.☆571Nov 2, 2024Updated last year
- ☆1,369Apr 18, 2025Updated 11 months ago
- Omni SenseVoice: High-Speed Speech Recognition with words timestamps 🗣️🎯☆887Dec 10, 2025Updated 4 months ago
- GUI analyzer for deep-diving into PDF files. Detect malicious payloads, understand object relationships, and extract key information for …☆872Aug 22, 2024Updated last year
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- The SOTA Open-Source Browser Agent for autonomously performing complex tasks on the web☆2,341Jun 9, 2025Updated 10 months ago
- OCR, layout analysis, reading order, table recognition in 90+ languages☆19,557Apr 3, 2026Updated last week
- Convert PDF to markdown + JSON quickly with high accuracy☆33,352Updated this week
- Document to Markdown OCR library with Llama 3.2 vision☆2,425Jan 20, 2025Updated last year
- Open-source framework for exporting your personal data.☆1,473Dec 25, 2024Updated last year
- An open-source RAG-based tool for chatting with your documents.☆25,251Apr 3, 2026Updated last week
- Laminar - open-source observability platform purpose-built for AI agents. YC S24.☆2,757Updated this week
- Large Action Model framework to develop AI Web Agents☆6,311Jan 21, 2025Updated last year
- OpenCV+YOLO+LLAVA powered video surveillance system☆791Oct 21, 2025Updated 5 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- This repo provides the server side code for llmsherpa API to connect. It includes parsers for various file formats.☆1,282Mar 28, 2025Updated last year
- Things you can do with the token embeddings of an LLM☆1,454Dec 1, 2025Updated 4 months ago
- Interact with your SQL database, Natural Language to SQL using LLMs☆3,629Jul 24, 2024Updated last year
- Multi-modal OCR pipeline optimized for ML training (text, figure, math, tables, diagrams)☆682May 20, 2025Updated 10 months ago
- 🔥 Open Source Browser API for AI Agents & Apps. Steel Browser is a batteries-included browser sandbox that lets you automate the web wit…☆6,773Apr 2, 2026Updated last week
- g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chains☆4,200Dec 30, 2025Updated 3 months ago
- 🦾 Take control of your AI agents☆1,389Aug 22, 2025Updated 7 months ago
- Empowering RAG with a memory-based data interface for all-purpose applications!☆2,239Sep 11, 2025Updated 6 months ago
- A conversational, AI device + software framework for companionship, entertainment, education, healthcare, IoT applications, and DIY robot…☆543Mar 27, 2026Updated 2 weeks ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- A system for agentic LLM-powered data processing and ETL☆3,702Mar 27, 2026Updated 2 weeks ago
- RAG that intelligently adapts to your use case, data, and queries☆3,758Nov 1, 2025Updated 5 months ago
- Local realtime voice AI☆2,477Nov 26, 2025Updated 4 months ago
- grep for words with similar meaning to the query☆1,214Aug 19, 2024Updated last year
- Detect and extract tables to markdown and csv☆757Jan 24, 2025Updated last year
- File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.☆7,348Feb 21, 2025Updated last year
- Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with…☆6,582Dec 5, 2025Updated 4 months ago