Parse PDFs into markdown using Vision LLMs
☆462Oct 4, 2025Updated 5 months ago
Alternatives and similar repositories for vision-parse
Users that are interested in vision-parse are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- PDF intelligence platform combining IBM Docling for document processing, LlamaIndex for data structuring, and Streamlit for a powerful UI…☆51Dec 30, 2024Updated last year
- ExtractThinker is a Document Intelligence library for LLMs, offering ORM-style interaction for flexible and powerful document workflows.☆1,501Aug 27, 2025Updated 6 months ago
- Convert PDF to markdown + JSON quickly with high accuracy☆32,910Mar 10, 2026Updated 2 weeks ago
- NeMo Retriever Library is a scalable, performance-oriented document content and metadata extraction microservice. NeMo Retriever extracti…☆2,885Updated this week
- OCR & Document Extraction using vision models☆12,191May 20, 2025Updated 10 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆10Oct 24, 2024Updated last year
- Document (PDF, Word, PPTX ...) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents…☆3,025Dec 8, 2025Updated 3 months ago
- Improved file parsing for LLM’s☆3,153Nov 13, 2024Updated last year
- Turn local files into a prompt for an LLM☆177Jan 19, 2025Updated last year
- Open source alternative to Gemini Deep Research. Generate reports with AI based on search results.☆2,133Dec 15, 2025Updated 3 months ago
- RAG Logger is an open-source logging tool designed specifically for Retrieval-Augmented Generation (RAG) applications. It serves as a lig…☆227Dec 24, 2024Updated last year
- Multi-agent that helps you organize and write documents.☆354Nov 15, 2024Updated last year
- A Comprehensive Toolkit for High-Quality PDF Content Extraction☆9,503Jan 3, 2025Updated last year
- File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.☆7,346Feb 21, 2025Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- AI reads books: Page-by-Page PDF Knowledge Extractor & Summarizer. script performs an intelligent page-by-page analysis of PDF books, met…☆1,582Jan 20, 2025Updated last year
- E2M converts various file types (doc, docx, epub, html, htm, url, pdf, ppt, pptx, mp3, m4a) into Markdown. It’s easy to install, with ded…☆1,280Sep 8, 2024Updated last year
- MilimoChat: Privacy-first, self-hosted AI chat with customizable personas, context-aware memory, and local analytics. Built on Python/Str…☆14Mar 12, 2025Updated last year
- Get your documents ready for gen AI☆56,339Updated this week
- Toolkit for linearizing PDFs for LLM datasets/training☆17,043Mar 17, 2026Updated last week
- ☆49Sep 11, 2025Updated 6 months ago
- SoTA production-ready AI retrieval system. Agentic Retrieval-Augmented Generation (RAG) with a RESTful API.☆7,741Nov 7, 2025Updated 4 months ago
- Knowledge Agents and Management in the Cloud☆4,248Mar 16, 2026Updated last week
- AutoRAG: An Open-Source Framework for Retrieval-Augmented Generation (RAG) Evaluation & Optimization with AutoML-Style Automation☆4,635Mar 10, 2026Updated 2 weeks ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- GraphRAG using Local LLMs - Features robust API and multiple apps for Indexing/Prompt Tuning/Query/Chat/Visualizing/Etc. This is meant to…☆2,314Nov 9, 2024Updated last year
- PDF craft can convert PDF files into various other formats. This project will focus on processing PDF files of scanned books.☆4,964Mar 2, 2026Updated 3 weeks ago
- Deep research agent to help you find the best GitHub repositories 🕵️!☆860Nov 20, 2025Updated 4 months ago
- LLM-Driven Extraction of Unstructured Data — Built for API Deployments & ETL Pipeline Workflows☆6,504Updated this week
- OCR, layout analysis, reading order, table recognition in 90+ languages☆19,506Mar 1, 2026Updated 3 weeks ago
- Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.☆56,760Updated this week
- FlexRAG: A RAG Framework for Information Retrieval and Generation.☆234Mar 19, 2026Updated last week
- Vision-Augmented Retrieval and Generation (VARAG) - Vision first RAG Engine☆498Jul 23, 2025Updated 8 months ago
- CLI that uses DSPy to interact with MCP servers.☆24Mar 10, 2025Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- [AAAI 2025] StoryWeaver: A Unified World Model for Knowledge-Enhanced Story Character Customization☆227Jan 13, 2026Updated 2 months ago
- Very minimal (and stateless) agent framework☆44Jan 12, 2025Updated last year
- Chrome / Edge extension to turn arXiv papers into Markdown codes in one click.☆91Mar 20, 2025Updated last year
- Structured data extraction and instruction calling with ML, LLM and Vision LLM☆5,142Updated this week
- RAG that intelligently adapts to your use case, data, and queries☆3,735Nov 1, 2025Updated 4 months ago
- Using GPT to parse PDF☆3,563Apr 17, 2025Updated 11 months ago
- OpenSource Production ready Customer service with built in Evals and monitoring☆1,438Jan 12, 2026Updated 2 months ago