Parse PDFs into markdown using Vision LLMs
☆478Oct 4, 2025Updated 8 months ago
Alternatives and similar repositories for vision-parse
Users that are interested in vision-parse are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- PDF intelligence platform combining IBM Docling for document processing, LlamaIndex for data structuring, and Streamlit for a powerful UI…☆52Dec 30, 2024Updated last year
- ExtractThinker is a Document Intelligence library for LLMs, offering ORM-style interaction for flexible and powerful document workflows.☆1,568Aug 27, 2025Updated 9 months ago
- Convert PDF to markdown + JSON quickly with high accuracy☆36,101Jun 6, 2026Updated last week
- OCR & Document Extraction using vision models☆12,238May 20, 2025Updated last year
- NeMo Retriever Library is a scalable, performance-oriented document content and metadata extraction microservice. NeMo Retriever Library …☆2,941Updated this week
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆11Oct 24, 2024Updated last year
- Document (PDF, Word, PPTX ...) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents…☆3,104Dec 8, 2025Updated 6 months ago
- Improved file parsing for LLM’s☆3,162May 17, 2026Updated last month
- Turn local files into a prompt for an LLM☆175Jan 19, 2025Updated last year
- Open source alternative to Gemini Deep Research. Generate reports with AI based on search results.☆2,140Dec 15, 2025Updated 6 months ago
- RAG Logger is an open-source logging tool designed specifically for Retrieval-Augmented Generation (RAG) applications. It serves as a lig…☆227Dec 24, 2024Updated last year
- File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.☆7,387Feb 21, 2025Updated last year
- Multi-agent that helps you organize and write documents.☆352Nov 15, 2024Updated last year
- A Comprehensive Toolkit for High-Quality PDF Content Extraction☆9,728Jan 3, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- E2M converts various file types (doc, docx, epub, html, htm, url, pdf, ppt, pptx, mp3, m4a) into Markdown. It’s easy to install, with ded…☆1,294Sep 8, 2024Updated last year
- Get your documents ready for gen AI☆61,672Updated this week
- AI reads books: Page-by-Page PDF Knowledge Extractor & Summarizer. script performs an intelligent page-by-page analysis of PDF books, met…☆2,150Jan 20, 2025Updated last year
- ☆49Sep 11, 2025Updated 9 months ago
- Toolkit for linearizing PDFs for LLM datasets/training☆17,387Mar 25, 2026Updated 2 months ago
- SoTA production-ready AI retrieval system. Agentic Retrieval-Augmented Generation (RAG) with a RESTful API.☆7,887Nov 7, 2025Updated 7 months ago
- Knowledge Agents and Management in the Cloud☆4,250May 18, 2026Updated last month
- Analyze videos using LLMs, Computer Vision and Automatic Speech Recognition☆1,453Apr 19, 2026Updated last month
- GraphRAG using Local LLMs - Features robust API and multiple apps for Indexing/Prompt Tuning/Query/Chat/Visualizing/Etc. This is meant to…☆2,317Nov 9, 2024Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- AutoRAG: An Open-Source Framework for Retrieval-Augmented Generation (RAG) Evaluation & Optimization with AutoML-Style Automation☆4,830Updated this week
- Deep research agent to help you find the best GitHub repositories 🕵️!☆881Jun 9, 2026Updated last week
- FlexRAG: A RAG Framework for Information Retrieval and Generation.☆237Jun 3, 2026Updated 2 weeks ago
- OCR, layout analysis, reading order, table recognition in 90+ languages☆20,840Updated this week
- LLM-Driven Extraction of Unstructured Data — Built for API Deployments & ETL Pipeline Workflows☆6,651Updated this week
- CLI that uses DSPy to interact with MCP servers.☆24Mar 10, 2025Updated last year
- Very minimal (and stateless) agent framework☆44Jan 12, 2025Updated last year
- Vision-Augmented Retrieval and Generation (VARAG) - Vision first RAG Engine☆497Jul 23, 2025Updated 10 months ago
- RAG that intelligently adapts to your use case, data, and queries☆3,800Nov 1, 2025Updated 7 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- [AAAI 2025] StoryWeaver: A Unified World Model for Knowledge-Enhanced Story Character Customization☆226Jan 13, 2026Updated 5 months ago
- PDF craft can convert PDF files into various other formats. This project will focus on processing PDF files of scanned books.☆5,759Jun 6, 2026Updated last week
- Transforms complex documents like PDFs and Office docs into LLM-ready markdown/JSON for your Agentic workflows.☆67,596Updated this week
- Chrome / Edge extension to turn arXiv papers into Markdown codes in one click.☆94Mar 20, 2025Updated last year
- A Benchmark of PDF Information Extraction Tools using a Multi-Task and Multi-Domain Evaluation Framework for Academic Documents☆32Dec 8, 2022Updated 3 years ago
- Fast and efficient unstructured data extraction. Written in Rust with bindings for many languages.☆1,755Dec 21, 2024Updated last year
- Structured data extraction, instruction calling and agentic workflows with ML, LLM and Vision LLM☆5,161Jun 11, 2026Updated last week