The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.
☆8,852Dec 17, 2025Updated 2 months ago
Alternatives and similar repositories for Dolphin
Users that are interested in Dolphin are comparing it to the libraries listed below
Sorting:
- Get your documents ready for gen AI☆54,094Feb 24, 2026Updated last week
- Toolkit for linearizing PDFs for LLM datasets/training☆16,947Feb 19, 2026Updated last week
- A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive vi…☆34,244Feb 25, 2026Updated last week
- Build Real-Time Knowledge Graphs for AI Agents☆23,192Updated this week
- Python tool for converting files and office documents to Markdown.☆88,637Feb 20, 2026Updated last week
- OCR, layout analysis, reading order, table recognition in 90+ languages☆19,360Feb 24, 2026Updated last week
- Multilingual Document Layout Parsing in a Single Vision-Language Model☆7,876Feb 15, 2026Updated 2 weeks ago
- "RAG-Anything: All-in-One RAG Framework"☆13,867Updated this week
- Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.☆55,275Updated this week
- Build, run, manage agentic software at scale.☆38,276Updated this week
- 🪄 Create rich visualizations with AI☆15,069Feb 24, 2026Updated last week
- LLM agents built for control. Designed for real-world use. Deployed in minutes.☆17,775Updated this week
- Universal memory layer for AI Agents☆47,994Feb 23, 2026Updated last week
- RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to creat…☆73,900Updated this week
- An open-source RAG-based tool for chatting with your documents.☆25,168Updated this week
- Collection of awesome LLM apps with AI Agents and RAG using OpenAI, Anthropic, Gemini and opensource models.☆97,908Feb 25, 2026Updated last week
- File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.☆7,342Feb 21, 2025Updated last year
- Legacy Python library for Agentic Document Extraction (ADE). Use the landingai-ade library for all new projects.☆2,371Feb 19, 2026Updated last week
- "AutoAgent: Fully-Automated and Zero-Code LLM Agent Framework"☆8,616Oct 16, 2025Updated 4 months ago
- Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek, Qwen, Llama, Gemma, TTS 2x faster with 70% less VRAM.☆53,029Updated this week
- 🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN☆60,971Feb 25, 2026Updated last week
- 🔥 The Web Data API for AI - Turn entire websites into LLM-ready markdown or structured data☆87,163Updated this week
- Convert PDF to markdown + JSON quickly with high accuracy☆32,069Updated this week
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model☆8,089Feb 10, 2025Updated last year
- A system for agentic LLM-powered data processing and ETL☆3,636Feb 2, 2026Updated last month
- A Comprehensive Toolkit for High-Quality PDF Content Extraction☆9,402Jan 3, 2025Updated last year
- Build, deploy, and orchestrate AI agents. Sim is the central intelligence layer for your AI workforce.☆26,746Updated this week
- ContextGem: Effortless LLM extraction from documents☆1,805Feb 22, 2026Updated last week
- An open-source SuperAgent harness that researches, codes, and creates. With the help of sandboxes, memories, tools, skills and subagents,…☆20,843Updated this week
- The AI Browser Automation Framework☆21,261Updated this week
- DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception☆2,017Apr 14, 2025Updated 10 months ago
- 📑 PageIndex: Document Index for Vectorless, Reasoning-based RAG☆19,041Updated this week
- OCR & Document Extraction using vision models☆12,155May 20, 2025Updated 9 months ago
- ⚡️ GenBI (Generative BI) queries any database in natural language, generates accurate SQL (Text-to-SQL), charts (Text-to-Chart), and AI-p…☆14,472Feb 23, 2026Updated last week
- Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations,…☆17,889Updated this week
- 🌐 Make websites accessible for AI agents. Automate tasks online with ease.☆79,028Updated this week
- Kortix – build, manage and train AI Agents.☆19,418Updated this week
- An on-premises, OCR-free unstructured data extraction, markdown conversion and benchmarking toolkit. (https://idp-leaderboard.org/)☆1,862Aug 25, 2025Updated 6 months ago
- Official inference framework for 1-bit LLMs☆28,640Feb 3, 2026Updated last month