bytedance / Dolphin
The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.
☆7,336 Updated 2 weeks ago
Alternatives and similar repositories for Dolphin
Users who are interested in Dolphin are comparing it to the libraries listed below.
- "RAG-Anything: All-in-One RAG Framework" ☆8,084 Updated 2 weeks ago
- Multilingual Document Layout Parsing in a Single Vision-Language Model ☆4,963 Updated this week
- Python library for Agentic Document Extraction from LandingAI ☆2,090 Updated this week
- An on-premises, OCR-free unstructured data extraction, markdown conversion and benchmarking toolkit. (https://idp-leaderboard.org/) ☆1,764 Updated last month
- Embedding Atlas is a tool that provides interactive visualizations for large embeddings. It allows you to visualize, cross-filter, and se… ☆3,927 Updated this week
- ☆9,125 Updated last month
- ContextGem: Effortless LLM extraction from documents ☆1,522 Updated last week
- 📄🧠 PageIndex: Document Index for Reasoning-based RAG ☆2,731 Updated 3 weeks ago
- Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement… ☆7,522 Updated this week
- mcp-use is the easiest way to interact with mcp servers with custom agents ☆7,895 Updated this week
- 🦛 CHONK docs with Chonkie ✨ - The no-nonsense RAG library ☆2,550 Updated this week
- Eigent is the World's First Multi-agent Workforce to Unlock Your Exceptional Productivity. ☆2,251 Updated this week
- LLM agents built for control. Designed for real-world use. Deployed in minutes. ☆13,652 Updated this week
- Recursive-Open-Meta-Agent v0.1 (Beta). A meta-agent framework to build high-performance multi-agent systems. ☆3,709 Updated 2 weeks ago
- 100+ Fine-tuning LLM Notebooks on Google Colab, Kaggle, and more. ☆3,725 Updated this week
- Expose your FastAPI endpoints as Model Context Protocol (MCP) tools, with Auth! ☆10,719 Updated last month
- Toolkit for linearizing PDFs for LLM datasets/training ☆14,236 Updated this week
- DeepResearchAgent is a hierarchical multi-agent system designed not only for deep research tasks but also for general-purpose task solvin… ☆2,747 Updated 2 weeks ago
- OCRFlux is a lightweight yet powerful multimodal toolkit that significantly advances PDF-to-Markdown conversion, excelling in complex lay… ☆2,313 Updated 2 months ago
- The most accurate document search and store for building AI apps ☆3,309 Updated this week
- Implementation of my RAG system that won all categories in Enterprise RAG Challenge 2 ☆1,900 Updated 5 months ago
- ☆4,144 Updated this week
- MCP Toolbox for Databases is an open source MCP server for databases. ☆10,910 Updated this week
- Prompt Orchestration Markup Language ☆4,637 Updated this week
- Memory for AI Agents in 6 lines of code ☆7,607 Updated this week
- RAG on Everything with LEANN. Enjoy 97% storage savings while running a fast, accurate, and 100% private RAG application on your personal… ☆3,125 Updated this week
- Metorial MCP Containers - Containerized versions of hundreds of MCP servers 📡 🧠 ☆2,910 Updated this week
- SOTA search powered LLM ☆3,676 Updated 6 months ago
- II-Agent: a new open-source framework to build and deploy intelligent agents ☆2,903 Updated last month
- Build, enrich, and transform datasets using AI models with no code ☆1,510 Updated this week