bytedance / Dolphin
The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.
☆7,910 Updated this week
Alternatives and similar repositories for Dolphin
Users who are interested in Dolphin are comparing it to the libraries listed below.
- Multilingual Document Layout Parsing in a Single Vision-Language Model ☆5,872 Updated last month
- OCR model that handles complex tables, forms, handwriting with full layout. ☆3,137 Updated 3 weeks ago
- ContextGem: Effortless LLM extraction from documents ☆1,744 Updated last month
- LLM agents built for control. Designed for real-world use. Deployed in minutes. ☆16,693 Updated this week
- 📑 PageIndex: Document Index for Reasoning-based RAG ☆4,305 Updated last week
- Python library for Agentic Document Extraction from LandingAI ☆2,301 Updated 3 weeks ago
- A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive vi… ☆17,246 Updated 3 weeks ago
- An on-premises, OCR-free unstructured data extraction, markdown conversion and benchmarking toolkit. (https://idp-leaderboard.org/) ☆1,817 Updated 3 months ago
- Embedding Atlas is a tool that provides interactive visualizations for large embeddings. It allows you to visualize, cross-filter, and se… ☆4,459 Updated this week
- "RAG-Anything: All-in-One RAG Framework" ☆11,126 Updated last week
- 🦛 CHONK docs with Chonkie ✨ — The lightweight ingestion library for fast, efficient and robust RAG pipelines ☆3,350 Updated this week
- RAG on Everything with LEANN. Enjoy 97% storage savings while running a fast, accurate, and 100% private RAG application on your personal… ☆4,995 Updated this week
- The most accurate document search and store for building AI apps ☆3,417 Updated this week
- ☆2,074 Updated 9 months ago
- 100+ Fine-tuning Tutorial Notebooks on Google Colab, Kaggle and more. ☆3,894 Updated this week
- Build Real-Time Knowledge Graphs for AI Agents ☆21,046 Updated last week
- Eigent: The World's First Multi-agent Workforce to Unlock Your Exceptional Productivity. ☆2,551 Updated this week
- The absolute trainer to light up AI agents. ☆9,602 Updated this week
- "AutoAgent: Fully-Automated and Zero-Code LLM Agent Framework" ☆8,271 Updated 2 months ago
- AnyCrawl 🚀: A Node.js/TypeScript crawler that turns websites into LLM-ready data and extracts structured SERP results from Google/Bing/B… ☆2,489 Updated last week
- [CVPR 2025] A Comprehensive Benchmark for Document Parsing and Evaluation ☆1,268 Updated 2 weeks ago
- Agent S: an open agentic framework that uses computers like a human ☆8,806 Updated this week
- Open Source Application for Advanced LLM + Diffusion Engineering: interact, train, fine-tune, and evaluate large language models on your … ☆4,585 Updated this week
- ⚙️ Create and run workflows (RPA 2.0) ☆3,815 Updated last week
- A research prototype of a human-centered web agent ☆9,112 Updated this week
- Recursive-Open-Meta-Agent v0.1 (Beta). A meta-agent framework to build high-performance multi-agent systems. ☆4,599 Updated 2 weeks ago
- Colivara is a suite of services that allows you to store, search, and retrieve documents based on their visual embedding. ColiVara has st… ☆1,404 Updated 7 months ago
- Video-based AI memory library. Store millions of text chunks in MP4 files with lightning-fast semantic search. No database needed. ☆10,492 Updated 2 months ago
- Vision infrastructure to turn complex documents into RAG/LLM-ready data ☆2,918 Updated 2 months ago
- "Paper2Slides: From Paper to Presentation in One Click" ☆2,112 Updated this week