bytedance / DolphinLinks
The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.
☆8,091Updated 3 weeks ago
Alternatives and similar repositories for Dolphin
Users that are interested in Dolphin are comparing it to the libraries listed below
Sorting:
- Multilingual Document Layout Parsing in a Single Vision-Language Model☆5,952Updated last week
- "RAG-Anything: All-in-One RAG Framework"☆11,885Updated last week
- 📑 PageIndex: Document Index for Reasoning-based RAG☆4,506Updated 2 weeks ago
- OCR model that handles complex tables, forms, handwriting with full layout.☆4,142Updated 2 weeks ago
- 🦛 CHONK docs with Chonkie ✨ — The lightweight ingestion library for fast, efficient and robust RAG pipelines☆3,443Updated this week
- "AutoAgent: Fully-Automated and Zero-Code LLM Agent Framework"☆8,379Updated 2 months ago
- Python library for Agentic Document Extraction from LandingAI☆2,315Updated 3 weeks ago
- An on-premises, OCR-free unstructured data extraction, markdown conversion and benchmarking toolkit. (https://idp-leaderboard.org/)☆1,828Updated 4 months ago
- ContextGem: Effortless LLM extraction from documents☆1,750Updated 2 weeks ago
- Eigent: The World's First Multi-agent Workforce to Unlock Your Exceptional Productivity.☆2,681Updated last week
- A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive vi…☆19,990Updated last week
- RAG on Everything with LEANN. Enjoy 97% storage savings while running a fast, accurate, and 100% private RAG application on your personal…☆7,997Updated this week
- Embedding Atlas is a tool that provides interactive visualizations for large embeddings. It allows you to visualize, cross-filter, and se…☆4,505Updated 2 weeks ago
- The absolute trainer to light up AI agents.☆10,018Updated last week
- Data transformation framework for AI. Ultra performant, with incremental processing. 🌟 Star if you like it!☆5,681Updated this week
- MCP Toolbox for Databases is an open source MCP server for databases.☆12,173Updated this week
- 100+ Fine-tuning Tutorial Notebooks on Google Colab, Kaggle and more.☆3,937Updated this week
- Build, enrich, and transform datasets using AI models with no code☆1,612Updated 2 months ago
- Colivara is a suite of services that allows you to store, search, and retrieve documents based on their visual embedding. ColiVara has st…☆1,410Updated 8 months ago
- AnyCrawl 🚀: A Node.js/TypeScript crawler that turns websites into LLM-ready data and extracts structured SERP results from Google/Bing/B…☆2,503Updated this week
- Expose your FastAPI endpoints as Model Context Protocol (MCP) tools, with Auth!☆11,337Updated last month
- A research prototype of a human-centered web agent☆9,544Updated 2 weeks ago
- 🔥 Comprehensive survey on Context Engineering: from prompt engineering to production-grade AI systems. hundreds of papers, frameworks, …☆2,778Updated 5 months ago
- Memory for AI Agents in 6 lines of code☆10,721Updated this week
- LLM agents built for control. Designed for real-world use. Deployed in minutes.☆17,190Updated this week
- ☆2,080Updated 9 months ago
- A system for agentic LLM-powered data processing and ETL☆3,355Updated last week
- [CVPR 2025] A Comprehensive Benchmark for Document Parsing and Evaluation☆1,314Updated 2 weeks ago
- Open Source Machine Learning Research Platform designed for frontier AI/ML workflows. Local, on-prem, or in the cloud. Open source.☆4,724Updated this week
- ☆1,374Updated last week