bytedance / DolphinLinks
The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.
☆5,391Updated last month
Alternatives and similar repositories for Dolphin
Users that are interested in Dolphin are comparing it to the libraries listed below
Sorting:
- ContextGem: Effortless LLM extraction from documents☆1,416Updated this week
- Multilingual Document Layout Parsing in a Single Vision-Language Model☆1,510Updated this week
- "RAG-Anything: All-in-One RAG System"☆2,385Updated last week
- An on-premises, OCR-free unstructured data extraction, markdown conversion and benchmarking toolkit. (https://idp-leaderboard.org/)☆1,601Updated last month
- Python library for Agentic Document Extraction from LandingAI☆1,719Updated last week
- ☆1,894Updated 4 months ago
- A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive vi…☆10,100Updated this week
- "AutoAgent: Fully-Automated and Zero-Code LLM Agent Framework"☆5,729Updated last month
- Colivara is a suite of services that allows you to store, search, and retrieve documents based on their visual embedding. ColiVara has st…☆1,214Updated 3 months ago
- 📄 🧠 PageIndex: Document Index System for Reasoning-based RAG☆1,139Updated this week
- Keep searching, reading webpages, reasoning until it finds the answer (or exceeding the token budget)☆4,730Updated last week
- 🦛 CHONK your texts with Chonkie ✨ — The no-nonsense RAG chunking library☆1,989Updated this week
- ☆7,294Updated this week
- 🤖 A visualization mcp contains 25+ visual charts using @antvis. Using for chart generation and data analysis.☆2,348Updated last week
- A visual playground for agentic workflows: Iterate over your agents 10x faster☆5,342Updated 3 weeks ago
- Modern Backend Framework that unifies APIs, background jobs, workflows, and AI agents into a single cohesive system with built-in observa…☆5,665Updated this week
- Eigent is the World's First Multi-agent Workforce to Unlock Your Exceptional Productivity.☆1,317Updated this week
- A MCP (Model Context Protocol) server for PowerPoint manipulation using python-pptx. This server provides tools for creating, editing, an…☆816Updated last week
- Profile-Based Long-Term Memory for AI Applications. Memobase handles user profiles, memory events, and evolving context — perfect for ch…☆1,932Updated this week
- An agentic company research tool powered by LangGraph and Tavily that conducts deep diligence on companies using a multi-agent framework.…☆1,388Updated last month
- OCRFlux is a lightweight yet powerful multimodal toolkit that significantly advances PDF-to-Markdown conversion, excelling in complex lay…☆2,077Updated last week
- ☆2,184Updated this week
- ☆3,489Updated 4 months ago
- PIKE-RAG: sPecIalized KnowledgE and Rationale Augmented Generation☆1,920Updated 3 months ago
- Vision infrastructure to turn complex documents into RAG/LLM-ready data☆2,758Updated last week
- A research prototype of a human-centered web agent☆6,868Updated this week
- Implementation of my RAG system that won all categories in Enterprise RAG Challenge 2☆1,777Updated 3 months ago
- II-Agent: a new open-source framework to build and deploy intelligent agents☆2,826Updated this week
- Official Implementation of "KBLaM: Knowledge Base augmented Language Model"☆1,343Updated last week
- Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.☆6,774Updated last month