bytedance / Dolphin
The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.
☆5,719 Updated this week
Alternatives and similar repositories for Dolphin
Users who are interested in Dolphin are comparing it to the libraries listed below.
- "RAG-Anything: All-in-One RAG System" ☆4,383 Updated 2 weeks ago
- Multilingual Document Layout Parsing in a Single Vision-Language Model ☆3,914 Updated last week
- ContextGem: Effortless LLM extraction from documents ☆1,477 Updated last week
- An on-premises, OCR-free unstructured data extraction, markdown conversion and benchmarking toolkit. (https://idp-leaderboard.org/) ☆1,680 Updated last week
- Vision infrastructure to turn complex documents into RAG/LLM-ready data ☆2,817 Updated 2 weeks ago
- Python library for Agentic Document Extraction from LandingAI ☆1,846 Updated this week
- The most accurate document search and store for building AI apps ☆3,152 Updated last week
- 📄🧠 PageIndex: Document Index for Reasoning-based RAG ☆2,148 Updated this week
- 🦛 CHONK your texts with Chonkie ✨ — The no-nonsense RAG chunking library ☆2,209 Updated this week
- Colivara is a suite of services that allows you to store, search, and retrieve documents based on their visual embedding. ColiVara has st… ☆1,255 Updated 4 months ago
- "AutoAgent: Fully-Automated and Zero-Code LLM Agent Framework" ☆5,824 Updated 2 months ago
- A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive vi… ☆14,065 Updated this week
- Document (PDF, Word, PPTX ...) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents… ☆2,815 Updated last month
- ☆2,003 Updated 5 months ago
- Video-based AI memory library. Store millions of text chunks in MP4 files with lightning-fast semantic search. No database needed. ☆8,318 Updated 2 months ago
- ☆8,516 Updated last week
- Official Implementation of "KBLaM: Knowledge Base augmented Language Model" ☆1,387 Updated 2 weeks ago
- 100+ Fine-tuning LLM Notebooks on Google Colab, Kaggle, and more. ☆3,523 Updated last week
- Keep searching, reading webpages, and reasoning until it finds the answer (or exceeds the token budget) ☆4,798 Updated last week
- Data transformation framework for AI. Ultra performant, with incremental processing. ☆2,709 Updated last week
- Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python. ☆6,852 Updated last month
- Open-Source AI Presentation Generator and API (Gamma, Beautiful AI, Decktopus Alternative) ☆1,825 Updated this week
- Task-Aware Agent-driven Prompt Optimization Framework ☆3,510 Updated 3 weeks ago
- ☆2,699 Updated 4 months ago
- SOTA search powered LLM ☆3,529 Updated 4 months ago
- II-Agent: a new open-source framework to build and deploy intelligent agents ☆2,859 Updated last week
- Embedding Atlas is a tool that provides interactive visualizations for large embeddings. It allows you to visualize, cross-filter, and se… ☆3,252 Updated this week
- PIKE-RAG: sPecIalized KnowledgE and Rationale Augmented Generation ☆2,006 Updated 3 months ago
- Eigent is the World's First Multi-agent Workforce to Unlock Your Exceptional Productivity. ☆1,638 Updated this week
- Open Source Application for Advanced LLM + Diffusion Engineering: interact, train, fine-tune, and evaluate large language models on your … ☆4,164 Updated this week