bytedance / DolphinLinks
The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.
☆5,811Updated 3 weeks ago
Alternatives and similar repositories for Dolphin
Users that are interested in Dolphin are comparing it to the libraries listed below
Sorting:
- "RAG-Anything: All-in-One RAG System"☆5,051Updated last week
- ContextGem: Effortless LLM extraction from documents☆1,500Updated 2 weeks ago
- 📄🧠 PageIndex: Document Index for Reasoning-based RAG☆2,590Updated this week
- An on-premises, OCR-free unstructured data extraction, markdown conversion and benchmarking toolkit. (https://idp-leaderboard.org/)☆1,742Updated 3 weeks ago
- Multilingual Document Layout Parsing in a Single Vision-Language Model☆4,466Updated 2 weeks ago
- 🦛 CHONK your texts with Chonkie ✨ — The no-nonsense RAG chunking library☆2,391Updated this week
- Python library for Agentic Document Extraction from LandingAI☆1,910Updated this week
- ☆2,031Updated 6 months ago
- 100+ Fine-tuning LLM Notebooks on Google Colab, Kaggle, and more.☆3,665Updated this week
- Eigent is the World's First Multi-agent Workforce to Unlock Your Exceptional Productivity.☆1,990Updated this week
- Implementation of my RAG system that won all categories in Enterprise RAG Challenge 2☆1,874Updated 4 months ago
- The most accurate document search and store for building AI apps☆3,235Updated last week
- A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive vi…☆15,376Updated this week
- Open-Source AI Presentation Generator and API (Gamma, Beautiful AI, Decktopus Alternative)☆2,140Updated this week
- Colivara is a suite of services that allows you to store, search, and retrieve documents based on their visual embedding. ColiVara has st…☆1,264Updated 4 months ago
- II-Agent: a new open-source framework to build and deploy intelligent agents☆2,877Updated 3 weeks ago
- "AutoAgent: Fully-Automated and Zero-Code LLM Agent Framework"☆7,316Updated 3 weeks ago
- DeepResearchAgent is a hierarchical multi-agent system designed not only for deep research tasks but also for general-purpose task solvin…☆2,511Updated last month
- Vision infrastructure to turn complex documents into RAG/LLM-ready data☆2,858Updated last month
- Official Implementation of "KBLaM: Knowledge Base augmented Language Model"☆1,405Updated 2 weeks ago
- LLM agents built for control. Designed for real-world use. Deployed in minutes.☆12,388Updated this week
- ☆8,866Updated 3 weeks ago
- Embedding Atlas is a tool that provides interactive visualizations for large embeddings. It allows you to visualize, cross-filter, and se…☆3,863Updated this week
- Open Source Application for Advanced LLM + Diffusion Engineering: interact, train, fine-tune, and evaluate large language models on your …☆4,321Updated last week
- Document (PDF, Word, PPTX ...) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents…☆2,883Updated last week
- Task-Aware Agent-driven Prompt Optimization Framework☆3,598Updated last month
- A system for agentic LLM-powered data processing and ETL☆2,889Updated last week
- Profile-Based Long-Term Memory for AI Applications. Memobase handles user profiles, memory events, and evolving context — perfect for ch…☆2,179Updated this week
- Easily deployable and scalable backend server that efficiently converts various document formats (pdf, docx, pptx, html, images, etc) int…☆693Updated 6 months ago
- ☆7,773Updated 2 weeks ago