datalab-to / chandraLinks
OCR model that handles complex tables, forms, handwriting with full layout.
☆2,821Updated last week
Alternatives and similar repositories for chandra
Users that are interested in chandra are comparing it to the libraries listed below
Sorting:
- The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.☆7,787Updated 2 weeks ago
- Build, enrich, and transform datasets using AI models with no code☆1,564Updated last month
- A fully open-source, LlamaCloud-backed alternative to NotebookLM☆1,615Updated 3 months ago
- Python library for Agentic Document Extraction from LandingAI☆2,154Updated last month
- ContextGem: Effortless LLM extraction from documents☆1,718Updated last week
- Python package and backend for the Elysia platform app.☆1,814Updated this week
- ☆2,156Updated 2 weeks ago
- 📑 PageIndex: Document Index for Reasoning-based RAG☆3,997Updated last week
- Tensorlake is a Document Ingestion API and a serverless platform for building data processing and orchestration APIs☆753Updated this week
- ☆427Updated last month
- xpander.ai is the runtime and control plane to build, run, and ship reliable AI agents fast and anywhere☆772Updated last week
- AnyCrawl 🚀: A Node.js/TypeScript crawler that turns websites into LLM-ready data and extracts structured SERP results from Google/Bing/B…☆2,438Updated this week
- Multilingual Document Layout Parsing in a Single Vision-Language Model☆5,684Updated 3 weeks ago
- 🔥 Open Source Perplexity like AI search engine with real-time citations, streaming responses, and live data powered by Firecrawl☆1,734Updated 3 months ago
- Paper2Agent is a multi-agent AI system that automatically transforms research papers into interactive AI agents with minimal human input.☆1,809Updated this week
- Colivara is a suite of services that allows you to store, search, and retrieve documents based on their visual embedding. ColiVara has st…☆1,370Updated 6 months ago
- RAGLight is a modular framework for Retrieval-Augmented Generation (RAG). It makes it easy to plug in different LLMs, embeddings, and vec…☆607Updated 3 weeks ago
- Communicate with an LLM provider using a single interface☆1,365Updated this week
- Open-Source Memory Engine for LLMs, AI Agents & Multi-Agent Systems☆4,380Updated last week
- 🦛 CHONK docs with Chonkie ✨ — The lightweight ingestion library for fast, efficient and robust RAG pipelines☆3,203Updated last week
- Semantic search and document parsing tools for the command line☆1,455Updated last week
- A curated list of 100+ libraries and frameworks for AI engineers building with LLMs☆2,206Updated last week
- A production-ready template to kickstart your Generative AI projects with structure and scalability in mind.☆684Updated 5 months ago
- RAG on Everything with LEANN. Enjoy 97% storage savings while running a fast, accurate, and 100% private RAG application on your personal…☆4,379Updated last week
- Add long-term memory to any AI in minutes. Self-hosted, open, and framework-free.☆1,889Updated last week
- An on-premises, OCR-free unstructured data extraction, markdown conversion and benchmarking toolkit. (https://idp-leaderboard.org/)☆1,801Updated 2 months ago
- ☆1,024Updated last month
- Extract and convert data from any document, images, pdfs, word doc, ppt or URL into multiple formats (Markdown, JSON, CSV, HTML) with int…☆1,027Updated 3 weeks ago
- 🔥 Visual workflow builder for AI agents powered by Firecrawl - drag-and-drop web scraping pipelines with real-time execution☆1,880Updated last month
- Model-agnostic plug-n-play LangChain/LangGraph agents powered entirely by MCP tools over HTTP/SSE.☆745Updated last month