datalab-to / chandraLinks
OCR model that handles complex tables, forms, handwriting with full layout.
☆4,260Updated 3 weeks ago
Alternatives and similar repositories for chandra
Users that are interested in chandra are comparing it to the libraries listed below
Sorting:
- ContextGem: Effortless LLM extraction from documents☆1,750Updated 3 weeks ago
- Multilingual Document Layout Parsing in a Single Vision-Language Model☆5,952Updated last week
- An on-premises, OCR-free unstructured data extraction, markdown conversion and benchmarking toolkit. (https://idp-leaderboard.org/)☆1,828Updated 4 months ago
- AnyCrawl 🚀: A Node.js/TypeScript crawler that turns websites into LLM-ready data and extracts structured SERP results from Google/Bing/B…☆2,503Updated last week
- 📑 PageIndex: Document Index for Reasoning-based RAG☆4,506Updated 2 weeks ago
- A fully open-source, LlamaCloud-backed alternative to NotebookLM☆1,710Updated 4 months ago
- ☆2,250Updated last month
- A curated list of 100+ libraries and frameworks for AI engineers building with LLMs☆2,353Updated last month
- Build, enrich, and transform datasets using AI models with no code☆1,612Updated 2 months ago
- The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.☆8,443Updated 3 weeks ago
- Python package and backend for the Elysia platform app.☆1,852Updated 3 weeks ago
- Paper2Agent is a multi-agent AI system that automatically transforms research papers into interactive AI agents with minimal human input.☆1,888Updated 3 weeks ago
- 🔥 Visual workflow builder for AI agents powered by Firecrawl - drag-and-drop web scraping pipelines with real-time execution☆1,988Updated 2 months ago
- Extract and convert data from any document, images, pdfs, word doc, ppt or URL into multiple formats (Markdown, JSON, CSV, HTML) with int…☆1,114Updated 2 months ago
- Tensorlake is a Document Ingestion API and a serverless platform for building data processing and orchestration APIs☆872Updated this week
- Python library for Agentic Document Extraction from LandingAI☆2,315Updated 3 weeks ago
- ☆2,080Updated 9 months ago
- "RAG-Anything: All-in-One RAG Framework"☆11,885Updated last week
- ☆1,077Updated 2 months ago
- Open-Source AI Presentation Generator and API (Gamma, Beautiful AI, Decktopus Alternative)☆3,547Updated this week
- Colivara is a suite of services that allows you to store, search, and retrieve documents based on their visual embedding. ColiVara has st…☆1,410Updated 8 months ago
- A minimal Agentic RAG built with LangGraph — learn Retrieval-Augmented Generation Agents in minutes.☆1,354Updated 3 weeks ago
- 🔥 Open Source Perplexity like AI search engine with real-time citations, streaming responses, and live data powered by Firecrawl☆1,776Updated 4 months ago
- Data platform for context engineering. Context data platform that stores, observes and learns. Join the community❤️: https://discord.acon…☆2,173Updated this week
- Implementation of 17+ agentic architectures designed for practical use across different stages of AI system development.☆2,211Updated 3 months ago
- Local persistent memory store for LLM applications including claude desktop, github copilot, codex, antigravity, etc.☆2,828Updated this week
- 🦛 CHONK docs with Chonkie ✨ — The lightweight ingestion library for fast, efficient and robust RAG pipelines☆3,443Updated this week
- Semantic search and document parsing tools for the command line☆1,512Updated last month
- Running Docling as an API service☆1,094Updated 3 weeks ago
- Communicate with an LLM provider using a single interface☆1,534Updated this week