datalab-to / chandraLinks
OCR model that handles complex tables, forms, handwriting with full layout.
☆3,137Updated 3 weeks ago
Alternatives and similar repositories for chandra
Users that are interested in chandra are comparing it to the libraries listed below
Sorting:
- ContextGem: Effortless LLM extraction from documents☆1,744Updated last month
- Python library for Agentic Document Extraction from LandingAI☆2,301Updated 3 weeks ago
- The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.☆7,910Updated this week
- A fully open-source, LlamaCloud-backed alternative to NotebookLM☆1,663Updated 4 months ago
- Build, enrich, and transform datasets using AI models with no code☆1,595Updated last month
- Python package and backend for the Elysia platform app.☆1,835Updated last week
- ☆2,213Updated 2 weeks ago
- AnyCrawl 🚀: A Node.js/TypeScript crawler that turns websites into LLM-ready data and extracts structured SERP results from Google/Bing/B…☆2,489Updated last week
- Paper2Agent is a multi-agent AI system that automatically transforms research papers into interactive AI agents with minimal human input.☆1,854Updated this week
- 🔥 Open Source Perplexity like AI search engine with real-time citations, streaming responses, and live data powered by Firecrawl☆1,758Updated 3 months ago
- OpenMemory gives AI agents real long-term memory. Not vector search. Not RAG. Actual memory.☆2,506Updated this week
- Extract and convert data from any document, images, pdfs, word doc, ppt or URL into multiple formats (Markdown, JSON, CSV, HTML) with int…☆1,082Updated last month
- xpander.ai is the runtime and control plane to build, run, and ship reliable AI agents fast and anywhere☆775Updated last month
- 🦛 CHONK docs with Chonkie ✨ — The lightweight ingestion library for fast, efficient and robust RAG pipelines☆3,350Updated this week
- ☆1,057Updated last month
- Tensorlake is a Document Ingestion API and a serverless platform for building data processing and orchestration APIs☆859Updated this week
- 📑 PageIndex: Document Index for Reasoning-based RAG☆4,305Updated last week
- 🔥 Visual workflow builder for AI agents powered by Firecrawl - drag-and-drop web scraping pipelines with real-time execution☆1,945Updated last month
- Colivara is a suite of services that allows you to store, search, and retrieve documents based on their visual embedding. ColiVara has st…☆1,404Updated 7 months ago
- A curated list of 100+ libraries and frameworks for AI engineers building with LLMs☆2,309Updated last month
- RAGLight is a modular framework for Retrieval-Augmented Generation (RAG). It makes it easy to plug in different LLMs, embeddings, and vec…☆613Updated this week
- Semantic search and document parsing tools for the command line☆1,481Updated 3 weeks ago
- Self-hosted, multi-user API that drops bots into Google Meet for real-time transcripts.☆1,564Updated last week
- ☆429Updated last month
- RAG on Everything with LEANN. Enjoy 97% storage savings while running a fast, accurate, and 100% private RAG application on your personal…☆4,995Updated this week
- The most accurate document search and store for building AI apps☆3,417Updated this week
- Multilingual Document Layout Parsing in a Single Vision-Language Model☆5,872Updated last month
- A production-ready template to kickstart your Generative AI projects with structure and scalability in mind.☆792Updated 6 months ago
- An on-premises, OCR-free unstructured data extraction, markdown conversion and benchmarking toolkit. (https://idp-leaderboard.org/)☆1,817Updated 3 months ago
- 100+ Fine-tuning Tutorial Notebooks on Google Colab, Kaggle and more.☆3,894Updated this week