datalab-to / chandra
OCR model that handles complex tables, forms, handwriting with full layout.
☆4,787 Updated 3 weeks ago
Alternatives and similar repositories for chandra
Users who are interested in chandra are comparing it to the repositories listed below.
- The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025. ☆8,798 Updated last month
- RAG on Everything with LEANN. Enjoy 97% storage savings while running a fast, accurate, and 100% private RAG application on your personal… ☆9,882 Updated this week
- AirLLM 70B inference with single 4GB GPU ☆2,573 Updated 5 months ago
- 📑 PageIndex: Document Index for Vectorless, Reasoning-based RAG ☆14,072 Updated 2 weeks ago
- Legacy Python library for Agentic Document Extraction (ADE). Use the landingai-ade library for all new projects. ☆2,354 Updated this week
- A fully open-source, LlamaCloud-backed alternative to NotebookLM ☆1,764 Updated 5 months ago
- Multilingual Document Layout Parsing in a Single Vision-Language Model ☆7,139 Updated last month
- Python package and backend for the Elysia platform app. ☆1,876 Updated last week
- ☆2,276 Updated 2 months ago
- A curated list of 100+ libraries and frameworks for AI engineers building with LLMs ☆2,531 Updated 2 months ago
- A local, offline (after initial setup), portable OCR tool that can process images and PDF files, using DeepSeek-OCR AI (running dire… ☆666 Updated 2 weeks ago
- 🦛 CHONK docs with Chonkie ✨: the lightweight ingestion library for fast, efficient and robust RAG pipelines ☆3,727 Updated this week
- Collection of awesome LLM apps with AI Agents and RAG using OpenAI, Anthropic, Gemini and open-source models. ☆94 Updated 5 months ago
- ContextGem: Effortless LLM extraction from documents ☆1,777 Updated last month
- ☆2,112 Updated 10 months ago
- Tensorlake is a Document Ingestion API and a serverless platform for building data processing and orchestration APIs ☆878 Updated this week
- AnyCrawl 🚀: A Node.js/TypeScript crawler that turns websites into LLM-ready data and extracts structured SERP results from Google/Bing/B… ☆2,706 Updated last week
- Context Data Platform for AI Agents ☆2,906 Updated this week
- An on-premises, OCR-free unstructured data extraction, markdown conversion and benchmarking toolkit. (https://idp-leaderboard.org/) ☆1,851 Updated 5 months ago
- A minimal Agentic RAG built with LangGraph: learn Retrieval-Augmented Generation Agents in minutes. ☆1,989 Updated 3 weeks ago
- ColiVara is a suite of services that allows you to store, search, and retrieve documents based on their visual embedding. ColiVara has st… ☆1,448 Updated 9 months ago
- Examples, end-2-end tutorials and apps built using Liquid AI Foundational Models (LFM) and the LEAP SDK ☆1,070 Updated last week
- Paper2Agent is a multi-agent AI system that automatically transforms research papers into interactive AI agents with minimal human input. ☆1,975 Updated last month
- A production-ready template to kickstart your Generative AI projects with structure and scalability in mind. ☆877 Updated 7 months ago
- Extract and convert data from any document, images, pdfs, word doc, ppt or URL into multiple formats (Markdown, JSON, CSV, HTML) with int… ☆1,335 Updated 3 months ago
- A quick vibe-coded app for DeepSeek OCR ☆1,720 Updated 2 months ago
- Build, enrich, and transform datasets using AI models with no code ☆1,623 Updated 3 months ago
- xpander.ai is the runtime and control plane to build, run, and ship reliable AI agents fast and anywhere ☆856 Updated 3 months ago
- Controllable and fast Text-to-Speech for over 7000 languages! ☆323 Updated 7 months ago
- Awesome-Arabic-AI is a curated, professional-grade repository designed to be the central hub for the best open-source Arabic AI resources… ☆241 Updated last week