datalab-to / chandraLinks
OCR model that handles complex tables, forms, handwriting with full layout.
☆4,787Updated 3 weeks ago
Alternatives and similar repositories for chandra
Users that are interested in chandra are comparing it to the libraries listed below
Sorting:
- AirLLM 70B inference with single 4GB GPU☆2,573Updated 5 months ago
- A quick vibe coded app for deepseek OCR☆1,720Updated 2 months ago
- An local, offline (after initial setup), portable OCR software that can process images and PDF files, using DeepSeek-OCR AI (running dire…☆666Updated 2 weeks ago
- Context Data Platform for AI Agents☆2,906Updated this week
- Build, enrich, and transform datasets using AI models with no code☆1,623Updated 3 months ago
- The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.☆8,774Updated last month
- On-device TTS model by Neuphonic☆4,768Updated last week
- A curated list of 100+ libraries and frameworks for AI engineers building with LLMs☆2,531Updated 2 months ago
- Python package and backend for the Elysia platform app.☆1,876Updated last week
- ☆2,271Updated 2 months ago
- Multilingual Document Layout Parsing in a Single Vision-Language Model☆7,139Updated last month
- ContextGem: Effortless LLM extraction from documents☆1,777Updated last month
- Camera monitoring with VLM☆1,314Updated last week
- RAG on Everything with LEANN. Enjoy 97% storage savings while running a fast, accurate, and 100% private RAG application on your personal…☆9,882Updated this week
- AnyCrawl 🚀: A Node.js/TypeScript crawler that turns websites into LLM-ready data and extracts structured SERP results from Google/Bing/B…☆2,706Updated last week
- Extract and convert data from any document, images, pdfs, word doc, ppt or URL into multiple formats (Markdown, JSON, CSV, HTML) with int…☆1,335Updated 3 months ago
- Local persistent memory store for LLM applications including claude desktop, github copilot, codex, antigravity, etc.☆3,217Updated 2 weeks ago
- Examples, end-2-end tutorials and apps built using Liquid AI Foundational Models (LFM) and the LEAP SDK☆1,070Updated last week
- Colivara is a suite of services that allows you to store, search, and retrieve documents based on their visual embedding. ColiVara has st…☆1,448Updated 9 months ago
- A minimal Agentic RAG built with LangGraph — learn Retrieval-Augmented Generation Agents in minutes.☆1,989Updated 3 weeks ago
- An on-premises, OCR-free unstructured data extraction, markdown conversion and benchmarking toolkit. (https://idp-leaderboard.org/)☆1,851Updated 5 months ago
- 🔥 Visual workflow builder for AI agents powered by Firecrawl - drag-and-drop web scraping pipelines with real-time execution☆2,079Updated 3 months ago
- 🦛 CHONK docs with Chonkie ✨ — The lightweight ingestion library for fast, efficient and robust RAG pipelines☆3,727Updated this week
- Tensorlake is a Document Ingestion API and a serverless platform for building data processing and orchestration APIs☆878Updated this week
- Legacy Python library for Agentic Document Extraction (ADE). Use the landingai-ade library for all new projects.☆2,354Updated this week
- ☆995Updated last month
- ☆1,089Updated 3 months ago
- PageLM is a community driven version of NotebookLM & a education platform that transforms study materials into interactive resources like…☆1,361Updated 2 months ago
- 📑 PageIndex: Document Index for Vectorless, Reasoning-based RAG☆14,072Updated 2 weeks ago
- A fully open-source, LlamaCloud-backed alternative to NotebookLM☆1,764Updated 5 months ago