datalab-to/surya

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/datalab-to/surya)

datalab-to / surya

OCR, layout analysis, reading order, table recognition in 90+ languages

☆21,147

Alternatives and similar repositories for surya

Users that are interested in surya are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

datalab-to / marker
View on GitHub
Convert PDF to markdown + JSON quickly with high accuracy
☆37,813Updated this week
Ucas-HaoranWei / GOT-OCR2.0
View on GitHub
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
☆8,157Feb 10, 2025Updated last year
getomni-ai / zerox
View on GitHub
OCR & Document Extraction using vision models
☆12,259May 20, 2025Updated last year
Cinnamon / kotaemon
View on GitHub
An open-source RAG-based tool for chatting with your documents.
☆25,587Jul 14, 2026Updated last week
docling-project / docling
View on GitHub
Get your documents ready for gen AI
☆63,726Updated this week
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
allenai / olmocr
View on GitHub
Toolkit for linearizing PDFs for LLM datasets/training
☆19,171Mar 25, 2026Updated 4 months ago
agno-agi / agno
View on GitHub
Build, run, and manage agent platforms.
☆41,409Updated this week
opendatalab / MinerU
View on GitHub
Transforms complex documents like PDFs and Office docs into LLM-ready markdown/JSON for your Agentic workflows.
☆75,630Updated this week
stanford-oval / storm
View on GitHub
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
☆30,298Sep 30, 2025Updated 9 months ago
PaddlePaddle / PaddleOCR
View on GitHub
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/…
☆86,192Updated this week
myshell-ai / OpenVoice
View on GitHub
Instant voice cloning by MIT and MyShell. Audio foundation model.
☆37,012Apr 19, 2025Updated last year
roboflow / supervision
View on GitHub
We write your reusable computer vision tools. 💜
☆48,352Updated this week
fishaudio / fish-speech
View on GitHub
SOTA Open Source TTS
☆31,364Jun 9, 2026Updated last month
OpenBMB / MiniCPM-V
View on GitHub
A Pocket-Sized MLLM for Ultra-Efficient Image and Video Understanding on Your Phone
☆25,981Updated this week
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
unslothai / unsloth
View on GitHub
Unsloth is a local UI for training and running Gemma 4, Qwen3.6, DeepSeek, Kimi, GLM and other models.
☆68,832Updated this week
facebookresearch / nougat
View on GitHub
Implementation of Nougat Neural Optical Understanding for Academic Documents
☆10,049Feb 21, 2025Updated last year
infiniflow / ragflow
View on GitHub
RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to creat…
☆85,907Updated this week
unclecode / crawl4ai
View on GitHub
🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN
☆74,835Updated this week
Unstructured-IO / unstructured
View on GitHub
Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean…
☆15,192Updated this week
openinterpreter / openinterpreter
View on GitHub
A coding agent for open models like Kimi K3
☆67,239Jul 18, 2026Updated last week
stanfordnlp / dspy
View on GitHub
DSPy: The framework for programming—not prompting—language models
☆36,346Updated this week
Skyvern-AI / skyvern
View on GitHub
Automate browser based workflows with AI
☆22,581Updated this week
run-llama / llama_index
View on GitHub
LlamaIndex is the leading document agent and OCR platform
☆51,067Updated this week
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
ScrapeGraphAI / Scrapegraph-ai
View on GitHub
Python scraper based on AI
☆28,610Updated this week
QuivrHQ / MegaParse
View on GitHub
File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.
☆7,405Feb 21, 2025Updated last year
mem0ai / mem0
View on GitHub
Universal memory layer for AI Agents
☆61,613Updated this week
opendatalab / PDF-Extract-Kit
View on GitHub
A Comprehensive Toolkit for High-Quality PDF Content Extraction
☆9,806Jan 3, 2025Updated last year
vanna-ai / vanna
View on GitHub
🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using Agentic Retrieval 🔄.
☆23,812Feb 2, 2026Updated 5 months ago
OpenHands / OpenHands
View on GitHub
🙌 OpenHands: AI-Driven Development
☆81,960Updated this week
ItzCrazyKns / Vane
View on GitHub
Vane is an AI-powered answering engine.
☆35,856Apr 11, 2026Updated 3 months ago
microsoft / graphrag
View on GitHub
A modular graph-based Retrieval-Augmented Generation (RAG) system
☆34,808Updated this week
assafelovic / gpt-researcher
View on GitHub
An autonomous agent that conducts deep research on any data using any LLM providers
☆28,620Jul 18, 2026Updated last week
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
Mintplex-Labs / anything-llm
View on GitHub
Stop renting your intelligence. Own it with AnythingLLM. Everything you need for a powerful local-first agent experience
☆63,784Updated this week
lobehub / lobehub
View on GitHub
🤯 LobeHub is your Chief Agent Operator, organizing your agents into 7×24 operations by hiring, scheduling, and reporting on your entire …
☆80,772Updated this week
khoj-ai / khoj
View on GitHub
Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. …
☆35,967Jun 24, 2026Updated last month
FlowiseAI / Flowise
View on GitHub
Build AI Agents, Visually
☆54,889Updated this week
microsoft / autogen
View on GitHub
A programming framework for agentic AI
☆59,939Apr 15, 2026Updated 3 months ago
BerriAI / litellm
View on GitHub
The fastest, litest AI Gateway. Rust core with Python SDK. Call 100+ LLM APIs in OpenAI (or native) format with cost tracking, guardrails…
☆54,606Updated this week
jina-ai / reader
View on GitHub
Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/
☆11,725May 22, 2026Updated 2 months ago