datalab-to / surya
OCR, layout analysis, reading order, table recognition in 90+ languages
☆18,730 Updated last week
Alternatives and similar repositories for surya
Users who are interested in surya compare it to the libraries listed below.
- Convert PDF to markdown + JSON quickly with high accuracy ☆29,230 Updated last week
- OCR & Document Extraction using vision models ☆11,882 Updated 5 months ago
- Toolkit for linearizing PDFs for LLM datasets/training ☆14,236 Updated last week
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model ☆7,893 Updated 8 months ago
- Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/ ☆9,299 Updated 5 months ago
- tiny vision language model ☆8,814 Updated 3 weeks ago
- Python scraper based on AI ☆21,594 Updated 2 weeks ago
- Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows. ☆46,581 Updated this week
- A Comprehensive Toolkit for High-Quality PDF Content Extraction ☆8,788 Updated 9 months ago
- Document (PDF, Word, PPTX ...) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents… ☆2,900 Updated last month
- Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections. ☆2,760 Updated 7 months ago
- Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks ☆6,709 Updated 4 months ago
- An open-source RAG-based tool for chatting with your documents. ☆24,520 Updated 3 months ago
- File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs. ☆7,198 Updated 8 months ago
- 🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN ☆54,747 Updated this week
- Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI ☆26,837 Updated this week
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale. ☆12,840 Updated last week
- A modular graph-based Retrieval-Augmented Generation (RAG) system ☆28,652 Updated last week
- ⚡ Easiest no code web data extraction platform • Instantly turn any website into API or spreadsheet ⚡ ☆13,719 Updated this week
- Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean… ☆12,980 Updated this week
- We write your reusable computer vision tools. 💜 ☆35,604 Updated last week
- Automate browser-based workflows with LLMs and Computer Vision ☆14,615 Updated this week
- RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to creat… ☆65,898 Updated last week
- Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sag… ☆30,069 Updated this week
- Structured data extraction and instruction calling with ML, LLM and Vision LLM ☆5,015 Updated this week
- SOTA Open Source TTS ☆23,126 Updated this week
- 🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT) ☆6,646 Updated 3 months ago
- Get your documents ready for gen AI ☆41,754 Updated this week
- Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM. ☆47,063 Updated this week
- MiniCPM-V 4.5: A GPT-4o Level MLLM for Single Image, Multi Image and High-FPS Video Understanding on Your Phone ☆22,085 Updated 3 weeks ago