datalab-to / suryaLinks
OCR, layout analysis, reading order, table recognition in 90+ languages
☆19,060Updated 2 months ago
Alternatives and similar repositories for surya
Users that are interested in surya are comparing it to the libraries listed below
Sorting:
- Convert PDF to markdown + JSON quickly with high accuracy☆30,780Updated this week
- OCR & Document Extraction using vision models☆12,015Updated 7 months ago
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model☆8,047Updated 11 months ago
- A Comprehensive Toolkit for High-Quality PDF Content Extraction☆9,071Updated last year
- Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks☆6,776Updated 3 weeks ago
- File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.☆7,256Updated 10 months ago
- Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/☆9,609Updated 8 months ago
- Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections.☆2,811Updated 10 months ago
- Toolkit for linearizing PDFs for LLM datasets/training☆16,582Updated last week
- Implementation of Nougat Neural Optical Understanding for Academic Documents☆9,779Updated 10 months ago
- Python scraper based on AI☆22,142Updated 2 weeks ago
- Get your documents ready for gen AI☆49,476Updated this week
- The unified stack for running systems of agents: framework, runtime and control plane.☆36,666Updated this week
- An open-source RAG-based tool for chatting with your documents.☆24,833Updated 6 months ago
- Automate browser based workflows with AI☆20,054Updated this week
- Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean…☆13,544Updated this week
- OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched☆32,206Updated 2 weeks ago
- AI app store powered by 24/7 desktop history. open source | 100% local | dev friendly | 24/7 screen, mic recording☆16,370Updated last month
- Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Dow…☆7,363Updated this week
- Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.☆51,675Updated this week
- Retrieval Augmented Generation (RAG) chatbot powered by Weaviate☆7,504Updated 5 months ago
- The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, MCP compatibility, and more.☆52,990Updated this week
- RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry☆4,309Updated last month
- Instant voice cloning by MIT and MyShell. Audio foundation model.☆35,755Updated 8 months ago
- Improved file parsing for LLM’s☆3,147Updated last year
- A simple screen parsing tool towards pure vision based GUI agent☆24,140Updated 3 months ago
- tiny vision language model☆9,163Updated last month
- RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to creat…☆70,955Updated this week
- We write your reusable computer vision tools. 💜☆36,270Updated this week
- Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek, Qwen, Llama, Gemma, TTS 2x faster with 70% less VRAM.☆50,491Updated this week