datalab-to / suryaLinks
OCR, layout analysis, reading order, table recognition in 90+ languages
☆17,882Updated this week
Alternatives and similar repositories for surya
Users that are interested in surya are comparing it to the libraries listed below
Sorting:
- Convert PDF to markdown + JSON quickly with high accuracy☆26,856Updated this week
- OCR & Document Extraction using vision models☆11,620Updated 2 months ago
- Toolkit for linearizing PDFs for LLM datasets/training☆13,346Updated this week
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model☆7,745Updated 5 months ago
- Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/☆9,004Updated 2 months ago
- A Comprehensive Toolkit for High-Quality PDF Content Extraction☆8,236Updated 6 months ago
- 🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.☆43,735Updated this week
- Python tool for converting files and office documents to Markdown.☆69,708Updated last month
- AI app store powered by 24/7 desktop history. open source | 100% local | dev friendly | 24/7 screen, mic recording☆15,366Updated last week
- A simple screen parsing tool towards pure vision based GUI agent☆22,933Updated 4 months ago
- Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections.☆2,714Updated 5 months ago
- SOTA Open Source TTS☆22,510Updated last week
- File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.☆7,045Updated 5 months ago
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)☆17,031Updated 3 weeks ago
- Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks☆6,656Updated last month
- Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your p…☆48,375Updated this week
- Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean…☆12,035Updated this week
- Faster Whisper transcription with CTranslate2☆17,260Updated last month
- 🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using RAG 🔄.☆19,653Updated 3 months ago
- Implementation of Nougat Neural Optical Understanding for Academic Documents☆9,546Updated 5 months ago
- Using GPT to parse PDF☆3,483Updated 3 months ago
- A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity…☆11,757Updated last week
- An open-source RAG-based tool for chatting with your documents.☆22,871Updated 3 weeks ago
- ☆8,552Updated last year
- No-code LLM Platform to launch APIs and ETL Pipelines to structure unstructured documents☆5,506Updated this week
- A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。☆40,963Updated this week
- Python scraper based on AI☆20,595Updated 3 weeks ago
- 📄 Awesome OCR multiple programing languages toolkits based on ONNXRuntime, OpenVINO, PaddlePaddle and PyTorch.☆4,708Updated this week
- ML-powered speech recognition directly in your browser☆2,999Updated 10 months ago
- Zero-Shot Speech Editing and Text-to-Speech in the Wild☆8,343Updated 4 months ago