datalab-to / suryaLinks
OCR, layout analysis, reading order, table recognition in 90+ languages
☆17,641Updated last week
Alternatives and similar repositories for surya
Users that are interested in surya are comparing it to the libraries listed below
Sorting:
- Convert PDF to markdown + JSON quickly with high accuracy☆25,975Updated this week
- Memory for AI Agents; Announcing OpenMemory MCP - local and secure memory management.☆34,513Updated this week
- OCR & Document Extraction using vision models☆11,350Updated last month
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model☆7,656Updated 4 months ago
- Toolkit for linearizing PDFs for LLM datasets/training☆12,940Updated this week
- 🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.☆40,227Updated this week
- Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI☆22,357Updated 3 weeks ago
- Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/☆8,862Updated last month
- Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sag…☆24,159Updated this week
- Python scraper based on AI☆20,007Updated this week
- A modular graph-based Retrieval-Augmented Generation (RAG) system☆25,911Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆49,721Updated this week
- An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.☆24,598Updated last month
- Distribute and run LLMs with a single file.☆22,633Updated last month
- A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。☆35,508Updated this week
- We write your reusable computer vision tools. 💜☆26,768Updated this week
- Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your p…☆45,908Updated this week
- The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, MCP compatibility, and more.☆45,460Updated this week
- A simple screen parsing tool towards pure vision based GUI agent☆22,426Updated 2 months ago
- Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks☆6,586Updated last week
- Implementation of Nougat Neural Optical Understanding for Academic Documents☆9,493Updated 3 months ago
- AI app store powered by 24/7 desktop history. open source | 100% local | dev friendly | 24/7 screen, mic recording☆15,073Updated 2 weeks ago
- Gen-AI Chat for Teams - Think ChatGPT if it had access to your team's unique knowledge.☆13,011Updated this week
- A Comprehensive Toolkit for High-Quality PDF Content Extraction☆7,865Updated 5 months ago
- Automate browser-based workflows with LLMs and Computer Vision☆13,584Updated this week
- MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone☆19,629Updated last week
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.☆38,731Updated 2 weeks ago
- LLM based autonomous agent that conducts deep local and web research on any topic and generates a long report with citations.☆21,953Updated this week
- Get your documents ready for gen AI☆31,854Updated last week
- 🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN☆45,591Updated last week