VikParuchuri / surya
OCR, layout analysis, reading order, table recognition in 90+ languages
☆14,240Updated this week
Related projects ⓘ
Alternatives and complementary repositories for surya
- Convert PDF to markdown quickly with high accuracy☆17,845Updated this week
- 🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.☆18,840Updated this week
- Get your documents ready for gen AI☆9,923Updated this week
- Finetune Llama 3.2, Mistral, Phi, Qwen 2.5 & Gemma LLMs 2-5x faster with 80% less memory☆18,263Updated this week
- Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/☆6,985Updated this week
- PDF to Markdown with vision models☆6,324Updated this week
- Build AI Agents with memory, knowledge, tools and reasoning. Chat with them using a beautiful Agent UI.☆15,471Updated this week
- Python scraper based on AI☆15,802Updated this week
- Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI☆15,527Updated this week
- Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sag…☆13,971Updated this week
- An open-source RAG-based tool for chatting with your documents.☆17,436Updated this week
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model☆6,053Updated this week
- An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.☆13,436Updated 3 weeks ago
- Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. …☆15,608Updated this week
- Retrieval Augmented Generation (RAG) chatbot powered by Weaviate☆6,347Updated last week
- 🔥🕷️ Crawl4AI: Open-source LLM Friendly Web Crawler & Scrapper☆16,247Updated this week
- A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。☆17,775Updated this week
- DSPy: The framework for programming—not prompting—language models☆18,885Updated this week
- The Memory layer for your AI apps☆22,875Updated this week
- RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.☆23,277Updated this week
- We write your reusable computer vision tools. 💜☆24,236Updated this week
- Automate browser-based workflows with LLMs and Computer Vision☆10,475Updated this week
- Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks☆5,648Updated 2 weeks ago
- rewind.ai x cursor.com = your AI assistant that has all the context. 24/7 screen & voice recording for the age of super intelligence. get…☆9,010Updated this week
- The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, and more.☆27,384Updated this week
- 🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using RAG 🔄.☆11,989Updated last week
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆10,734Updated last week
- Go ahead and axolotl questions☆7,930Updated this week
- Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚☆15,283Updated this week
- tiny vision language model☆5,760Updated this week