allenai / olmocrLinks
Toolkit for linearizing PDFs for LLM datasets/training
☆13,196Updated this week
Alternatives and similar repositories for olmocr
Users that are interested in olmocr are comparing it to the libraries listed below
Sorting:
- OCR & Document Extraction using vision models☆11,543Updated last month
- Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.☆6,490Updated this week
- Convert PDF to markdown + JSON quickly with high accuracy☆26,471Updated this week
- 🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.☆42,825Updated this week
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model☆7,716Updated 5 months ago
- File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.☆6,560Updated 4 months ago
- A simple screen parsing tool towards pure vision based GUI agent☆22,605Updated 3 months ago
- KAG is a logical form-guided reasoning and retrieval framework based on OpenSPG engine and LLMs. It is used to build logical reasoning a…☆7,438Updated last week
- ☆6,653Updated last month
- The python library for real-time communication☆4,115Updated last week
- A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。☆38,938Updated this week
- The Open All-in-One Multimodal AI Agent Stack connecting Cutting-edge AI Models and Agent Infra.☆15,126Updated this week
- OCR, layout analysis, reading order, table recognition in 90+ languages☆17,767Updated this week
- Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.☆9,985Updated 3 weeks ago
- Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/☆8,933Updated 2 months ago
- 🌐 Make websites accessible for AI agents. Automate tasks online with ease.☆65,023Updated this week
- Suna - Open Source Generalist AI Agent☆16,501Updated this week
- Fully local web research and report writing assistant☆7,796Updated 2 weeks ago
- Build Real-Time Knowledge Graphs for AI Agents☆12,727Updated this week
- Your AI Operator for Web, Android, Automation & Testing.☆9,595Updated this week
- Use your locally running AI models to assist you in your web browsing☆6,830Updated last week
- 🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN☆47,851Updated this week
- RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.☆59,587Updated this week
- Full-stack framework for building Multi-Agent Systems with memory, knowledge and reasoning.☆29,710Updated this week
- Keep searching, reading webpages, reasoning until it finds the answer (or exceeding the token budget)☆4,588Updated this week
- A visual playground for agentic workflows: Iterate over your agents 10x faster☆5,287Updated last week
- Vision agent☆4,934Updated last week
- Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks☆6,605Updated last month
- A free and open source, self hosted Ai based live meeting note taker and minutes summary generator that can completely run in your Local …☆6,612Updated last week
- A high-performance LLM inference API and Chat UI that integrates DeepSeek R1's CoT reasoning traces with Anthropic Claude models.☆5,252Updated last month