allenai / olmocrLinks
Toolkit for linearizing PDFs for LLM datasets/training
☆14,236Updated last week
Alternatives and similar repositories for olmocr
Users that are interested in olmocr are comparing it to the libraries listed below
Sorting:
- OCR & Document Extraction using vision models☆11,882Updated 5 months ago
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model☆7,893Updated 8 months ago
- OCR, layout analysis, reading order, table recognition in 90+ languages☆18,730Updated this week
- A Comprehensive Toolkit for High-Quality PDF Content Extraction☆8,788Updated 9 months ago
- Convert PDF to markdown + JSON quickly with high accuracy☆29,230Updated last week
- Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.☆7,027Updated 3 months ago
- Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.☆46,581Updated this week
- A simple screen parsing tool towards pure vision based GUI agent☆23,723Updated last month
- 🌐 Make websites accessible for AI agents. Automate tasks online with ease.☆71,470Updated this week
- File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.☆7,198Updated 8 months ago
- The python library for real-time communication☆4,343Updated last month
- The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.☆7,405Updated 3 weeks ago
- Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.☆11,959Updated 3 weeks ago
- Fully local web research and report writing assistant☆8,213Updated 2 months ago
- 🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)☆6,646Updated 3 months ago
- ☆7,913Updated last week
- Build Real-Time Knowledge Graphs for AI Agents☆19,130Updated this week
- KAG is a logical form-guided reasoning and retrieval framework based on OpenSPG engine and LLMs. It is used to build logical reasoning a…☆7,990Updated 3 weeks ago
- 🔥 The Web Data API for AI - Turn entire websites into LLM-ready markdown or structured data☆63,051Updated this week
- ⚡️ GenBI (Generative BI) queries any database in natural language, generates accurate SQL (Text-to-SQL), charts (Text-to-Chart), and AI-p…☆12,266Updated this week
- ⚡ Easiest no code web data extraction platform • Instantly turn any website into API or spreadsheet ⚡☆13,719Updated this week
- Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks☆6,709Updated 4 months ago
- NeMo Retriever extraction is a scalable, performance-oriented document content and metadata extraction microservice. NeMo Retriever extra…☆2,751Updated this week
- Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/☆9,299Updated 5 months ago
- 🖥️ Run AI Agent in your browser.☆15,026Updated last month
- A visual playground for agentic workflows: Iterate over your agents 10x faster☆5,538Updated 3 months ago
- AI app store powered by 24/7 desktop history. open source | 100% local | dev friendly | 24/7 screen, mic recording☆15,768Updated last month
- The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infra☆19,152Updated last week
- Keep searching, reading webpages, reasoning until it finds the answer (or exceeding the token budget)☆4,923Updated 2 weeks ago
- A modular graph-based Retrieval-Augmented Generation (RAG) system☆28,652Updated this week