allenai / olmocrLinks
Toolkit for linearizing PDFs for LLM datasets/training
☆16,058Updated last week
Alternatives and similar repositories for olmocr
Users that are interested in olmocr are comparing it to the libraries listed below
Sorting:
- OCR & Document Extraction using vision models☆11,968Updated 6 months ago
- Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.☆49,494Updated this week
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model☆8,022Updated 9 months ago
- Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.☆48,786Updated this week
- Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.☆7,172Updated last week
- Python tool for converting files and office documents to Markdown.☆83,302Updated last week
- Multilingual Document Layout Parsing in a Single Vision-Language Model☆5,790Updated last month
- A simple screen parsing tool towards pure vision based GUI agent☆23,930Updated 2 months ago
- An open-source RAG-based tool for chatting with your documents.☆24,676Updated 4 months ago
- Convert PDF to markdown + JSON quickly with high accuracy☆30,047Updated last week
- File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.☆7,234Updated 9 months ago
- OCR, layout analysis, reading order, table recognition in 90+ languages☆18,942Updated last month
- 🚀 The fast, Pythonic way to build MCP servers and clients☆20,555Updated last week
- RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to creat…☆68,321Updated last week
- Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.☆12,476Updated 2 months ago
- Easily build AI systems with Evals, RAG, Agents, fine-tuning, synthetic data, and more.☆4,420Updated last week
- A Comprehensive Toolkit for High-Quality PDF Content Extraction☆8,973Updated 10 months ago
- The python library for real-time communication☆4,420Updated this week
- Turn any website into clean, contextualized data pipelines for your workflows☆13,919Updated last week
- A lightweight LMM-based Document Parsing Model☆6,286Updated last week
- Get your documents ready for gen AI☆45,259Updated this week
- The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infra☆19,580Updated last week
- [EMNLP2025] "LightRAG: Simple and Fast Retrieval-Augmented Generation"☆24,741Updated this week
- The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.☆7,827Updated 3 weeks ago
- ☆8,259Updated 2 weeks ago
- A visual playground for agentic workflows: Iterate over your agents 10x faster☆5,596Updated 4 months ago
- Build Real-Time Knowledge Graphs for AI Agents☆20,476Updated this week
- 🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN☆56,514Updated this week
- Yet Another Document Translator☆5,960Updated last week
- 🪄 Create rich visualizations with AI☆14,389Updated last week