allenai / olmocrLinks
Toolkit for linearizing PDFs for LLM datasets/training
☆12,482Updated this week
Alternatives and similar repositories for olmocr
Users that are interested in olmocr are comparing it to the libraries listed below
Sorting:
- A simple screen parsing tool towards pure vision based GUI agent☆22,258Updated 2 months ago
- OCR & Document Extraction using vision models☆11,232Updated last week
- 🚀 The fast, Pythonic way to build MCP servers and clients☆11,359Updated this week
- 🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation☆16,646Updated this week
- Suna - Open Source Generalist AI Agent☆13,503Updated this week
- Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.☆6,074Updated last week
- ☆6,194Updated last week
- A GUI Agent application based on UI-TARS(Vision-Language Model) that allows you to control your computer using natural language.☆14,216Updated this week
- Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.☆8,967Updated last week
- Build Real-Time Knowledge Graphs for AI Agents☆9,878Updated this week
- The easiest tool for fine-tuning LLM models, synthetic data generation, and collaborating on datasets.☆3,504Updated this week
- Get your documents ready for gen AI☆30,684Updated this week
- The python library for real-time communication☆3,970Updated last week
- A lightweight, powerful framework for multi-agent workflows☆10,722Updated last week
- ⚡️ Open Source No Code Web Data Extraction Platform • Turn Websites To APIs & Spreadsheets In Minutes ⚡️☆12,829Updated this week
- Fully local web research and report writing assistant☆7,489Updated 2 months ago
- A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations☆14,176Updated this week
- No fortress, purely open ground. OpenManus is Coming.☆46,082Updated this week
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model☆7,581Updated 3 months ago
- Task-Aware Agent-driven Prompt Optimization Framework☆3,265Updated last week
- 🖥️ Run AI Agent in your browser.☆13,323Updated last week
- Your AI Operator for Web, Android, Automation & Testing.☆8,983Updated this week
- AI app store powered by 24/7 desktop history. open source | 100% local | dev friendly | 24/7 screen, mic recording☆14,834Updated this week
- Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.☆21,673Updated last week
- 🪄 Create rich visualizations with AI☆12,166Updated 2 weeks ago
- RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.☆53,553Updated this week
- An open-source RAG-based tool for chatting with your documents.☆22,347Updated last month
- 🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.☆38,945Updated this week
- A modular graph-based Retrieval-Augmented Generation (RAG) system☆25,439Updated last week
- The official Python SDK for Model Context Protocol servers and clients☆13,211Updated this week