allenai / olmocr
Toolkit for linearizing PDFs for LLM datasets/training
☆10,641Updated this week
Alternatives and similar repositories for olmocr:
Users who are interested in olmocr are comparing it to the libraries listed below:
- OCR & Document Extraction using vision models☆10,683Updated this week
- 🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.☆32,762Updated this week
- Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.☆5,048Updated this week
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model☆7,314Updated last month
- A simple screen parsing tool towards pure vision based GUI agent☆21,127Updated this week
- Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks☆6,418Updated 4 months ago
- File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.☆5,917Updated last month
- A high-quality tool for converting PDF to Markdown and JSON. A one-stop, open-source, high-quality data extraction tool that converts PDF to Markdown and JSON.☆29,104Updated this week
- Convert PDF to markdown + JSON quickly with high accuracy☆23,393Updated this week
- 🪄 Create rich visualizations with AI☆10,956Updated this week
- A modular graph-based Retrieval-Augmented Generation (RAG) system☆23,891Updated this week
- Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/☆8,412Updated this week
- Use your locally running AI models to assist you in your web browsing☆6,083Updated this week
- OCR, layout analysis, reading order, table recognition in 90+ languages☆16,979Updated this week
- RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.☆47,110Updated this week
- KAG is a logical form-guided reasoning and retrieval framework based on OpenSPG engine and LLMs. It is used to build logical reasoning a…☆6,198Updated this week
- Fully local web research and report writing assistant☆6,669Updated this week
- A visual playground for agentic workflows: Iterate over your agents 10x faster☆3,910Updated this week
- The easiest tool for fine-tuning LLM models, synthetic data generation, and collaborating on datasets.☆3,264Updated this week
- 🚀🚀 Train a small 26M-parameter GPT completely from scratch in just 2 hours! 🌏☆17,134Updated last month
- Keeps searching, reading webpages, and reasoning until it finds the answer (or exceeds the token budget)☆3,725Updated this week
- 🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)☆6,250Updated 2 months ago
- Vision agent☆4,420Updated this week
- A GUI Agent application based on UI-TARS(Vision-Language Model) that allows you to control your computer using natural language.☆9,517Updated this week
- Get your documents ready for gen AI☆25,450Updated this week
- 🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation☆14,647Updated this week
- An open-source RAG-based tool for chatting with your documents.☆21,795Updated last month
- A collection of MCP servers.☆15,963Updated this week
- The Python library for real-time communication☆3,355Updated this week
- Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you ne…☆7,258Updated this week