imanoop7/Ollama-OCR

imanoop7 / Ollama-OCR

☆2,680

Alternatives and similar repositories for Ollama-OCR

Users who are interested in Ollama-OCR are comparing it to the repositories listed below.

allenai / olmocr
Toolkit for linearizing PDFs for LLM datasets/training
☆19,207 · Mar 25, 2026 · Updated 4 months ago
CatchTheTornado / text-extract-api
Document (PDF, Word, PPTX ...) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents…
☆3,150 · Dec 8, 2025 · Updated 7 months ago
enoch3712 / ExtractThinker
ExtractThinker is a Document Intelligence library for LLMs, offering ORM-style interaction for flexible and powerful document workflows.
☆1,587 · Aug 27, 2025 · Updated 11 months ago
studio-dots-ai / dots.ocr
Multilingual Document Layout Parsing in a Single Vision-Language Model
☆9,038 · Mar 24, 2026 · Updated 4 months ago
docling-project / docling
Get your documents ready for gen AI
☆63,895 · Updated this week
microsoft / data-formulator
🪄 Data Formulator is an interactive AI-powered data analysis system that makes it easy to connect, explore and visualize data.
☆15,986 · Updated this week
datalab-to / surya
OCR, layout analysis, reading order, table recognition in 90+ languages
☆21,167 · Updated this week
datalab-to / chandra
OCR model that handles complex tables, forms, handwriting with full layout.
☆11,793 · Jun 26, 2026 · Updated last month
bytedance / Dolphin
The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.
☆9,041 · Mar 25, 2026 · Updated 4 months ago
ucbepic / docetl
A system for agentic LLM-powered data processing and ETL
☆3,951 · Jul 21, 2026 · Updated last week
lumina-ai-inc / chunkr
Vision infrastructure to turn complex documents into RAG/LLM-ready data
☆4,058 · Apr 9, 2026 · Updated 3 months ago
echohive42 / AI-reads-books-page-by-page
AI reads books: Page-by-Page PDF Knowledge Extractor & Summarizer. script performs an intelligent page-by-page analysis of PDF books, met…
☆2,294 · Jun 27, 2026 · Updated last month
transformerlab / transformerlab-app
The open source research environment for AI researchers to seamlessly train, evaluate, and scale models from local hardware to GPU cluste…
☆5,166 · Updated this week
Nutlope / llama-ocr
Document to Markdown OCR library with Llama 3.2 vision
☆2,429 · Jul 12, 2026 · Updated 2 weeks ago
google / langextract
A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive vi…
☆37,911 · Updated this week
GitHamza0206 / simba
OpenSource Production ready Customer service with built in Evals and monitoring
☆1,451 · Jun 18, 2026 · Updated last month
unclecode / crawl4ai
🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN
☆75,339 · Updated this week
HKUDS / AutoAgent
"AutoAgent: Fully-Automated and Zero-Code LLM Agent Framework"
☆9,544 · Oct 16, 2025 · Updated 9 months ago
getomni-ai / zerox
OCR & Document Extraction using vision models
☆12,258 · May 20, 2025 · Updated last year
datalab-to / marker
Convert PDF to markdown + JSON quickly with high accuracy
☆37,970 · Jul 20, 2026 · Updated last week
QuivrHQ / MegaParse
File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.
☆7,410 · Feb 21, 2025 · Updated last year
supavec / supavec
The open-source alternative to Carbon.ai. Build powerful RAG applications with any data source, at any scale.
☆1,151 · Dec 28, 2025 · Updated 7 months ago
Ucas-HaoranWei / GOT-OCR2.0
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
☆8,208 · Feb 10, 2025 · Updated last year
Cinnamon / kotaemon
An open-source RAG-based tool for chatting with your documents.
☆25,663 · Jul 14, 2026 · Updated 2 weeks ago
tjmlabs / ColiVara
Colivara is a suite of services that allows you to store, search, and retrieve documents based on their visual embedding. ColiVara has st…
☆1,484 · Jul 8, 2026 · Updated 3 weeks ago
shcherbak-ai / contextgem
ContextGem: Effortless LLM extraction from documents
☆1,864 · Updated this week
katanaml / sparrow
Structured data extraction, instruction calling and agentic workflows with ML, LLM and Vision LLM
☆5,188 · Jun 30, 2026 · Updated 3 weeks ago
browser-use / browser-use
🌐 Make websites accessible for AI agents. Automate tasks online with ease.
☆107,106 · Updated this week
Canner / WrenAI
GenBI (Generative BI) for AI agents, an open-source, governed text-to-SQL through an open context layer that turns natural-language quest…
☆16,701 · Updated this week
unslothai / unsloth
Unsloth is a local UI for training and running Gemma 4, Qwen3.6, DeepSeek, Kimi, GLM and other models.
☆68,965 · Updated this week
Nutlope / logocreator
A free + OSS logo generator powered by Flux on Together AI
☆7,179 · Jun 26, 2026 · Updated last month
steel-dev / steel-browser
🔥 Open Source Browser API for AI Agents & Apps. Steel Browser is a batteries-included browser sandbox that lets you automate the web wit…
☆7,392 · Updated this week
simstudioai / sim
Build, deploy, and orchestrate AI agents. Sim is the central intelligence layer for your AI workforce.
☆29,232 · Updated this week
agno-agi / agno
Build, run, and manage agent platforms.
☆41,472 · Updated this week
harishdeivanayagam / rowfill
Open-source spreadsheets platform for deep research and document processing
☆368 · Sep 25, 2025 · Updated 10 months ago
browserbase / stagehand
The SDK For Browser Agents
☆23,659 · Updated this week
open-webui / open-webui
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
☆147,076 · Updated this week
opendatalab / MinerU
Transforms complex documents like PDFs and Office docs into LLM-ready markdown/JSON for your Agentic workflows.
☆75,993 · Updated this week
abus-aikorea / voice-pro
Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with…
☆11,250 · Jul 13, 2026 · Updated 2 weeks ago