Nutlope/llama-ocr

Nutlope / llama-ocr

Document to Markdown OCR library with Llama 3.2 vision

☆2,428

Alternatives and similar repositories for llama-ocr

Users who are interested in llama-ocr are comparing it to the libraries listed below.

getomni-ai / zerox
OCR & Document Extraction using vision models
☆12,259 · May 20, 2025 · Updated last year
CatchTheTornado / text-extract-api
Document (PDF, Word, PPTX ...) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents…
☆3,141 · Dec 8, 2025 · Updated 7 months ago
Nutlope / llamacoder
Open source Claude Artifacts – built with Llama 3.1 405B
☆7,031 · Updated this week
datalab-to / surya
OCR, layout analysis, reading order, table recognition in 90+ languages
☆21,123 · Updated this week
docling-project / docling
Get your documents ready for gen AI
☆63,510 · Updated this week
Ucas-HaoranWei / GOT-OCR2.0
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
☆8,154 · Feb 10, 2025 · Updated last year
allenai / olmocr
Toolkit for linearizing PDFs for LLM datasets/training
☆19,134 · Mar 25, 2026 · Updated 3 months ago
agno-agi / agno
Build, run, and manage agent platforms.
☆41,310 · Updated this week
Cinnamon / kotaemon
An open-source RAG-based tool for chatting with your documents.
☆25,571 · Jul 14, 2026 · Updated last week
imanoop7 / Ollama-OCR
☆2,673 · Mar 17, 2025 · Updated last year
QuivrHQ / MegaParse
File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.
☆7,400 · Feb 21, 2025 · Updated last year
Nutlope / napkins
napkins.dev – from screenshot to app
☆1,474 · Jun 26, 2026 · Updated 3 weeks ago
Nutlope / logocreator
A free + OSS logo generator powered by Flux on Together AI
☆7,056 · Jun 26, 2026 · Updated 3 weeks ago
Nutlope / turboseek
An AI search engine inspired by Perplexity
☆1,646 · Jul 12, 2026 · Updated last week
datalab-to / marker
Convert PDF to markdown + JSON quickly with high accuracy
☆37,684 · Updated this week
browserbase / stagehand
The SDK For Browser Agents
☆23,570 · Updated this week
Dicklesworthstone / llm_aided_ocr
Enhances Tesseract OCR output using LLMs (local or API) for error correction, smart chunking, and markdown formatting of scanned PDFs
☆2,943 · Mar 22, 2026 · Updated 3 months ago
Nutlope / blinkshot
A realtime AI image generator
☆1,037 · Updated this week
unclecode / crawl4ai
🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN
☆73,394 · Updated this week
zaidmukaddam / scira
Scira (Formerly MiniPerplx) is a minimalistic AI-powered search engine that helps you find information on the internet and cites it too. …
☆11,778 · Mar 20, 2026 · Updated 4 months ago
openai / swarm
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.
☆21,848 · Apr 15, 2026 · Updated 3 months ago
browser-use / browser-use
🌐 Make websites accessible for AI agents. Automate tasks online with ease.
☆105,710 · Updated this week
Nutlope / picMenu
Visualize menus in seconds with AI
☆453 · Jul 12, 2026 · Updated last week
janhq / ichigo
Local realtime voice AI
☆2,490 · Nov 26, 2025 · Updated 7 months ago
langchain-ai / open-canvas
📃 A better UX for chat, writing content, and coding with LLMs.
☆5,492 · Feb 25, 2026 · Updated 4 months ago
steel-dev / steel-browser
🔥 Open Source Browser API for AI Agents & Apps. Steel Browser is a batteries-included browser sandbox that lets you automate the web wit…
☆7,353 · Jul 12, 2026 · Updated last week
Marker-Inc-Korea / AutoRAG
AutoRAG: An Open-Source Framework for Retrieval-Augmented Generation (RAG) Evaluation & Optimization with AutoML-Style Automation
☆4,928 · Updated this week
Nutlope / llamatutor
An AI personal tutor built with Llama 3.1
☆1,995 · Jul 12, 2026 · Updated last week
unslothai / unsloth
Unsloth is a local UI for training and running Gemma 4, Qwen3.6, DeepSeek, Kimi, GLM and other models.
☆68,541 · Updated this week
lavague-ai / LaVague
Large Action Model framework to develop AI Web Agents
☆6,380 · Jan 21, 2025 · Updated last year
timescale / pgai
A suite of tools to develop RAG, semantic search, and other AI applications more easily with PostgreSQL
☆5,813 · May 27, 2026 · Updated last month
ItzCrazyKns / Vane
Vane is an AI-powered answering engine.
☆35,783 · Apr 11, 2026 · Updated 3 months ago
run-llama / llama_index
LlamaIndex is the leading document agent and OCR platform
☆50,962 · Updated this week
yigitkonur / api-llm-ocr
PDF to markdown using vision LLMs — tables, layouts, and structure preserved
☆899 · Feb 21, 2026 · Updated 5 months ago
huggingface / smolagents
🤗 smolagents: a barebones library for agents that think in code.
☆28,449 · Jul 14, 2026 · Updated last week
katanaml / sparrow
Structured data extraction, instruction calling and agentic workflows with ML, LLM and Vision LLM
☆5,182 · Jun 30, 2026 · Updated 3 weeks ago
OpenHands / OpenHands
🙌 OpenHands: AI-Driven Development
☆81,406 · Updated this week
ScrapeGraphAI / Scrapegraph-ai
Python scraper based on AI
☆28,509 · Updated this week
coderamp-labs / gitingest
Replace 'hub' with 'ingest' in any GitHub URL to get a prompt-friendly extract of a codebase
☆15,204 · Updated this week