deepseek-ai/DeepSeek-OCR

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/deepseek-ai/DeepSeek-OCR)

deepseek-ai / DeepSeek-OCR

Contexts Optical Compression

☆23,689

Alternatives and similar repositories for DeepSeek-OCR

Users that are interested in DeepSeek-OCR are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

deepseek-ai / DeepSeek-OCR-2
View on GitHub
Visual Causal Flow
☆3,189Feb 3, 2026Updated 5 months ago
PaddlePaddle / PaddleOCR
View on GitHub
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/…
☆86,350Updated this week
vllm-project / vllm
View on GitHub
A high-throughput and memory-efficient inference and serving engine for LLMs
☆87,317Updated this week
QwenLM / Qwen3-VL
View on GitHub
Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
☆19,677Jan 30, 2026Updated 5 months ago
karpathy / nanochat
View on GitHub
The best ChatGPT that $100 can buy.
☆56,667Jul 4, 2026Updated 3 weeks ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
unslothai / unsloth
View on GitHub
Unsloth is a local UI for training and running Gemma 4, Qwen3.6, DeepSeek, Kimi, GLM and other models.
☆68,965Updated this week
Alibaba-NLP / DeepResearch
View on GitHub
Tongyi Deep Research, the Leading Open-source Deep Research Agent
☆19,741Feb 27, 2026Updated 5 months ago
opendatalab / MinerU
View on GitHub
Transforms complex documents like PDFs and Office docs into LLM-ready markdown/JSON for your Agentic workflows.
☆75,875Updated this week
google / langextract
View on GitHub
A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive vi…
☆37,890Updated this week
openclaw / openclaw
View on GitHub
Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞
☆384,316Updated this week
hiyouga / LlamaFactory
View on GitHub
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
☆73,550Updated this week
studio-dots-ai / dots.ocr
View on GitHub
Multilingual Document Layout Parsing in a Single Vision-Language Model
☆9,031Mar 24, 2026Updated 4 months ago
openai / codex
View on GitHub
Lightweight coding agent that runs in your terminal
☆101,844Updated this week
allenai / olmocr
View on GitHub
Toolkit for linearizing PDFs for LLM datasets/training
☆19,198Mar 25, 2026Updated 4 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
anomalyco / opencode
View on GitHub
The open source coding agent.
☆190,170Updated this week
google-gemini / gemini-cli
View on GitHub
An open-source AI agent that brings the power of Gemini directly into your terminal.
☆106,209Updated this week
langgenius / dify
View on GitHub
Build Agentic workflows, RAG pipelines, with rich AI model and tool support on one collaborative workspace. Deploy on cloud, VPC, or self…
☆150,431Updated this week
QwenLM / Qwen3
View on GitHub
Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.
☆27,432Jan 9, 2026Updated 6 months ago
microsoft / markitdown
View on GitHub
Python tool for converting files and office documents to Markdown.
☆169,411Updated this week
infiniflow / ragflow
View on GitHub
RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to creat…
☆86,041Updated this week
mem0ai / mem0
View on GitHub
Universal memory layer for AI Agents
☆61,841Updated this week
docling-project / docling
View on GitHub
Get your documents ready for gen AI
☆63,795Updated this week
MoonshotAI / Kimi-K2
View on GitHub
Kimi K2 is the large language model series developed by Moonshot AI team
☆11,044Jan 21, 2026Updated 6 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
ollama / ollama
View on GitHub
Get up and running with Kimi-K2.6, GLM-5.2, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.
☆176,993Updated this week
sgl-project / sglang
View on GitHub
SGLang is a high-performance serving framework for large language models and multimodal models.
☆30,756Updated this week
ggml-org / llama.cpp
View on GitHub
LLM inference in C/C++
☆121,787Updated this week
browser-use / browser-use
View on GitHub
🌐 Make websites accessible for AI agents. Automate tasks online with ease.
☆106,990Updated this week
openai / gpt-oss
View on GitHub
gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI
☆20,265Updated this week
verl-project / verl
View on GitHub
verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework
☆22,688Updated this week
microsoft / graphrag
View on GitHub
A modular graph-based Retrieval-Augmented Generation (RAG) system
☆34,912Updated this week
FoundationAgents / OpenManus
View on GitHub
No fortress, purely open ground. OpenManus is Coming.
☆57,661Feb 11, 2026Updated 5 months ago
microsoft / VibeVoice
View on GitHub
Open-Source Frontier Voice AI
☆50,574Updated this week
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
NousResearch / hermes-agent
View on GitHub
The agent that grows with you
☆221,282Updated this week
deepseek-ai / DeepSeek-R1
View on GitHub
☆91,973Jun 27, 2025Updated last year
anthropics / claude-code
View on GitHub
Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing rout…
☆139,292Updated this week
QwenLM / Qwen-Agent
View on GitHub
Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.
☆16,855Mar 4, 2026Updated 4 months ago
bytedance / deer-flow
View on GitHub
An open-source long-horizon SuperAgent harness that researches, codes, and creates. With the help of sandboxes, memories, tools, skill, s…
☆77,964Updated this week
BerriAI / litellm
View on GitHub
The fastest, litest AI Gateway. Rust core with Python SDK. Call 100+ LLM APIs in OpenAI (or native) format with cost tracking, guardrails…
☆54,835Updated this week
anthropics / skills
View on GitHub
Public repository for Agent Skills
☆164,536Updated this week