Princeton-AI2-Lab/DeepOCR

Princeton-AI2-Lab / DeepOCR

A reproduction of the DeepSeek-OCR model, including training

☆208

Alternatives and similar repositories for DeepOCR

Users who are interested in DeepOCR are comparing it to the repositories listed below.

Tencent-Hunyuan / Hunyuan-4B
☆16 · Aug 5, 2025 · Updated 11 months ago
shi-yx / URaG
Official implementation of URaG: Unified Retrieval and Generation in Multimodal LLMs for Efficient Long Document Understanding (AAAI 2026…
☆43 · Feb 4, 2026 · Updated 5 months ago
buyukakyuz / parlance
Decentralized peer-to-peer messaging with bootstrap discovery, direct connections, and no centralized message infrastructure.
☆82 · Nov 9, 2025 · Updated 8 months ago
lcy0604 / QT-TextSR
This repository is the implementation of "QT-TextSR: Enhancing scene text image super-resolution via efficient interaction with text reco…
☆20 · Jul 9, 2025 · Updated last year
AgnetLabs / Laddr
Laddr is a Python framework for building multi-agent systems where agents communicate, delegate tasks, and execute work in parallel. Thin…
☆341 · Jul 14, 2026 · Updated last week
lucasjinreal / LLaVA-Magvit2
LLaVA combined with the Magvit image tokenizer, training an MLLM without a Vision Encoder. Unifies image understanding and generation.
☆38 · Jun 20, 2024 · Updated 2 years ago
marinero4972 / CyberV
☆20 · Jun 10, 2025 · Updated last year
LINs-lab / GMem
[Preprint] GMem: A Modular Approach for Ultra-Efficient Generative Models
☆43 · Mar 11, 2025 · Updated last year
jszheng21 / RACE
RACE is a multi-dimensional benchmark for code generation that focuses on Readability, mAintainability, Correctness, and Efficiency.
☆14 · Oct 12, 2024 · Updated last year
joaomarcoscsilva / mixture-of-experts
A replication of the paper "Adaptive Mixtures of Local Experts" applied to the CIFAR-10 image classification dataset.
☆12 · Mar 19, 2021 · Updated 5 years ago
inclusionAI / GroveMoE
☆24 · Aug 20, 2025 · Updated 11 months ago
wangf3014 / Patch_Scaling
Official implementation of Scaling Laws in Patchification: An Image Is Worth 50,176 Tokens And More
☆25 · Feb 25, 2025 · Updated last year
davisyoshida / jax-gptq
JAX implementation of the GPTQ quantization algorithm
☆10 · Jul 19, 2023 · Updated 3 years ago
microsoft / echo-rl
☆55 · May 26, 2026 · Updated 2 months ago
MoonshotAI / WorldVQA
☆119 · Feb 4, 2026 · Updated 5 months ago
Alibaba-NLP / E2Rank
E2Rank: Your Text Embedding can Also be an Effective and Efficient Listwise Reranker
☆58 · Jul 1, 2026 · Updated 3 weeks ago
0xeb / claude-agent-sdk-cpp
C++ port of the Python claude-agent-sdk
☆28 · May 16, 2026 · Updated 2 months ago
LunarShen / DsicoVLA
[CVPR 2025] DiscoVLA: Discrepancy Reduction in Vision, Language, and Alignment for Parameter-Efficient Video-Text Retrieval
☆22 · Jun 23, 2025 · Updated last year
zlab-princeton / llm-distillation-jax
JAX implementation of configurable LLM distillation training
☆24 · Nov 15, 2025 · Updated 8 months ago
phucty / wtabhtml
Tool to parse wiki tables from the HTML dump of Wikipedia
☆11 · Jun 12, 2022 · Updated 4 years ago
chanchimin / AgentMonitor
Code for our paper "AgentMonitor: A Plug-and-Play Framework for Predictive and Secure Multi-Agent Systems"
☆13 · Dec 13, 2024 · Updated last year
kyegomez / MGQA
The open source implementation of multi-grouped query attention from the paper "GQA: Training Generalized Multi-Query Transformer Model…
☆17 · Dec 11, 2023 · Updated 2 years ago
huggingface / finepdfs
Codebase for FinePDFs
☆187 · Jan 9, 2026 · Updated 6 months ago
verl-project / rl-insight
Provides performance insight capabilities for RL frameworks.
☆47 · Updated this week
fla-org / native-sparse-attention
🐳 Efficient Triton implementations for "Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention"
☆1,014 · Feb 5, 2026 · Updated 5 months ago
huggingface / peft-pytorch-conference
Code for the examples presented in the talk "Training a Llama in your backyard: fine-tuning very large models on consumer hardware" given…
☆15 · Oct 16, 2023 · Updated 2 years ago
Lossfunk / KernelBench-v2
KernelBench v2: Can LLMs Write GPU Kernels? - Benchmark with Torch -> Triton (and more!) problems
☆24 · Jul 4, 2025 · Updated last year
notch-ai / autosteer
Desktop app for multi-workspace Claude Code management
☆67 · Nov 26, 2025 · Updated 8 months ago
PKU-AICare / ConfAgents
ConfAgents: A Conformal-Guided Multi-Agent Framework for Cost-Efficient Medical Diagnosis
☆15 · Updated this week
SteamedBread2333 / MarkX
Professional Markdown editor with Mermaid diagrams & KaTeX formulas. Zero-config, pure static, export to PDF/HTML. Perfect for technical …
☆37 · Jan 14, 2026 · Updated 6 months ago
baaivision / Emu3.5
Native Multimodal Models are World Learners
☆1,538 · Dec 30, 2025 · Updated 6 months ago
Martinser / REG
[NeurIPS 2025 Oral] Representation Entanglement for Generation: Training Diffusion Transformers Is Much Easier Than You Think
☆274 · Oct 4, 2025 · Updated 9 months ago
inclusionAI / Ming-UniVision
Code release for Ming-UniVision: Joint Image Understanding and Generation with a Continuous Unified Tokenizer
☆143 · Oct 14, 2025 · Updated 9 months ago
MoonshotAI / Kimi-Linear
☆1,498 · Nov 17, 2025 · Updated 8 months ago
Ami3466 / tomcp
Turn any website or doc into an MCP server
☆174 · Dec 20, 2025 · Updated 7 months ago
ctlllll / gpt-oss-reverse-engineering
☆72 · Aug 6, 2025 · Updated 11 months ago
Vyvo-Labs / VyvoTTS
VyvoTTS: LLM-Based Text-to-Speech Training Framework
☆257 · Apr 8, 2026 · Updated 3 months ago
hiyouga / EasyR1
EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL
☆5,082 · Updated this week
lezhang7 / TreeMix
[NAACL 2022] TreeMix: Compositional Constituency-based Data Augmentation for Natural Language Understanding
☆10 · Jul 15, 2023 · Updated 3 years ago