zai-org/GLM-OCR

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/zai-org/GLM-OCR)

zai-org / GLM-OCR

GLM-OCR: Accurate × Fast × Comprehensive

☆7,222

Alternatives and similar repositories for GLM-OCR

Users that are interested in GLM-OCR are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

datalab-to / chandra
View on GitHub
OCR model that handles complex tables, forms, handwriting with full layout.
☆11,796Jun 26, 2026Updated last month
opendataloader-project / opendataloader-pdf
View on GitHub
PDF Parser for AI-ready data. Automate PDF accessibility. Open-source.
☆27,924Updated this week
studio-dots-ai / dots.ocr
View on GitHub
Multilingual Document Layout Parsing in a Single Vision-Language Model
☆9,042Mar 24, 2026Updated 4 months ago
microsoft / VibeVoice
View on GitHub
Open-Source Frontier Voice AI
☆51,268Updated this week
run-llama / liteparse
View on GitHub
A fast, helpful, and open-source document parser
☆11,834Updated this week
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
google / langextract
View on GitHub
A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive vi…
☆37,920Updated this week
VectifyAI / PageIndex
View on GitHub
📑 PageIndex: Document Index for Vectorless, Reasoning-based RAG
☆34,793Updated this week
deepseek-ai / DeepSeek-OCR-2
View on GitHub
Visual Causal Flow
☆3,202Feb 3, 2026Updated 5 months ago
PaddlePaddle / PaddleOCR
View on GitHub
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/…
☆86,350Jul 22, 2026Updated last week
deepseek-ai / DeepSeek-OCR
View on GitHub
Contexts Optical Compression
☆23,696Jan 27, 2026Updated 6 months ago
allenai / olmocr
View on GitHub
Toolkit for linearizing PDFs for LLM datasets/training
☆19,209Mar 25, 2026Updated 4 months ago
D4Vinci / Scrapling
View on GitHub
🕷️ An adaptive Web Scraping framework that handles everything from a single request to a full-scale crawl!
☆71,704Updated this week
baidu / Unlimited-OCR
View on GitHub
Unlimited OCR Works: Welcome the Era of One-shot Long-horizon Parsing.
☆20,311Updated this week
StarTrail-org / LEANN
View on GitHub
[MLsys2026]: RAG on Everything with LEANN. Enjoy 97% storage savings while running a fast, accurate, and 100% private RAG application on …
☆12,743Updated this week
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
unslothai / unsloth
View on GitHub
Unsloth is a local UI for training and running Gemma 4, Qwen3.6, DeepSeek, Kimi, GLM and other models.
☆69,060Updated this week
AlexsJones / llmfit
View on GitHub
Hundreds of models & providers. One command to find what runs on your hardware.
☆30,892Updated this week
google-research / timesfm
View on GitHub
TimesFM (Time Series Foundation Model) is a pretrained time-series foundation model developed by Google Research for time-series forecast…
☆27,147Jul 14, 2026Updated 2 weeks ago
bytedance / deer-flow
View on GitHub
An open-source long-horizon SuperAgent harness that researches, codes, and creates. With the help of sandboxes, memories, tools, skill, s…
☆78,149Updated this week
aaif-goose / goose
View on GitHub
an open source, extensible AI agent that goes beyond code suggestions - install, execute, edit, and test with any LLM
☆51,826Updated this week
docling-project / docling
View on GitHub
Get your documents ready for gen AI
☆63,950Updated this week
onyx-dot-app / onyx
View on GitHub
Open Source AI Platform - AI Chat with advanced features that works with every LLM
☆31,252Updated this week
microsoft / markitdown
View on GitHub
Python tool for converting files and office documents to Markdown.
☆169,975Updated this week
bytedance / Dolphin
View on GitHub
The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.
☆9,041Mar 25, 2026Updated 4 months ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
alibaba / zvec
View on GitHub
A lightweight, lightning-fast, in-process vector database
☆15,309Updated this week
RyanCodrai / turbovec
View on GitHub
A vector index built on TurboQuant, written in Rust with Python bindings
☆14,494Updated this week
rowboatlabs / rowboat
View on GitHub
Open-source AI coworker, with memory
☆16,877Updated this week
zai-org / GLM-Image
View on GitHub
GLM-Image: Auto-regressive for Dense-knowledge and High-fidelity Image Generation.
☆1,008Mar 20, 2026Updated 4 months ago
lightpanda-io / browser
View on GitHub
Lightpanda: the headless browser designed for AI and automation
☆32,934Updated this week
karpathy / autoresearch
View on GitHub
AI agents running research on single-GPU nanochat training automatically
☆92,323Mar 26, 2026Updated 4 months ago
supertone-inc / supertonic
View on GitHub
Lightning-Fast, On-Device, Multilingual TTS — running natively via ONNX.
☆13,547Updated this week
NousResearch / hermes-agent
View on GitHub
The agent that grows with you
☆222,186Updated this week
paperclipai / paperclip
View on GitHub
The open-source app everyone uses to manage agents at work
☆75,065Updated this week
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
alibaba / page-agent
View on GitHub
JavaScript in-page GUI agent. Control web interfaces with natural language.
☆28,141Updated this week
siddharthvaddem / openscreen
View on GitHub
Create stunning demos for free. Open-source, no subscriptions, no watermarks, and free for commercial use. An alternative to Screen Studi…
☆39,864Jun 17, 2026Updated last month
vas3k / TaxHacker
View on GitHub
Self-hosted AI accounting app. LLM analyzer for receipts, invoices, transactions with custom prompts and categories
☆6,555Updated this week
opendatalab / MinerU
View on GitHub
Transforms complex documents like PDFs and Office docs into LLM-ready markdown/JSON for your Agentic workflows.
☆76,160Updated this week
jamiepine / voicebox
View on GitHub
The open-source AI voice studio. Clone, dictate, create.
☆47,343Updated this week
datalab-to / marker
View on GitHub
Convert PDF to markdown + JSON quickly with high accuracy
☆37,994Jul 20, 2026Updated last week
memvid / memvid
View on GitHub
Memory layer for AI Agents. Replace complex RAG pipelines with a serverless, single-file memory layer. Give your agents instant retrieval…
☆16,074Jul 14, 2026Updated 2 weeks ago