Contexts Optical Compression
★22,657 · Jan 27, 2026 · Updated last month
Alternatives and similar repositories for DeepSeek-OCR
Users who are interested in DeepSeek-OCR are comparing it to the libraries listed below.
- A high-throughput and memory-efficient inference and serving engine for LLMs · ★71,883 · Updated this week
- Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek, Qwen, Llama, Gemma, TTS 2x faster with 70% less VRAM. · ★53,029 · Updated this week
- Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud. · ★26,852 · Jan 9, 2026 · Updated 2 months ago
- Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud. · ★18,505 · Jan 30, 2026 · Updated last month
- Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024) · ★67,966 · Updated this week
- Python tool for converting files and office documents to Markdown. · ★90,316 · Feb 20, 2026 · Updated 2 weeks ago
- Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/… · ★71,727 · Updated this week
- Production-ready platform for agentic workflow development. · ★131,572 · Updated this week
- ★101,885 · Aug 28, 2025 · Updated 6 months ago
- ★91,926 · Jun 27, 2025 · Updated 8 months ago
- Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows. · ★55,275 · Mar 2, 2026 · Updated last week
- RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to creat… · ★74,309 · Updated this week
- Get up and running with Kimi-K2.5, GLM-5, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models. · ★164,248 · Updated this week
- The agent engineering platform · ★128,595 · Updated this week
- Universal memory layer for AI Agents · ★48,604 · Updated this week
- Lightweight coding agent that runs in your terminal · ★62,963 · Updated this week
- Make websites accessible for AI agents. Automate tasks online with ease. · ★79,644 · Updated this week
- Toolkit for linearizing PDFs for LLM datasets/training · ★16,979 · Updated this week
- Fully open reproduction of DeepSeek-R1 · ★25,927 · Nov 24, 2025 · Updated 3 months ago
- LLM inference in C/C++ · ★96,322 · Mar 2, 2026 · Updated last week
- SGLang is a high-performance serving framework for large language models and multimodal models. · ★24,216 · Updated this week
- No fortress, purely open ground. OpenManus is Coming. · ★55,070 · Feb 11, 2026 · Updated 3 weeks ago
- Get your documents ready for gen AI · ★54,754 · Updated this week
- A modular graph-based Retrieval-Augmented Generation (RAG) system · ★31,296 · Updated this week
- Tongyi Deep Research, the Leading Open-source Deep Research Agent · ★18,337 · Feb 27, 2026 · Updated last week
- LlamaIndex is the leading document agent and OCR platform · ★47,374 · Updated this week
- verl: Volcano Engine Reinforcement Learning for LLMs · ★19,519 · Mar 2, 2026 · Updated last week
- User-friendly AI Interface (Supports Ollama, OpenAI API, ...) · ★125,513 · Updated this week
- 🔥 The Web Data API for AI - Turn entire websites into LLM-ready markdown or structured data · ★89,344 · Updated this week
- A programming framework for agentic AI · ★55,236 · Updated this week
- Janus-Series: Unified Multimodal Understanding and Generation Models · ★17,710 · Feb 1, 2025 · Updated last year
- Fair-code workflow automation platform with native AI capabilities. Combine visual building with custom code, self-host or cloud, 400+ in… · ★177,812 · Updated this week
- A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive vi… · ★34,244 · Feb 25, 2026 · Updated last week
- A Gemini 2.5 Flash Level MLLM for Vision, Speech, and Full-Duplex Multimodal Live Streaming on Your Phone · ★24,027 · Feb 23, 2026 · Updated 2 weeks ago
- OpenHands: AI-Driven Development · ★68,459 · Updated this week
- Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN · ★61,332 · Updated this week
- A simple screen parsing tool towards pure vision based GUI agent · ★24,448 · Sep 12, 2025 · Updated 5 months ago
- The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface. · ★104,884 · Updated this week
- Robust Speech Recognition via Large-Scale Weak Supervision · ★95,527 · Dec 15, 2025 · Updated 2 months ago