deepseek-ai / DeepSeek-OCRLinks
Contexts Optical Compression
☆21,144Updated last month
Alternatives and similar repositories for DeepSeek-OCR
Users that are interested in DeepSeek-OCR are comparing it to the libraries listed below
Sorting:
- 高级软件开发技术小组作业☆27Updated last week
- ☆694Updated 2 weeks ago
- Tongyi Deep Research, the Leading Open-source Deep Research Agent☆17,484Updated this week
- The absolute trainer to light up AI agents.☆9,436Updated this week
- Multilingual Document Layout Parsing in a Single Vision-Language Model☆5,840Updated last month
- A simple yet powerful agent framework that delivers with open-source models☆3,936Updated this week
- gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI☆19,360Updated last month
- Renderer for the harmony response format to be used with gpt-oss☆4,050Updated last month
- Toolkit for linearizing PDFs for LLM datasets/training☆16,165Updated this week
- Wan: Open and Advanced Large-Scale Video Generative Models☆12,282Updated 3 weeks ago
- Get started with building Fullstack Agents using Gemini 2.5 and LangGraph☆17,455Updated last week
- Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.☆12,558Updated 2 months ago
- A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive vi…☆17,129Updated last week
- MiniMax-M1, the world's first open-weight, large-scale hybrid-attention reasoning model.☆3,001Updated 5 months ago
- An open protocol enabling communication and interoperability between opaque agentic applications.☆20,910Updated this week
- Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.☆7,222Updated 2 weeks ago
- LLM-powered framework for deep document understanding, semantic retrieval, and context-aware answers using RAG paradigm.☆7,783Updated this week
- The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.☆7,857Updated last month
- The official repo of MiniMax-Text-01 and MiniMax-VL-01, large-language-model & vision-language-model based on Linear Attention☆3,248Updated 5 months ago
- Kimi K2 is the large language model series developed by Moonshot AI team☆9,640Updated last month
- An AI agent development platform with all-in-one visual tools, simplifying agent creation, debugging, and deployment like never before. C…☆18,833Updated this week
- Deepagents is an agent harness built on langchain and langgraph. Deep agents are equipped with a planning tool, a filesystem backend, and…☆6,820Updated this week
- "DeepCode: Open Agentic Coding (Paper2Code & Text2Web & Text2Backend)"☆11,579Updated 2 weeks ago
- Trae Agent is an LLM-based agent for general purpose software engineering tasks.☆10,186Updated 2 months ago
- An open-source, code-first Python toolkit for building, evaluating, and deploying sophisticated AI agents with flexibility and control.☆16,029Updated this week
- MiMo: Unlocking the Reasoning Potential of Language Model – From Pretraining to Posttraining☆1,634Updated 6 months ago
- ☆8,365Updated 3 weeks ago
- Next-generation AI Agent Optimization Platform: Cozeloop addresses challenges in AI agent development by providing full-lifecycle managem…☆5,143Updated this week
- Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.☆16,985Updated last week
- GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models☆3,265Updated this week