deepseek-ai / DeepSeek-R1
☆91,571 Updated 5 months ago
Alternatives and similar repositories for DeepSeek-R1
Users who are interested in DeepSeek-R1 are comparing it to the repositories listed below.
- ☆100,596 Updated 3 months ago
- Janus-Series: Unified Multimodal Understanding and Generation Models ☆17,626 Updated 10 months ago
- Fully open reproduction of DeepSeek-R1 ☆25,717 Updated 2 weeks ago
- DeepSeek Coder: Let the Code Write Itself ☆22,462 Updated 3 weeks ago
- DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding ☆5,143 Updated 9 months ago
- Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud. ☆25,625 Updated last month
- Integrate the DeepSeek API into popular softwares ☆34,642 Updated 2 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆64,758 Updated this week
- DeepSeek LLM: Let there be answers ☆6,640 Updated last year
- FlashMLA: Efficient Multi-head Latent Attention Kernels ☆11,896 Updated 2 months ago
- The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud. ☆19,893 Updated 2 weeks ago
- Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models. ☆157,308 Updated this week
- The official Meta Llama 3 GitHub site ☆29,116 Updated 10 months ago
- DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence ☆6,279 Updated 3 weeks ago
- Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud. ☆16,985 Updated last week
- A Flexible Framework for Experiencing Heterogeneous LLM Inference/Fine-tune Optimizations ☆16,129 Updated this week
- Inference code for Llama models ☆58,968 Updated 10 months ago
- A simple screen parsing tool towards pure vision based GUI agent ☆23,968 Updated 2 months ago
- No fortress, purely open ground. OpenManus is Coming. ☆51,194 Updated 3 weeks ago
- SGLang is a fast serving framework for large language models and vision language models. ☆20,874 Updated last week
- DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model ☆4,980 Updated last year
- Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation ☆7,937 Updated 6 months ago
- DeepEP: an efficient expert-parallel communication library ☆8,788 Updated this week
- s1: Simple test-time scaling ☆6,609 Updated 5 months ago
- ☆3,465 Updated 9 months ago
- 🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation ☆18,431 Updated 2 weeks ago
- Open-Sora: Democratizing Efficient Video Production for All ☆28,040 Updated 7 months ago
- 🌐 Make websites accessible for AI agents. Automate tasks online with ease. ☆73,350 Updated this week
- Minimal reproduction of DeepSeek R1-Zero ☆12,467 Updated 7 months ago
- The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, MCP compatibility, and more. ☆51,821 Updated this week