☆91,886 · Jun 27, 2025 · Updated 8 months ago
Alternatives and similar repositories for DeepSeek-R1
Users who are interested in DeepSeek-R1 are comparing it to the repositories listed below.
- ☆101,745 · Aug 28, 2025 · Updated 6 months ago
- Fully open reproduction of DeepSeek-R1 ☆25,910 · Nov 24, 2025 · Updated 3 months ago
- Get up and running with Kimi-K2.5, GLM-5, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models. ☆163,632 · Updated this week
- Janus-Series: Unified Multimodal Understanding and Generation Models ☆17,710 · Feb 1, 2025 · Updated last year
- Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud. ☆26,650 · Jan 9, 2026 · Updated last month
- DeepSeek Coder: Let the Code Write Itself ☆22,833 · Nov 11, 2025 · Updated 3 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆71,234 · Updated this week
- Integrate the DeepSeek API into popular software ☆35,654 · Updated this week
- DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding ☆5,233 · Feb 26, 2025 · Updated last year
- Production-ready platform for agentic workflow development. ☆130,029 · Updated this week
- 🦜🔗 The platform for reliable agents. ☆127,192 · Updated this week
- User-friendly AI Interface (Supports Ollama, OpenAI API, ...) ☆124,763 · Updated this week
- 🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal model… ☆157,071 · Updated this week
- FlashMLA: Efficient Multi-head Latent Attention Kernels ☆12,505 · Feb 6, 2026 · Updated 3 weeks ago
- Inference code for Llama models ☆59,166 · Jan 26, 2025 · Updated last year
- No fortress, purely open ground. OpenManus is Coming. ☆54,814 · Feb 11, 2026 · Updated 2 weeks ago
- LLM inference in C/C++ ☆95,726 · Updated this week
- AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus o… ☆182,031 · Updated this week
- Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek, Qwen, Llama, Gemma, TTS 2x faster with 70% less VRAM. ☆52,724 · Updated this week
- The official Meta Llama 3 GitHub site ☆29,265 · Jan 26, 2025 · Updated last year
- Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024) ☆67,659 · Updated this week
- 🌐 Make websites accessible for AI agents. Automate tasks online with ease. ☆79,028 · Updated this week
- The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface. ☆104,246 · Updated this week
- Stable Diffusion web UI ☆161,110 · Dec 18, 2025 · Updated 2 months ago
- The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud. ☆20,456 · Jan 30, 2026 · Updated last month
- Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your p… ☆58,263 · Updated this week
- Robust Speech Recognition via Large-Scale Weak Supervision ☆95,206 · Dec 15, 2025 · Updated 2 months ago
- Fair-code workflow automation platform with native AI capabilities. Combine visual building with custom code, self-host or cloud, 400+ in… ☆176,657 · Updated this week
- DeepEP: an efficient expert-parallel communication library ☆9,005 · Feb 9, 2026 · Updated 2 weeks ago
- Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud. ☆18,386 · Jan 30, 2026 · Updated last month
- Minimal reproduction of DeepSeek R1-Zero ☆12,853 · Updated this week
- DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective. ☆41,648 · Updated this week
- SGLang is a high-performance serving framework for large language models and multimodal models. ☆23,658 · Updated this week
- Lightweight coding agent that runs in your terminal ☆61,978 · Updated this week
- RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to creat… ☆73,900 · Updated this week
- Model Context Protocol Servers ☆79,176 · Updated this week
- Tensors and Dynamic neural networks in Python with strong GPU acceleration ☆97,688 · Updated this week
- A latent text-to-image diffusion model ☆72,575 · Jun 18, 2024 · Updated last year
- 🙌 OpenHands: AI-Driven Development ☆68,154 · Updated this week