deepseek-ai / DeepSeek-R1Links
☆90,232Updated 2 months ago
Alternatives and similar repositories for DeepSeek-R1
Users that are interested in DeepSeek-R1 are comparing it to the libraries listed below
Sorting:
- ☆97,768Updated last week
- DeepSeek Coder: Let the Code Write Itself☆21,766Updated last year
- Janus-Series: Unified Multimodal Understanding and Generation Models☆17,380Updated 4 months ago
- Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, Mistral Small 3.1 and other large language models.☆144,510Updated this week
- Fully open reproduction of DeepSeek-R1☆24,859Updated this week
- Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.☆22,175Updated last week
- User-friendly AI Interface (Supports Ollama, OpenAI API, ...)☆99,512Updated this week
- No fortress, purely open ground. OpenManus is Coming.☆47,108Updated last week
- DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding☆4,903Updated 3 months ago
- Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your p…☆45,908Updated this week
- 🦜🔗 Build context-aware reasoning applications☆109,727Updated this week
- The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.☆18,538Updated last week
- DeepSeek LLM: Let there be answers☆6,434Updated last year
- Integrate the DeepSeek API into popular softwares☆32,932Updated last month
- A high-throughput and memory-efficient inference and serving engine for LLMs☆50,358Updated this week
- Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train Qwen3, Llama 4, DeepSeek-R1, Gemma 3, TTS 2x faster with 70% less VRAM.☆40,815Updated this week
- My JS Synopsis☆15Updated 2 months ago
- Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!☆38,681Updated this week
- Minimal reproduction of DeepSeek R1-Zero☆11,926Updated 2 months ago
- FlashMLA: Efficient MLA decoding kernels☆11,623Updated last month
- The official Meta Llama 3 GitHub site☆28,784Updated 4 months ago
- LLM inference in C/C++☆81,984Updated this week
- A modular graph-based Retrieval-Augmented Generation (RAG) system☆25,911Updated this week
- Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)☆52,785Updated this week
- 🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.☆40,227Updated this week
- A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations☆14,423Updated last week
- Inference code for Llama models☆58,399Updated 4 months ago
- A collection of MCP servers.☆55,987Updated this week
- Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.☆11,094Updated last month
- The AI Code Editor☆30,452Updated 8 months ago