deepseek-ai / awesome-deepseek-coder
A curated list of open-source projects related to DeepSeek Coder
☆672Updated last year
Alternatives and similar repositories for awesome-deepseek-coder:
Users that are interested in awesome-deepseek-coder are comparing it to the libraries listed below
- Expert Specialized Fine-Tuning☆600Updated 6 months ago
- DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models☆2,627Updated last year
- DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence☆5,614Updated 6 months ago
- DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models☆1,651Updated last year
- DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model☆4,870Updated 6 months ago
- ☆492Updated 8 months ago
- DeepSeek-VL: Towards Real-World Vision-Language Understanding☆3,777Updated 11 months ago
- DeepSeek LLM: Let there be answers☆6,303Updated last year
- DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding☆4,713Updated last month
- The official repo of MiniMax-Text-01 and MiniMax-VL-01, large-language-model & vision-language-model based on Linear Attention☆2,514Updated last week
- Fully open data curation for reasoning models☆1,697Updated last week
- An Open Large Reasoning Model for Real-World Solutions☆1,484Updated last month
- Qwen2.5-Coder is the code version of Qwen2.5, the large language model series developed by Qwen team, Alibaba Cloud.☆4,832Updated last month
- Yi-1.5 is an upgraded version of Yi, delivering stronger performance in coding, math, reasoning, and instruction-following capability.☆553Updated 5 months ago
- The Open Cookbook for Top-Tier Code Large Language Model☆1,675Updated 4 months ago
- Sky-T1: Train your own O1 preview model within $450☆3,197Updated last week
- Scalable RL solution for advanced reasoning of language models☆1,478Updated last month
- Muon is Scalable for LLM Training☆1,020Updated 3 weeks ago
- Home of StarCoder2!☆1,898Updated last year
- Official repository for the paper "LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code"☆428Updated this week
- Analyze computation-communication overlap in V3/R1.☆991Updated 3 weeks ago
- Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution"☆495Updated last month
- ☆390Updated this week
- Janus-Series: Unified Multimodal Understanding and Generation Models☆17,117Updated 2 months ago
- Arena-Hard-Auto: An automatic LLM benchmark.☆776Updated 3 weeks ago
- 🌟 Yi-Coder is a series of open-source code language models that delivers state-of-the-art coding performance with fewer than 10 billion …☆402Updated 7 months ago
- A powerful coding assistant application that integrates with the DeepSeek API to process user conversations and generate structured JSON …☆1,444Updated 2 months ago
- Agentless🐱: an agentless approach to automatically solve software development problems☆1,616Updated 3 months ago
- Expert Parallelism Load Balancer☆1,136Updated 3 weeks ago
- A bidirectional pipeline parallelism algorithm for computation-communication overlap in V3/R1 training.☆2,715Updated last month