deepseek-ai / awesome-deepseek-coder
A curated list of open-source projects related to DeepSeek Coder
☆679 Updated last year
Alternatives and similar repositories for awesome-deepseek-coder:
Users who are interested in awesome-deepseek-coder compare it to the repositories listed below
- Expert Specialized Fine-Tuning ☆601 Updated 7 months ago
- DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models ☆1,664 Updated last year
- DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence ☆5,658 Updated 7 months ago
- DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models ☆2,654 Updated last year
- ☆496 Updated 8 months ago
- DeepSeek-VL: Towards Real-World Vision-Language Understanding ☆3,802 Updated last year
- DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model ☆4,870 Updated 7 months ago
- DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding ☆4,757 Updated 2 months ago
- The official repo of MiniMax-Text-01 and MiniMax-VL-01, large-language-model & vision-language-model based on Linear Attention ☆2,557 Updated 2 weeks ago
- An Open Large Reasoning Model for Real-World Solutions ☆1,483 Updated last month
- ☆3,293 Updated last month
- DeepSeek LLM: Let there be answers ☆6,331 Updated last year
- Muon is Scalable for LLM Training ☆1,029 Updated last month
- Fully open data curation for reasoning models ☆1,726 Updated 3 weeks ago
- Democratizing Reinforcement Learning for LLMs ☆3,123 Updated 2 weeks ago
- LiveBench: A Challenging, Contamination-Free LLM Benchmark ☆686 Updated this week
- We introduced a new model designed for the Code generation task. Its test accuracy on the HumanEval base dataset surpasses that of GPT-4 … ☆843 Updated 9 months ago
- Official repository for the paper "LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code" ☆444 Updated this week
- Scalable RL solution for advanced reasoning of language models ☆1,504 Updated last month
- ☆716 Updated last week
- A live stream development of RL tuning for LLM agents ☆2,515 Updated this week
- Analyze computation-communication overlap in V3/R1. ☆1,005 Updated last month
- Sky-T1: Train your own O1 preview model within $450 ☆3,220 Updated last week
- OLMoE: Open Mixture-of-Experts Language Models ☆723 Updated last month
- Official Repo for Open-Reasoner-Zero ☆1,887 Updated 2 weeks ago
- CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction ☆507 Updated 2 months ago
- Expert Parallelism Load Balancer ☆1,153 Updated last month
- The Open Cookbook for Top-Tier Code Large Language Model ☆1,685 Updated 4 months ago
- ☆1,355 Updated 5 months ago
- DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling ☆5,251 Updated this week