deepseek-ai / awesome-deepseek-coder
A curated list of open-source projects related to DeepSeek Coder
☆710 Updated last year
Alternatives and similar repositories for awesome-deepseek-coder
Users who are interested in awesome-deepseek-coder are comparing it to the repositories listed below.
- Expert Specialized Fine-Tuning ☆649 Updated last month
- DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models ☆1,741 Updated last year
- ☆529 Updated 10 months ago
- DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models ☆2,801 Updated last year
- DeepSeek-VL: Towards Real-World Vision-Language Understanding ☆3,905 Updated last year
- DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence ☆5,896 Updated 9 months ago
- DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model ☆4,921 Updated 9 months ago
- DeepSeek LLM: Let there be answers ☆6,453 Updated last year
- ☆3,389 Updated 4 months ago
- [ICLR 2025] LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs ☆1,693 Updated 2 weeks ago
- ☆1,162 Updated 2 months ago
- Analyze computation-communication overlap in V3/R1. ☆1,075 Updated 3 months ago
- Official repository for the paper "LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code" ☆582 Updated 2 weeks ago
- Expert Parallelism Load Balancer ☆1,226 Updated 3 months ago
- ☆1,356 Updated 7 months ago
- The official repo of MiniMax-Text-01 and MiniMax-VL-01, large-language-model & vision-language-model based on Linear Attention ☆3,016 Updated this week
- A series of math-specific large language models of our Qwen2 series. ☆960 Updated 5 months ago
- ☆796 Updated last month
- Seed-Coder is a family of lightweight open-source code LLMs comprising base, instruct and reasoning models, developed by ByteDance Seed. ☆520 Updated last month
- Qwen2.5-Coder is the code version of Qwen2.5, the large language model series developed by Qwen team, Alibaba Cloud. ☆5,061 Updated 2 weeks ago
- ZeroSearch: Incentivize the Search Capability of LLMs without Searching ☆1,036 Updated last week
- ☆464 Updated 10 months ago
- MiMo: Unlocking the Reasoning Potential of Language Model – From Pretraining to Posttraining ☆1,487 Updated last month
- ☆1,551 Updated 7 months ago
- DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding ☆4,947 Updated 4 months ago
- Unleashing the Power of Reinforcement Learning for Math and Code Reasoners ☆644 Updated last month
- Scalable RL solution for advanced reasoning of language models ☆1,642 Updated 3 months ago
- Fully open data curation for reasoning models ☆1,959 Updated last month
- The official repository of the dots.llm1 base and instruct models proposed by rednote-hilab. ☆428 Updated 3 weeks ago
- Qwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and pe… ☆3,271 Updated 3 weeks ago