deepseek-ai / awesome-deepseek-coder
A curated list of open-source projects related to DeepSeek Coder
☆571Updated 10 months ago
Alternatives and similar repositories for awesome-deepseek-coder:
Users that are interested in awesome-deepseek-coder are comparing it to the libraries listed below
- Expert Specialized Fine-Tuning☆529Updated 5 months ago
- DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models☆1,409Updated last year
- ☆422Updated 6 months ago
- DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models☆2,389Updated 10 months ago
- DeepSeek-VL: Towards Real-World Vision-Language Understanding☆3,507Updated 9 months ago
- DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence☆5,163Updated 4 months ago
- DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model☆4,769Updated 4 months ago
- Arena-Hard-Auto: An automatic LLM benchmark.☆745Updated last month
- We introduced a new model designed for the Code generation task. Its test accuracy on the HumanEval base dataset surpasses that of GPT-4 …☆834Updated 7 months ago
- DeepSeek LLM: Let there be answers☆5,914Updated last year
- Official repository for the paper "LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code"☆325Updated 3 weeks ago
- 🌟 Yi-Coder is a series of open-source code language models that delivers state-of-the-art coding performance with fewer than 10 billion …☆391Updated 5 months ago
- Agentless🐱: an agentless approach to automatically solve software development problems☆1,455Updated 2 months ago
- Inference engine powering open source models on OpenRouter☆745Updated last month
- ☆404Updated last year
- Official Repo for ICML 2024 paper "Executable Code Actions Elicit Better LLM Agents" by Xingyao Wang, Yangyi Chen, Lifan Yuan, Yizhe Zhan…☆600Updated 8 months ago
- DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding☆3,827Updated this week
- Qwen2.5-Coder is the code version of Qwen2.5, the large language model series developed by Qwen team, Alibaba Cloud.☆4,510Updated last week
- [ICLR'25] BigCodeBench: Benchmarking Code Generation Towards AGI☆293Updated this week
- A series of math-specific large language models of our Qwen2 series.☆807Updated last month
- Rigourous evaluation of LLM-synthesized code - NeurIPS 2023 & COLM 2024☆1,369Updated last month
- [ICLR 2025] LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs☆1,601Updated 3 months ago
- multi1: create o1-like reasoning chains with multiple AI providers (and locally). Supports LiteLLM as backend too for 100+ providers at o…☆345Updated 3 weeks ago
- Scalable RL solution for advanced reasoning of language models