deepseek-ai / ESFTLinks
Expert Specialized Fine-Tuning
☆708Updated 5 months ago
Alternatives and similar repositories for ESFT
Users that are interested in ESFT are comparing it to the libraries listed below
Sorting:
- DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models☆1,820Updated last year
- DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models☆2,961Updated last year
- OLMoE: Open Mixture-of-Experts Language Models☆899Updated last month
- ☆540Updated last year
- Large Reasoning Models☆806Updated 11 months ago
- Muon is Scalable for LLM Training☆1,348Updated 3 months ago
- ☆817Updated 4 months ago
- An Open Large Reasoning Model for Real-World Solutions☆1,524Updated 5 months ago
- [ICML 2025 Oral] CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction☆557Updated 6 months ago
- DeepSeek-VL: Towards Real-World Vision-Language Understanding☆4,002Updated last year
- [NeurIPS'24 Spotlight, ICLR'25, ICML'25] To speed up Long-context LLMs' inference, approximate and dynamic sparse calculate the attention…☆1,147Updated last month
- Scalable toolkit for efficient model reinforcement☆1,009Updated this week
- Arena-Hard-Auto: An automatic LLM benchmark.☆950Updated 4 months ago
- Unleashing the Power of Reinforcement Learning for Math and Code Reasoners☆729Updated 5 months ago
- DataComp for Language Models☆1,385Updated last month
- ☆1,348Updated 11 months ago
- Parallel Scaling Law for Language Model — Beyond Parameter and Inference Time Scaling☆450Updated 5 months ago
- Scalable RL solution for advanced reasoning of language models☆1,764Updated 7 months ago
- A project to improve skills of large language models☆600Updated this week
- Analyze computation-communication overlap in V3/R1.☆1,112Updated 7 months ago
- Scalable toolkit for efficient model alignment☆843Updated last month
- An Open-source RL System from ByteDance Seed and Tsinghua AIR☆1,616Updated 5 months ago
- A series of math-specific large language models of our Qwen2 series.☆1,024Updated 9 months ago
- Official repository for the paper "LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code"☆697Updated 3 months ago
- A curated list of open-source projects related to DeepSeek Coder☆720Updated last year
- [COLM 2025] LIMO: Less is More for Reasoning☆1,042Updated 3 months ago
- MoBA: Mixture of Block Attention for Long-Context LLMs☆1,950Updated 7 months ago
- Fully open data curation for reasoning models☆2,132Updated 2 months ago
- OpenSeek aims to unite the global open source community to drive collaborative innovation in algorithms, data and systems to develop next…☆237Updated last month
- Evaluation suite for LLMs☆365Updated 3 months ago