deepseek-ai / ESFTLinks
Expert Specialized Fine-Tuning
☆649Updated last month
Alternatives and similar repositories for ESFT
Users that are interested in ESFT are comparing it to the libraries listed below
Sorting:
- DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models☆1,741Updated last year
- ☆529Updated 10 months ago
- DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models☆2,801Updated last year
- DeepSeek-VL: Towards Real-World Vision-Language Understanding☆3,905Updated last year
- An Open Large Reasoning Model for Real-World Solutions☆1,502Updated last month
- [ICML 2025 Oral] CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction☆537Updated 2 months ago
- Muon is Scalable for LLM Training☆1,091Updated 3 months ago
- ☆580Updated 2 months ago
- ☆792Updated 3 weeks ago
- A curated list of open-source projects related to DeepSeek Coder☆709Updated last year
- Official Repo for Open-Reasoner-Zero☆1,983Updated last month
- Analyze computation-communication overlap in V3/R1.☆1,075Updated 3 months ago
- Fully open data curation for reasoning models☆1,959Updated last month
- ☆1,356Updated 7 months ago
- DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model☆4,921Updated 9 months ago
- Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution"☆561Updated 3 months ago
- Large Reasoning Models☆805Updated 7 months ago
- Scalable RL solution for advanced reasoning of language models☆1,642Updated 3 months ago
- Unleashing the Power of Reinforcement Learning for Math and Code Reasoners☆644Updated last month
- Expert Parallelism Load Balancer☆1,226Updated 3 months ago
- OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models☆1,794Updated 5 months ago
- OLMoE: Open Mixture-of-Experts Language Models☆798Updated 3 months ago
- Official repository for the paper "LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code"☆571Updated 2 weeks ago
- MoBA: Mixture of Block Attention for Long-Context LLMs☆1,813Updated 3 months ago
- ZeroSearch: Incentivize the Search Capability of LLMs without Searching☆1,036Updated this week
- Kimi-VL: Mixture-of-Experts Vision-Language Model for Multimodal Reasoning, Long-Context Understanding, and Strong Agent Capabilities☆946Updated 2 weeks ago
- A bidirectional pipeline parallelism algorithm for computation-communication overlap in V3/R1 training.☆2,823Updated 3 months ago
- ReasonFlux Series - Open-source innovative LLM post-training algorithms focusing on data selection, reinforcement learning, and inference…☆442Updated this week
- Parallel Scaling Law for Language Model — Beyond Parameter and Inference Time Scaling☆410Updated last month
- ☆3,389Updated 4 months ago