deepseek-ai / ESFTLinks

Expert Specialized Fine-Tuning

☆708

Alternatives and similar repositories for ESFT

Users that are interested in ESFT are comparing it to the libraries listed below

Sorting:

deepseek-ai / DeepSeek-MoE
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
☆1,820Updated last year
deepseek-ai / DeepSeek-Math
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
☆2,961Updated last year
allenai / OLMoE
OLMoE: Open Mixture-of-Experts Language Models
☆899Updated last month
deepseek-ai / DeepSeek-Prover-V1.5
☆540Updated last year
SimpleBerry / LLaMA-O1
Large Reasoning Models
☆806Updated 11 months ago
MoonshotAI / Moonlight
Muon is Scalable for LLM Training
☆1,348Updated 3 months ago
ByteDance-Seed / Seed-Thinking-v1.5
☆817Updated 4 months ago
AIDC-AI / Marco-o1
An Open Large Reasoning Model for Real-World Solutions
☆1,524Updated 5 months ago
hkust-nlp / CodeIO
[ICML 2025 Oral] CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction
☆557Updated 6 months ago
deepseek-ai / DeepSeek-VL
DeepSeek-VL: Towards Real-World Vision-Language Understanding
☆4,002Updated last year
microsoft / MInference
[NeurIPS'24 Spotlight, ICLR'25, ICML'25] To speed up Long-context LLMs' inference, approximate and dynamic sparse calculate the attention…
☆1,147Updated last month
NVIDIA-NeMo / RL
Scalable toolkit for efficient model reinforcement
☆1,009Updated this week
lmarena / arena-hard-auto
Arena-Hard-Auto: An automatic LLM benchmark.
☆950Updated 4 months ago
SkyworkAI / Skywork-OR1
Unleashing the Power of Reinforcement Learning for Math and Code Reasoners
☆729Updated 5 months ago
mlfoundations / dclm
DataComp for Language Models
☆1,385Updated last month
Open-Source-O1 / Open-O1
☆1,348Updated 11 months ago
QwenLM / ParScale
Parallel Scaling Law for Language Model — Beyond Parameter and Inference Time Scaling
☆450Updated 5 months ago
PRIME-RL / PRIME
Scalable RL solution for advanced reasoning of language models
☆1,764Updated 7 months ago
NVIDIA-NeMo / Skills
A project to improve skills of large language models
☆600Updated this week
deepseek-ai / profile-data
Analyze computation-communication overlap in V3/R1.
☆1,112Updated 7 months ago
NVIDIA / NeMo-Aligner
Scalable toolkit for efficient model alignment
☆843Updated last month
BytedTsinghua-SIA / DAPO
An Open-source RL System from ByteDance Seed and Tsinghua AIR
☆1,616Updated 5 months ago
QwenLM / Qwen2.5-Math
A series of math-specific large language models of our Qwen2 series.
☆1,024Updated 9 months ago
LiveCodeBench / LiveCodeBench
Official repository for the paper "LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code"
☆697Updated 3 months ago
deepseek-ai / awesome-deepseek-coder
A curated list of open-source projects related to DeepSeek Coder
☆720Updated last year
GAIR-NLP / LIMO
[COLM 2025] LIMO: Less is More for Reasoning
☆1,042Updated 3 months ago
MoonshotAI / MoBA
MoBA: Mixture of Block Attention for Long-Context LLMs
☆1,950Updated 7 months ago
open-thoughts / open-thoughts
Fully open data curation for reasoning models
☆2,132Updated 2 months ago
FlagAI-Open / OpenSeek
OpenSeek aims to unite the global open source community to drive collaborative innovation in algorithms, data and systems to develop next…
☆237Updated last month
allenai / OLMo-Eval
Evaluation suite for LLMs
☆365Updated 3 months ago