deepseek-ai / ESFTLinks
Expert Specialized Fine-Tuning
☆639Updated 3 weeks ago
Alternatives and similar repositories for ESFT
Users that are interested in ESFT are comparing it to the libraries listed below
Sorting:
- DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models☆2,770Updated last year
- DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models☆1,723Updated last year
- ☆523Updated 10 months ago
- [ICML 2025 Oral] CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction☆534Updated last month
- OLMoE: Open Mixture-of-Experts Language Models☆785Updated 3 months ago
- An Open Large Reasoning Model for Real-World Solutions☆1,497Updated 3 weeks ago
- A curated list of open-source projects related to DeepSeek Coder☆704Updated last year
- DeepSeek-VL: Towards Real-World Vision-Language Understanding☆3,886Updated last year
- ☆789Updated last week
- Large Reasoning Models☆804Updated 6 months ago
- Muon is Scalable for LLM Training☆1,077Updated 2 months ago
- ☆938Updated 4 months ago
- LongRoPE is a novel method that can extends the context window of pre-trained LLMs to an impressive 2048k tokens.☆231Updated 9 months ago
- Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution"☆545Updated 3 months ago
- ☆569Updated 2 months ago
- LIMO: Less is More for Reasoning☆960Updated 2 months ago
- Fully open data curation for reasoning models☆1,921Updated 2 weeks ago
- [NeurIPS'24 Spotlight, ICLR'25, ICML'25] To speed up Long-context LLMs' inference, approximate and dynamic sparse calculate the attention…☆1,055Updated this week
- Scalable RL solution for advanced reasoning of language models☆1,615Updated 3 months ago
- Unleashing the Power of Reinforcement Learning for Math and Code Reasoners☆632Updated 2 weeks ago
- AllenAI's post-training codebase☆3,018Updated this week
- Scalable toolkit for efficient model alignment☆814Updated 3 weeks ago
- [ICLR 2025] Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing. Your efficient and high-quality synthetic data …☆713Updated 3 months ago
- ☆773Updated last month
- Automatic evals for LLMs☆429Updated 2 weeks ago
- ReasonFlux Series - Open-Sourced LLM Family for Reasoning, Coding, Reward Modeling and Data Selection☆406Updated last week
- DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model☆4,909Updated 8 months ago
- OpenSeek aims to unite the global open source community to drive collaborative innovation in algorithms, data and systems to develop next…☆204Updated last week
- A series of math-specific large language models of our Qwen2 series.☆952Updated 5 months ago
- A project to improve skills of large language models☆423Updated this week