deepseek-ai / DeepSeek-Math
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
☆954 Updated 9 months ago
Alternatives and similar repositories for DeepSeek-Math:
Users who are interested in DeepSeek-Math are comparing it to the repositories listed below:
- Code for Quiet-STaR ☆698 Updated 4 months ago
- ☆996 Updated last month
- Recipes to scale inference-time compute of open models ☆932 Updated this week
- Large Reasoning Models ☆787 Updated last month
- ☆812 Updated last week
- Arena-Hard-Auto: An automatic LLM benchmark. ☆708 Updated 2 weeks ago
- ☆366 Updated 5 months ago
- OLMoE: Open Mixture-of-Experts Language Models ☆531 Updated last month
- Scalable RL solution for advanced reasoning of language models ☆873 Updated this week
- Official implementation of "Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling" ☆831 Updated last month
- [NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward ☆800 Updated 2 months ago
- An Open Large Reasoning Model for Real-World Solutions ☆1,378 Updated last month
- The official implementation of Self-Play Fine-Tuning (SPIN) ☆1,099 Updated 8 months ago
- A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs). ☆785 Updated 2 weeks ago
- Data and tools for generating and inspecting OLMo pre-training data. ☆1,060 Updated this week
- Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware. ☆687 Updated 3 months ago
- ☆484 Updated last month
- MINT-1T: A one trillion token multimodal interleaved dataset. ☆788 Updated 5 months ago
- ☆1,137 Updated last month
- Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI ☆1,358 Updated 9 months ago
- Reference implementation of Megalodon 7B model ☆512 Updated 8 months ago
- Official repository for "Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing". Your efficient and high-quality s… ☆565 Updated last week
- Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas" ☆977 Updated 3 months ago
- YaRN: Efficient Context Window Extension of Large Language Models ☆1,398 Updated 9 months ago
- DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models ☆1,083 Updated last year
- The Truth Is In There: Improving Reasoning in Language Models with Layer-Selective Rank Reduction ☆377 Updated 6 months ago
- Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends ☆970 Updated this week
- ☆2,289 Updated this week
- 800,000 step-level correctness labels on LLM solutions to MATH problems ☆1,824 Updated last year