zhentingqi / evolm
☆47 Updated 6 months ago
Alternatives and similar repositories for evolm
Users who are interested in evolm are comparing it to the repositories listed below
- ☆346 Updated 5 months ago
- Repo for paper https://arxiv.org/abs/2504.13837 ☆314 Updated 3 weeks ago
- A Sober Look at Language Model Reasoning ☆92 Updated last month
- The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning. ☆407 Updated 6 months ago
- L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning ☆257 Updated 7 months ago
- Research Code for preprint "Optimizing Test-Time Compute via Meta Reinforcement Finetuning". ☆115 Updated 5 months ago
- 📖 This is a repository for organizing papers, codes, and other resources related to Latent Reasoning. ☆333 Updated 2 months ago
- ☆220 Updated 9 months ago
- [NeurIPS 2025] Implementation for the paper "The Surprising Effectiveness of Negative Reinforcement in LLM Reasoning" ☆150 Updated 2 months ago
- ☆201 Updated 3 weeks ago
- Repo of paper "Free Process Rewards without Process Labels" ☆168 Updated 9 months ago
- [NeurIPS 2024] Code for the paper "Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models" ☆191 Updated 10 months ago
- A repo for open research on building large reasoning models ☆127 Updated this week
- The code for creating the iGSM datasets in papers "Physics of Language Models Part 2.1, Grade-School Math and the Hidden Reasoning Proces… ☆82 Updated last year