convergence-ai / lm2
Official repo of paper LM2
☆37Updated 2 months ago
Alternatives and similar repositories for lm2:
Users that are interested in lm2 are comparing it to the libraries listed below
- ☆77Updated 8 months ago
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.☆170Updated 3 months ago
- Code for "Reasoning to Learn from Latent Thoughts"☆89Updated 3 weeks ago
- EvaByte: Efficient Byte-level Language Models at Scale☆87Updated last month
- The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling☆29Updated last month
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆55Updated 7 months ago
- ☆114Updated 2 months ago
- Repo for "Z1: Efficient Test-time Scaling with Code"☆53Updated last week
- Official Implementation for the paper "d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning"☆55Updated this week
- nanoGRPO is a lightweight implementation of Group Relative Policy Optimization (GRPO)☆98Updated last week
- ☆65Updated this week
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks☆177Updated last week
- Code Implementation, Evaluations, Documentation, Links and Resources for Min P paper☆30Updated last month
- Code and data for the paper "Why think step by step? Reasoning emerges from the locality of experience"☆60Updated 2 weeks ago
- The official implementation of Self-Exploring Language Models (SELM)☆63Updated 10 months ago
- ☆106Updated 3 months ago
- Code for☆27Updated 4 months ago
- A repository for research on medium sized language models.☆76Updated 10 months ago
- A Large-Scale, High-Quality Math Dataset for Reinforcement Learning in Language Models☆47Updated last month
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?☆62Updated last month
- Tiny re-implementation of MDM in style of LLaDA and nano-gpt speedrun☆47Updated last month
- ☆24Updated 3 months ago
- ☆46Updated last week
- Flow of Reasoning: Training LLMs for Divergent Problem Solving with Minimal Examples☆84Updated 3 weeks ago
- ☆54Updated last week
- Code for paper called Self-Training Elicits Concise Reasoning in Large Language Models☆23Updated 3 weeks ago
- Agentic Knowledgeable Self-awareness☆47Updated last week
- ☆48Updated last week
- ☆31Updated 3 months ago
- Official Code Release for "Training a Generally Curious Agent"☆20Updated 3 weeks ago