convergence-ai / lm2Links
Official repo of paper LM2
☆40Updated 3 months ago
Alternatives and similar repositories for lm2
Users that are interested in lm2 are comparing it to the libraries listed below
Sorting:
- ☆79Updated 9 months ago
- Code for "Reasoning to Learn from Latent Thoughts"☆104Updated 2 months ago
- ☆46Updated 3 months ago
- The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling☆32Updated 2 months ago
- ☆25Updated 4 months ago
- ☆92Updated 8 months ago
- ☆17Updated 5 months ago
- Code for Paper: Learning Adaptive Parallel Reasoning with Language Models☆94Updated last month
- Repo for "Z1: Efficient Test-time Scaling with Code"☆59Updated last month
- ☆19Updated 3 weeks ago
- Optimizing Anytime Reasoning via Budget Relative Policy Optimization☆36Updated last week
- Official repository of "LiNeS: Post-training Layer Scaling Prevents Forgetting and Enhances Model Merging"☆26Updated 7 months ago
- A repository for research on medium sized language models.☆76Updated last year
- ☆32Updated 4 months ago
- Process Reward Models That Think☆38Updated this week
- ☆29Updated 2 weeks ago
- Implementation of 🥥 Coconut, Chain of Continuous Thought, in Pytorch☆170Updated 5 months ago
- official implementation of paper "Process Reward Model with Q-value Rankings"☆59Updated 3 months ago
- This repo is based on https://github.com/jiaweizzhao/GaLore☆28Updated 8 months ago
- Repository for the Q-Filters method (https://arxiv.org/pdf/2503.02812)☆31Updated 2 months ago
- General Reasoner: Advancing LLM Reasoning Across All Domains☆117Updated this week
- PyTorch library for Active Fine-Tuning☆77Updated 3 months ago
- Official implementation of the paper "Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space"☆136Updated this week
- Code Implementation, Evaluations, Documentation, Links and Resources for Min P paper☆33Updated 2 months ago
- From GaLore to WeLore: How Low-Rank Weights Non-uniformly Emerge from Low-Rank Gradients. Ajay Jaiswal, Lu Yin, Zhenyu Zhang, Shiwei Liu,…☆47Updated last month
- Tiny re-implementation of MDM in style of LLaDA and nano-gpt speedrun☆52Updated 2 months ago
- ☆49Updated 3 weeks ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆57Updated 9 months ago
- ☆114Updated 3 months ago
- Scaling Computer-Use Grounding via UI Decomposition and Synthesis☆49Updated this week