Code for "Reasoning to Learn from Latent Thoughts"
☆126 · Mar 28, 2025 · Updated 11 months ago
Alternatives and similar repositories for BoLT
Users that are interested in BoLT are comparing it to the libraries listed below.
- Official code for Guiding Language Model Math Reasoning with Planning Tokens ☆19 · Feb 29, 2024 · Updated 2 years ago
- Reflect-RL: Two-Player Online RL Fine-Tuning for LMs ☆18 · Jul 19, 2025 · Updated 8 months ago
- Kinetics: Rethinking Test-Time Scaling Laws ☆86 · Jul 11, 2025 · Updated 8 months ago
- [EMNLP 25] An effective and interpretable weight-editing method for mitigating overly short reasoning in LLMs, and a mechanistic study un… ☆17 · Dec 17, 2025 · Updated 3 months ago
- [COLM 2025] SEAL: Steerable Reasoning Calibration of Large Language Models for Free ☆55 · Apr 6, 2025 · Updated 11 months ago
- Official Implementation of the paper "Jointly Reinforcing Diversity and Quality in Language Model Generations" ☆57 · Dec 26, 2025 · Updated 2 months ago
- Code for Heima ☆59 · Apr 21, 2025 · Updated 11 months ago
- ☆225 · Mar 26, 2025 · Updated 11 months ago
- ☆14 · May 4, 2024 · Updated last year
- Codes for Merging Large Language Models ☆35 · Aug 7, 2024 · Updated last year
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024] ☆149 · Oct 27, 2024 · Updated last year
- [NeurIPS 2025] RL Tango: Reinforcing Generator and Verifier Together for Language Reasoning ☆52 · Oct 23, 2025 · Updated 5 months ago
- ☆56 · Apr 11, 2024 · Updated last year
- ☆16 · Sep 6, 2024 · Updated last year
- [ICML 2025] Official implementation of the paper "SkipGPT: Dynamic Layer Pruning Reinvented with Token Awareness and Module Decoupling". … ☆21 · Nov 17, 2025 · Updated 4 months ago
- ☆48 · Sep 29, 2024 · Updated last year
- PyTorch implementation of paper "StyDeSty: Min-Max Stylization and Destylization for Single Domain Generalization" in ICML 2024. ☆16 · Jun 4, 2024 · Updated last year
- ☆206 · Apr 19, 2025 · Updated 11 months ago
- Pretraining and inference code for a large-scale depth-recurrent language model ☆865 · Dec 29, 2025 · Updated 2 months ago
- Codebase for VidHal: Benchmarking Hallucinations in Vision LLMs ☆14 · Apr 19, 2025 · Updated 11 months ago
- [ICLR 2025 Spotlight] Code release for "Sharpness-Aware Minimization Efficiently Selects Flatter Minima Late In Training" ☆18 · Feb 20, 2025 · Updated last year
- ☆19 · Mar 10, 2025 · Updated last year
- ☆41 · Jul 6, 2025 · Updated 8 months ago
- Revisiting Mid-training in the Era of Reinforcement Learning Scaling ☆186 · Jul 23, 2025 · Updated 8 months ago
- [COLM 2025] Official code for "When To Solve, When To Verify: Compute-Optimal Problem Solving and Generative Verification for LLM Reasoni… ☆15 · Oct 31, 2025 · Updated 4 months ago
- ☆23 · Jun 11, 2025 · Updated 9 months ago
- ☆142 · Jan 26, 2026 · Updated last month
- ☆52 · Oct 23, 2023 · Updated 2 years ago
- exploring whether LLMs perform case-based or rule-based reasoning ☆30 · Mar 2, 2024 · Updated 2 years ago
- Official code for ICLR 2024 paper, SEABO: A Simple Search-Based Method for Offline Imitation Learning ☆12 · Jan 19, 2024 · Updated 2 years ago
- The official repository of "Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint" ☆39 · Jan 12, 2024 · Updated 2 years ago
- ☆14 · Mar 9, 2025 · Updated last year
- Function Vectors in Large Language Models (ICLR 2024) ☆193 · Apr 17, 2025 · Updated 11 months ago
- Code for paper called Self-Training Elicits Concise Reasoning in Large Language Models ☆42 · Apr 22, 2025 · Updated 11 months ago
- [Interspeech 2024] LiteFocus is a tool designed to accelerate diffusion-based TTA model, now implemented with the base model AudioLDM2. ☆34 · Mar 11, 2025 · Updated last year
- ☆17 · Dec 7, 2025 · Updated 3 months ago
- Your efficient and accurate answer verification system for RL training. ☆41 · Jun 23, 2025 · Updated 9 months ago
- [ICLR'24 Spotlight] Code for the paper "Merge, Then Compress: Demystify Efficient SMoE with Hints from Its Routing Policy" ☆104 · Jun 20, 2025 · Updated 9 months ago
- GenRM-CoT: Data release for verification rationales ☆67 · Oct 16, 2024 · Updated last year