ZJU-REAL / EasySteer
A Unified Framework for High-Performance and Extensible LLM Steering
☆85 Updated last week
Alternatives and similar repositories for EasySteer
Users who are interested in EasySteer are comparing it to the repositories listed below.
- ☆36 Updated 2 weeks ago
- ☆29 Updated 2 months ago
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning ☆86 Updated 8 months ago
- [NeurIPS 2025] Mind the Gap: Bridging Thought Leap for Improved CoT Tuning https://arxiv.org/abs/2505.14684 ☆42 Updated last week
- [NeurIPS 2025] Code for Let LLMs Break Free from Overthinking via Self-Braking Tuning. https://arxiv.org/abs/2505.14604 ☆49 Updated 3 weeks ago
- [NeurIPS'25 Spotlight] ARM: Adaptive Reasoning Model ☆56 Updated 2 weeks ago
- GSM8K-V: Can Vision Language Models Solve Grade School Math Word Problems in Visual Contexts ☆34 Updated 3 weeks ago
- [ICML'25] Official code of paper "Fast Large Language Model Collaborative Decoding via Speculation" ☆28 Updated 4 months ago
- Official repository for paper: O1-Pruner: Length-Harmonizing Fine-Tuning for O1-Like Reasoning Pruning ☆93 Updated 8 months ago
- Official Repository of LatentSeek ☆66 Updated 4 months ago
- Interleaving Reasoning: Next-Generation Reasoning Systems for AGI ☆187 Updated last week
- Code for "CREAM: Consistency Regularized Self-Rewarding Language Models", ICLR 2025. ☆26 Updated 8 months ago
- [NeurIPS 2025] Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models ☆47 Updated 3 weeks ago
- ACL'2025: SoftCoT: Soft Chain-of-Thought for Efficient Reasoning with LLMs. and preprint: SoftCoT++: Test-Time Scaling with Soft Chain-of… ☆56 Updated 4 months ago
- [ACL 2025] A Generalizable and Purely Unsupervised Self-Training Framework ☆71 Updated 4 months ago
- ☆38 Updated 2 months ago
- Source code for our paper: "ARIA: Training Language Agents with Intention-Driven Reward Aggregation". ☆23 Updated 2 months ago
- Official repository of the video reasoning benchmark MMR-V. Can Your MLLMs "Think with Video"? ☆36 Updated 4 months ago
- [EMNLP 2025] LightThinker: Thinking Step-by-Step Compression ☆112 Updated 6 months ago
- Chain of Thoughts (CoT) is so hot! so long! We need short reasoning process! ☆69 Updated 6 months ago
- ☆68 Updated 4 months ago
- ☆15 Updated 4 months ago
- The official repository of NeurIPS'25 paper "Ada-R1: From Long-Cot to Hybrid-CoT via Bi-Level Adaptive Reasoning Optimization" ☆19 Updated last month
- The official code repository for the paper "Mirage or Method? How Model–Task Alignment Induces Divergent RL Conclusions". ☆15 Updated last month
- ☆29 Updated last month
- [ICLR 2025] SuperCorrect: Advancing Small LLM Reasoning with Thought Template Distillation and Self-Correction ☆83 Updated 7 months ago
- Extrapolating RLVR to General Domains without Verifiers ☆174 Updated 2 months ago
- [arxiv: 2505.02156] Adaptive Thinking via Mode Policy Optimization for Social Language Agents ☆46 Updated 3 months ago
- This repository contains a regularly updated paper list for LLMs-reasoning-in-latent-space. ☆177 Updated last week
- CPPO: Accelerating the Training of Group Relative Policy Optimization-Based Reasoning Models (NeurIPS 2025) ☆155 Updated last week