ZJU-REAL / EasySteerLinks
A Unified Framework for High-Performance and Extensible LLM Steering
☆110Updated last week
Alternatives and similar repositories for EasySteer
Users that are interested in EasySteer are comparing it to the libraries listed below
Sorting:
- ☆36Updated last month
- The official repository of NeurIPS'25 paper "Ada-R1: From Long-Cot to Hybrid-CoT via Bi-Level Adaptive Reasoning Optimization"☆20Updated last week
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning☆87Updated 9 months ago
- ☆29Updated 3 months ago
- [EMNLP 2025] LightThinker: Thinking Step-by-Step Compression☆121Updated 7 months ago
- Code for "CREAM: Consistency Regularized Self-Rewarding Language Models", ICLR 2025.☆27Updated 9 months ago
- This is the official implementation of the paper "S²R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning"☆71Updated 6 months ago
- Official repository for paper: O1-Pruner: Length-Harmonizing Fine-Tuning for O1-Like Reasoning Pruning☆97Updated 8 months ago
- [NeurIPS'25 Spotlight] ARM: Adaptive Reasoning Model☆56Updated 3 weeks ago
- [NeurIPS 2025] Mind the Gap: Bridging Thought Leap for Improved CoT Tuning https://arxiv.org/abs/2505.14684☆44Updated 3 weeks ago
- [ICML'25] Our study systematically investigates massive values in LLMs' attention mechanisms. First, we observe massive values are concen…☆80Updated 4 months ago
- [ICLR 2025] SuperCorrect: Advancing Small LLM Reasoning with Thought Template Distillation and Self-Correction☆83Updated 7 months ago
- ☆69Updated 5 months ago
- [ACL 2025] A Generalizable and Purely Unsupervised Self-Training Framework☆71Updated 5 months ago
- FeatureAlignment = Alignment + Mechanistic Interpretability☆31Updated 8 months ago
- Official Repository of LatentSeek☆67Updated 5 months ago
- ☆131Updated 8 months ago
- ☆181Updated 6 months ago
- Code for Evolving Language Models without Labels: Majority Drives Selection, Novelty Promotes Variation (EVOL-RL).☆39Updated last month
- [ICLR 25 Oral] RM-Bench: Benchmarking Reward Models of Language Models with Subtlety and Style☆67Updated 4 months ago
- [ICML'25] Official code of paper "Fast Large Language Model Collaborative Decoding via Speculation"☆28Updated 4 months ago
- The official GitHub repository of the paper "Recent advances in large langauge model benchmarks against data contamination: From static t…☆47Updated 2 months ago
- Towards a Unified View of Large Language Model Post-Training☆183Updated 2 months ago
- This repository contains a regularly updated paper list for LLMs-reasoning-in-latent-space.☆189Updated last week
- JudgeLRM: Large Reasoning Models as a Judge☆40Updated 2 months ago
- ☆153Updated 3 weeks ago
- ☆38Updated 3 months ago
- [ICML 2025] Official implementation of the paper "SkipGPT: Dynamic Layer Pruning Reinvented with Token Awareness and Module Decoupling". …☆15Updated 3 months ago
- Emergent Hierarchical Reasoning in LLMs/VLMs through Reinforcement Learning☆47Updated 3 weeks ago
- RM-R1: Unleashing the Reasoning Potential of Reward Models☆148Updated 4 months ago