Trustworthy-ML-Lab / ThinkEdit

[EMNLP 25] An effective and interpretable weight-editing method for mitigating overly short reasoning in LLMs, and a mechanistic study uncovering how reasoning length is encoded in the model’s representation space.

☆15

Alternatives and similar repositories for ThinkEdit

Users who are interested in ThinkEdit are comparing it to the repositories listed below.

VITA-Group / SEAL
Official code for SEAL: Steerable Reasoning Calibration of Large Language Models for Free
☆44Updated 6 months ago
AlphaLab-USTC / LRM-plans-CoT
The implementation of paper "On Reasoning Strength Planning in Large Reasoning Models"
☆25Updated 3 months ago
bethgelab / sober-reasoning
A Sober Look at Language Model Reasoning
☆85Updated 2 weeks ago
MingyuJ666 / Rope_with_LLM
[ICML'25] Our study systematically investigates massive values in LLMs' attention mechanisms. First, we observe massive values are concen…
☆79Updated 4 months ago
tmlr-group / NoisyRationales
[NeurIPS 2024] "Can Language Models Perform Robust Reasoning in Chain-of-thought Prompting with Noisy Rationales?"
☆37Updated 3 months ago
Guangxuan-Xiao / GSM8K-eval
☆53Updated 2 years ago
uw-nsl / safechain
[ACL 25] SafeChain: Safety of Language Models with Long Chain-of-Thought Reasoning Capabilities
☆20Updated 6 months ago
StarDewXXX / O1-Pruner
Official repository for paper: O1-Pruner: Length-Harmonizing Fine-Tuning for O1-Like Reasoning Pruning
☆93Updated 8 months ago
ChnQ / MI-Peaks
☆54Updated 3 months ago
Model-GLUE / Model-GLUE
☆18Updated last year
boyiwei / alignment-attribution-code
[ICML 2024] Assessing the Brittleness of Safety Alignment via Pruning and Low-Rank Modifications
☆85Updated 6 months ago
Alsace08 / Chain-of-Embedding
[ICLR 2025] Code and Data Repo for Paper "Latent Space Chain-of-Embedding Enables Output-free LLM Self-Evaluation"
☆80Updated 10 months ago
licong-lin / negative-preference-optimization
☆66Updated last year
nik-dim / tall_masks
Official repository of "Localizing Task Information for Improved Model Merging and Compression" [ICML 2024]
☆51Updated last year
git-disl / Booster
This is the official code for the paper "Booster: Tackling Harmful Fine-tuning for Large Language Models via Attenuating Harmful Perturba…
☆32Updated 7 months ago
git-disl / Vaccine
This is the official code for the paper "Vaccine: Perturbation-aware Alignment for Large Language Models" (NeurIPS2024)
☆47Updated 11 months ago
javiferran / sae_entities
☆63Updated 7 months ago
decoding-comp-trust / comp-trust
Codebase for decoding compressed trust.
☆24Updated last year
uiuctml / Localize-and-Stitch
Localize-and-Stitch: Efficient Model Merging via Sparse Task Arithmetic
☆30Updated last month
Zanette-Labs / efficient-reasoning
☆67Updated 6 months ago
swj0419 / muse_bench
☆28Updated 7 months ago
Unispac / shallow-vs-deep-alignment
Official Repository for The Paper: Safety Alignment Should Be Made More Than Just a Few Tokens Deep
☆157Updated 6 months ago
Lingkai-Kong / RE-Control
Code for paper: Aligning Large Language Models with Representation Editing: A Control Perspective
☆34Updated 8 months ago
git-disl / Safety-Tax
This is the official code for the paper "Safety Tax: Safety Alignment Makes Your Large Reasoning Models Less Reasonable".
☆26Updated 7 months ago
wang2226 / Awesome-LLM-Decoding
📜 Paper list on decoding methods for LLMs and LVLMs
☆61Updated 3 months ago
StarDewXXX / AdaR1
The official repository of NeurIPS'25 paper "Ada-R1: From Long-Cot to Hybrid-CoT via Bi-Level Adaptive Reasoning Optimization"
☆19Updated last month
hemingkx / TokenSkip
[EMNLP 2025] TokenSkip: Controllable Chain-of-Thought Compression in LLMs
☆186Updated 3 months ago
sail-sg / LightTrans
The official implementation of "LightTransfer: Your Long-Context LLM is Secretly a Hybrid Model with Effortless Adaptation"
☆20Updated 6 months ago
abhishekpanigrahi1996 / Skill-Localization-by-grafting
☆51Updated last year
OPTML-Group / Unlearn-Simple
[NeurIPS25] Official repo for "Simplicity Prevails: Rethinking Negative Preference Optimization for LLM Unlearning"
☆33Updated 3 weeks ago