lucywang720 / model-surgery
☆31 Updated 11 months ago
Alternatives and similar repositories for model-surgery
Users who are interested in model-surgery are comparing it to the repositories listed below
- ☆144 Updated 10 months ago
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning ☆91 Updated 11 months ago
- ☆46 Updated 4 months ago
- [ACL'25 Oral] What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective ☆75 Updated 7 months ago
- [ICLR 2025] When Attention Sink Emerges in Language Models: An Empirical View (Spotlight) ☆154 Updated 7 months ago
- [ICML 2024] Unveiling and Harnessing Hidden Attention Sinks: Enhancing Large Language Models without Training through Attention Calibrati… ☆46 Updated last year
- Optimizing Anytime Reasoning via Budget Relative Policy Optimization ☆51 Updated 6 months ago
- [ICLR 25 Oral] RM-Bench: Benchmarking Reward Models of Language Models with Subtlety and Style ☆75 Updated 6 months ago
- (NeurIPS 2025) Official implementation for "MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?" ☆47 Updated 8 months ago
- [ICML 2025] Official implementation of the paper "SkipGPT: Dynamic Layer Pruning Reinvented with Token Awareness and Module Decoupling". … ☆19 Updated 2 months ago
- ☆45 Updated last month
- ☆63 Updated 6 months ago
- A Sober Look at Language Model Reasoning ☆92 Updated 2 months ago
- Code for "Language Models Can Learn from Verbal Feedback Without Scalar Rewards" ☆59 Updated last month
- official implementation of ICLR'2025 paper: Rethinking Bradley-Terry Models in Preference-based Reward Modeling: Foundations, Theory, and… ☆70 Updated 10 months ago
- This is the official implementation of the paper "S²R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning" ☆74 Updated 9 months ago
- Official PyTorch Implementation of EMoE: Unlocking Emergent Modularity in Large Language Models [main conference @ NAACL2024] ☆39 Updated last year
- DELLA-Merging: Reducing Interference in Model Merging through Magnitude-Based Sampling ☆36 Updated last year
- FeatureAlignment = Alignment + Mechanistic Interpretability ☆34 Updated 11 months ago
- [ICLR 2025] SuperCorrect: Advancing Small LLM Reasoning with Thought Template Distillation and Self-Correction ☆87 Updated 10 months ago
- [NeurIPS 2024 Spotlight] EMR-Merging: Tuning-Free High-Performance Model Merging ☆76 Updated 11 months ago
- The rule-based evaluation subset and code implementation of Omni-MATH ☆26 Updated last year
- ☆145 Updated 4 months ago
- ☆64 Updated this week
- SIFT: Grounding LLM Reasoning in Contexts via Stickers ☆57 Updated 11 months ago
- AdaRFT: Efficient Reinforcement Finetuning via Adaptive Curriculum Learning ☆54 Updated 7 months ago
- ☆205 Updated last month
- Code for Heima ☆59 Updated 9 months ago
- A curated list of resources on Reinforcement Learning with Verifiable Rewards (RLVR) and the reasoning capability boundary of Large Langu… ☆85 Updated last month
- [ICML 2025] M-STAR (Multimodal Self-Evolving TrAining for Reasoning) Project. Diving into Self-Evolving Training for Multimodal Reasoning ☆70 Updated 6 months ago