zhrli324 / RLEdit
[ICML2025] Official code for "Reinforced Lifelong Editing for Language Models"
☆16 Updated 7 months ago
Alternatives and similar repositories for RLEdit
Users that are interested in RLEdit are comparing it to the libraries listed below
- This repository contains a regularly updated paper list for LLMs-reasoning-in-latent-space. ☆157 Updated this week
- Official repository for paper: O1-Pruner: Length-Harmonizing Fine-Tuning for O1-Like Reasoning Pruning ☆88 Updated 7 months ago
- ☆166 Updated 4 months ago
- [ICLR 2025] When Attention Sink Emerges in Language Models: An Empirical View (Spotlight) ☆123 Updated 2 months ago
- ☆50 Updated 2 months ago
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning ☆86 Updated 7 months ago
- Optimizing Anytime Reasoning via Budget Relative Policy Optimization ☆46 Updated 2 months ago
- [ICLR 2025] Code and Data Repo for Paper "Latent Space Chain-of-Embedding Enables Output-free LLM Self-Evaluation" ☆79 Updated 9 months ago
- [ACL 2024] Code and data for "Machine Unlearning of Pre-trained Large Language Models" ☆60 Updated 11 months ago
- A Sober Look at Language Model Reasoning ☆83 Updated last week
- ☆122 Updated 6 months ago
- [ICML 2024] Unveiling and Harnessing Hidden Attention Sinks: Enhancing Large Language Models without Training through Attention Calibrati… ☆42 Updated last year
- ☆332 Updated last month
- 📜 Paper list on decoding methods for LLMs and LVLMs ☆58 Updated 2 months ago
- ACL'2025: SoftCoT: Soft Chain-of-Thought for Efficient Reasoning with LLMs. and preprint: SoftCoT++: Test-Time Scaling with Soft Chain-of… ☆46 Updated 3 months ago
- Code for Heima ☆53 Updated 5 months ago
- AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models, ICLR 2025 (Outstanding Paper) ☆329 Updated 2 months ago
- [ICML'25] Our study systematically investigates massive values in LLMs' attention mechanisms. First, we observe massive values are concen… ☆80 Updated 3 months ago
- [ACL 2025] Knowledge Unlearning for Large Language Models ☆42 Updated this week
- Code for "CREAM: Consistency Regularized Self-Rewarding Language Models", ICLR 2025. ☆26 Updated 7 months ago
- In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation (ICML 2024) ☆61 Updated last year
- Official code for SEAL: Steerable Reasoning Calibration of Large Language Models for Free ☆40 Updated 5 months ago
- ☆43 Updated 5 months ago
- SIFT: Grounding LLM Reasoning in Contexts via Stickers ☆58 Updated 6 months ago
- [NeurIPS 2024] Code for the paper "Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models" ☆178 Updated 6 months ago
- The official repository of NeurIPS'25 paper "Ada-R1: From Long-Cot to Hybrid-CoT via Bi-Level Adaptive Reasoning Optimization" ☆18 Updated this week
- ☆69 Updated 10 months ago
- [EMNLP 2025] TokenSkip: Controllable Chain-of-Thought Compression in LLMs ☆179 Updated 2 months ago
- [NeurIPS 2024 Spotlight] EMR-Merging: Tuning-Free High-Performance Model Merging ☆69 Updated 6 months ago
- 📖 This is a repository for organizing papers, codes, and other resources related to Latent Reasoning. ☆210 Updated last week