jianghoucheng / AlphaEdit
AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models, ICLR 2025 (Outstanding Paper)
☆207Updated 2 weeks ago
Alternatives and similar repositories for AlphaEdit:
Users that are interested in AlphaEdit are comparing it to the libraries listed below
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning☆67Updated 2 months ago
- 😎 A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and Beyond☆205Updated last week
- Paper List of Inference/Test Time Scaling/Computing☆207Updated last week
- Awesome-Long2short-on-LRMs is a collection of state-of-the-art, novel, exciting long2short methods on large reasoning models. It contains …☆203Updated last week
- Official code for the paper, "Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning"☆112Updated 2 weeks ago
- ☆95Updated 2 weeks ago
- ☆94Updated 3 weeks ago
- [Arxiv 2025] Efficient Reasoning Models: A Survey☆136Updated this week
- TokenSkip: Controllable Chain-of-Thought Compression in LLMs☆136Updated last month
- Chain of Thoughts (CoT) is so hot! so long! We need short reasoning process!☆50Updated last month
- This repository contains a regularly updated paper list for LLMs-reasoning-in-latent-space.☆78Updated last week
- A RLHF Infrastructure for Vision-Language Models☆173Updated 5 months ago
- A comprehensive collection of process reward models.☆74Updated 2 weeks ago
- ☆192Updated 2 months ago
- Implementation for "Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs"☆364Updated 3 months ago
- [NeurIPS'24 Oral] HydraLoRA: An Asymmetric LoRA Architecture for Efficient Fine-Tuning☆195Updated 5 months ago
- L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning☆195Updated last month
- The official code repository for PRMBench.☆72Updated 2 months ago
- [NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs.☆118Updated last month
- [NeurIPS 2024 Oral] Aligner: Efficient Alignment by Learning to Correct☆170Updated 3 months ago
- ☆132Updated 9 months ago
- [ICLR 2025 Workshop] "Landscape of Thoughts: Visualizing the Reasoning Process of Large Language Models"☆17Updated last week
- [ICLR 2025] When Attention Sink Emerges in Language Models: An Empirical View (Spotlight)☆72Updated 6 months ago
- The official repository for the paper "Can MLLMs Reason in Multimodality? EMMA: An Enhanced MultiModal ReAsoning Benchmark"☆51Updated last week
- Paper list for Efficient Reasoning.☆412Updated 2 weeks ago
- Latest Advances on Long Chain-of-Thought Reasoning☆273Updated 3 weeks ago
- ☆59Updated 3 weeks ago
- Official implementation of GUI-R1 : A Generalist R1-Style Vision-Language Action Model For GUI Agents☆68Updated last week
- Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models☆350Updated last week
- xVerify: Efficient Answer Verifier for Reasoning Model Evaluations☆90Updated 3 weeks ago