mlwu22 / RED
Implementation code for ACL2024:Advancing Parameter Efficiency in Fine-tuning via Representation Editing
☆14Updated last year
Alternatives and similar repositories for RED:
Users that are interested in RED are comparing it to the libraries listed below
- A versatile toolkit for applying Logit Lens to modern large language models (LLMs). Currently supports Llama-3.1-8B and Qwen-2.5-7B, enab…☆75Updated 2 months ago
- FeatureAlignment = Alignment + Mechanistic Interpretability☆28Updated 2 months ago
- Code associated with Tuning Language Models by Proxy (Liu et al., 2024)☆109Updated last year
- Model merging is a highly efficient approach for long-to-short reasoning.☆43Updated last month
- LoFiT: Localized Fine-tuning on LLM Representations☆37Updated 3 months ago
- This repository contains a regularly updated paper list for LLMs-reasoning-in-latent-space.☆78Updated last week
- Official code for SEAL: Steerable Reasoning Calibration of Large Language Models for Free☆20Updated last month
- code for EMNLP 2024 paper: Neuron-Level Knowledge Attribution in Large Language Models☆31Updated 5 months ago
- code for ACL24 "MELoRA: Mini-Ensemble Low-Rank Adapter for Parameter-Efficient Fine-Tuning"☆19Updated 2 months ago
- ☆59Updated 3 weeks ago
- TokenSkip: Controllable Chain-of-Thought Compression in LLMs☆136Updated last month
- ☆34Updated 2 months ago
- ☆32Updated 7 months ago
- Chain of Thoughts (CoT) is so hot! so long! We need short reasoning process!☆50Updated last month
- [ICLR 25 Oral] RM-Bench: Benchmarking Reward Models of Language Models with Subtlety and Style☆34Updated last month
- Language Imbalance Driven Rewarding for Multilingual Self-improving☆17Updated 6 months ago
- a survey of long-context LLMs from four perspectives, architecture, infrastructure, training, and evaluation☆46Updated last month
- awesome SAE papers☆27Updated 2 months ago
- A Survey on the Honesty of Large Language Models☆57Updated 5 months ago
- [ICLR 2025] Code and Data Repo for Paper "Latent Space Chain-of-Embedding Enables Output-free LLM Self-Evaluation"☆46Updated 4 months ago
- [ICML 2024] Assessing the Brittleness of Safety Alignment via Pruning and Low-Rank Modifications☆76Updated last month
- In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation (ICML 2024)☆57Updated last year
- LongProc: Benchmarking Long-Context Language Models on Long Procedural Generation☆23Updated last week
- ☆73Updated 11 months ago
- ☆22Updated 6 months ago
- [NeurIPS 2024] How do Large Language Models Handle Multilingualism?☆32Updated 6 months ago
- ☆22Updated 7 months ago
- ☆17Updated 2 months ago
- Code for ACL 2024 accepted paper titled "SAPT: A Shared Attention Framework for Parameter-Efficient Continual Learning of Large Language …☆34Updated 3 months ago
- "Simplicity Prevails: Rethinking Negative Preference Optimization for LLM Unlearning" by Chongyu Fan*, Jiancheng Liu*, Licong Lin*, Jingh…☆24Updated 2 months ago