WanliYoung / Revisit-Editing-EvaluationView external linksLinks
Code and data repository for "The Mirage of Model Editing: Revisiting Evaluation in the Wild"
☆16Aug 27, 2025Updated 5 months ago
Alternatives and similar repositories for Revisit-Editing-Evaluation
Users that are interested in Revisit-Editing-Evaluation are comparing it to the libraries listed below
Sorting:
- LaTeX Drawing☆18Dec 22, 2025Updated last month
- Official repo for NeurIPS'24 paper "WAGLE: Strategic Weight Attribution for Effective and Modular Unlearning in Large Language Models"☆18Dec 16, 2024Updated last year
- ☆24Apr 20, 2024Updated last year
- ☆27Oct 6, 2024Updated last year
- Official repo for EMNLP'24 paper "SOUL: Unlocking the Power of Second-Order Optimization for LLM Unlearning"☆29Oct 1, 2024Updated last year
- [NeurIPS 2024] Large Language Model Unlearning via Embedding-Corrupted Prompts☆38Sep 26, 2024Updated last year
- [ICLR 2025] A Closer Look at Machine Unlearning for Large Language Models☆44Dec 4, 2024Updated last year
- ☆10Jul 6, 2023Updated 2 years ago
- A Mechanistic‑Interpretability study that finds the structural dynamics of Large Language Models under fine‑tuning.☆16May 30, 2025Updated 8 months ago
- ☆10Jul 3, 2024Updated last year
- [TMLR 25] An automated method for explaining complex neuron behaviors in deep vision models using large language models☆10Feb 20, 2025Updated 11 months ago
- ☆14Mar 15, 2025Updated 11 months ago
- ☆10Apr 14, 2022Updated 3 years ago
- Official implementation of Tabular Transfer Learning via Prompting LLMs (COLM 2024).☆12Aug 6, 2024Updated last year
- Official implementation of "Interpreting and Controlling Vision Foundation Models via Text Explanations"☆14May 29, 2024Updated last year
- ☆10Sep 8, 2022Updated 3 years ago
- ☆13Sep 8, 2024Updated last year
- emo•ji for all (LaTeX engines) 🎉☆15May 22, 2023Updated 2 years ago
- Official Code for What Makes and Breaks Safety Fine-tuning? A Mechanistic Study (NeurIPS 2024)☆12Oct 31, 2024Updated last year
- This is a repository for "PMET: Precise Model Editing in a Transformer"☆55Sep 28, 2023Updated 2 years ago
- ☆16Sep 9, 2021Updated 4 years ago
- Accompanying codebase for neuroscope.io, a website for displaying max activating dataset examples for language model neurons☆13Feb 13, 2023Updated 3 years ago
- Code and data for the ACM CIKM 2024 paper "Adversarial Text Rewriting for Text-aware Recommender Systems"☆12Aug 1, 2024Updated last year
- [IROS 2024] "ComTraQ-MPC: Meta-Trained DQN-MPC Integration for Trajectory Tracking with Limited Active Localization Updates" by Gokul Put…☆13Apr 10, 2025Updated 10 months ago
- Official repository of Generating Multiple-Length Summaries via Reinforcement Learning for Unsupervised Sentence Summarization [EMNLP'22 …☆10May 20, 2023Updated 2 years ago
- minimalistic AI library that resembles HF's transformers☆13Dec 31, 2024Updated last year
- Code for "SUGAR: Subgraph Neural Network with Reinforcement Pooling and Self-Supervised Mutual Information Mechanism""☆10Apr 17, 2021Updated 4 years ago
- Trains Sparse Autoencoders based on outputs from language models☆11Oct 7, 2024Updated last year
- ICCV 2023 - AdaptGuard: Defending Against Universal Attacks for Model Adaptation☆11Dec 23, 2023Updated 2 years ago
- An implementation of vdist2vec model in paper A Learning Based Approach to Predict Shortest-Path Distances☆11Apr 8, 2022Updated 3 years ago
- The implementation of Meta-Pec☆12Sep 13, 2023Updated 2 years ago
- 【入口,请看这里!】Bulletin of our awesome collections 📓📔📒📕📗📘📙📚📖🔖☆11Mar 13, 2018Updated 7 years ago
- Tools for optimizing steering vectors in LLMs.☆19Apr 10, 2025Updated 10 months ago
- ☆57Jun 13, 2024Updated last year
- [EMNLP 2024 Main] Code for the paper "Dissecting Fine-Tuning Unlearning in Large Language Models"☆15Oct 10, 2024Updated last year
- Repo for paper: Examining LLMs' Uncertainty Expression Towards Questions Outside Parametric Knowledge☆14Feb 20, 2024Updated last year
- [ICLR 2024] Towards Elminating Hard Label Constraints in Gradient Inverision Attacks☆14Feb 6, 2024Updated 2 years ago
- ☆12Oct 20, 2023Updated 2 years ago
- ☆34Feb 11, 2025Updated last year