Erasing conceptual knowledge from language models through low-rank fine-tuning
☆23Mar 27, 2025Updated last year
Alternatives and similar repositories for erasing-llm
Users that are interested in erasing-llm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Distilling Diversity and Control in Diffusion Models☆52Apr 28, 2025Updated last year
- SliderSpace: Decomposing the Visual Capabilities of Diffusion Models☆123Nov 25, 2025Updated 6 months ago
- Official Code for What Makes and Breaks Safety Fine-tuning? A Mechanistic Study (NeurIPS 2024)☆12Oct 31, 2024Updated last year
- Unified Concept Editing in Diffusion Models☆193Dec 7, 2025Updated 6 months ago
- ☆18Jun 8, 2026Updated last week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Concept Sliders for Precise Control of Diffusion Models☆1,134Apr 13, 2026Updated 2 months ago
- ☆25Dec 12, 2025Updated 6 months ago
- Official codes for COLING 2024 paper "Robust and Scalable Model Editing for Large Language Models": https://arxiv.org/abs/2403.17431v1☆14Mar 27, 2024Updated 2 years ago
- [ICML 2025] Official PyTorch implementation of "NegMerge: Sign-Consensual Weight Merging for Machine Unlearning"☆16Nov 25, 2025Updated 6 months ago
- ☆10Oct 29, 2020Updated 5 years ago
- An official implementation of "Catastrophic Failure of LLM Unlearning via Quantization" (ICLR 2025)☆38Feb 22, 2025Updated last year
- ☆10Apr 23, 2026Updated last month
- Official implementation of "Opt-In Art: Learning Art Styles Only from Few Examples" (Accepted by NeurIPS 2025)☆33Nov 30, 2025Updated 6 months ago
- Influence Maximization Paper List☆11May 11, 2022Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆12Jan 10, 2023Updated 3 years ago
- [MICCAI 2025] Hierarchical Self-Supervised Adversarial Training for Robust Vision Models in Histopathology☆12Jun 17, 2025Updated last year
- MishformerLens intends to be a drop-in replacement for TransformerLens that AST patches HuggingFace Transformers rather than implementing…☆10Oct 7, 2024Updated last year
- ☆15Feb 26, 2025Updated last year
- A method to generate counterfactuals☆12Feb 24, 2026Updated 3 months ago
- Implementing LRP (Layer-wise Relevance Propagation) for a sequence-to-sequence model with GRU layers.☆12Sep 8, 2023Updated 2 years ago
- Official repository of "A Hitchhiker's Guide to Fine-Grained Face Forgery Detection Using Common Sense Reasoning" published in NeurIPS'20…☆12Feb 7, 2025Updated last year
- DL Backtrace is a new explainablity technique for deep learning models that works for any modality and model type.☆26May 13, 2026Updated last month
- [CVPR 2025] An Implementation of the paper "Pre-Instruction Data Selection for Visual Instruction Tuning"☆17Jun 9, 2025Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Explainability of Deep RL algorithms using graph networks and layer-wise relevance propagation.☆12Aug 20, 2024Updated last year
- [EMNLP 25] An effective and interpretable weight-editing method for mitigating overly short reasoning in LLMs, and a mechanistic study un…☆19Dec 17, 2025Updated 6 months ago
- [EMNLP'22] Textual Manifold-based Defense Against Natural Language Adversarial Examples☆11Apr 6, 2023Updated 3 years ago
- Scalable and computationally efficient deep reinforcement learning framework for influence maximization☆13May 10, 2025Updated last year
- Manage your ever-growing list of research papers☆14Nov 19, 2023Updated 2 years ago
- [NeurIPS'24] What makes unlearning hard and what to do about it☆22May 24, 2025Updated last year
- A tool for model sparse based on torch.fx☆13Jun 3, 2024Updated 2 years ago
- Single Image Backdoor Inversion via Robust Smoothed Classifiers☆17Jul 18, 2023Updated 2 years ago
- ☆26Oct 18, 2024Updated last year
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- ☆12Mar 5, 2024Updated 2 years ago
- [ECCV2024] Immunizing text-to-image Models against Malicious Adaptation☆18Jan 17, 2025Updated last year
- Abstract. Person search is a challenging problem with various real- world applications, that aims at joint person detection and re-identi…☆13Feb 28, 2024Updated 2 years ago
- All-in-One Safety Evaluation Framwork☆50Apr 21, 2026Updated last month
- OxML2020☆12Aug 14, 2020Updated 5 years ago
- dynamic graph/network embedding/representation methods☆17Apr 27, 2020Updated 6 years ago
- DDAM-PS: Diligent Domain Adaptive Mixer for Person Search -- WACV2024☆13Feb 28, 2024Updated 2 years ago