flamewei123 / DEPN
☆24 · Apr 20, 2024 · Updated last year
Alternatives and similar repositories for DEPN
Users that are interested in DEPN are comparing it to the libraries listed below
- Code for the WWW'23 paper "Sanitizing Sentence Embeddings (and Labels) for Local Differential Privacy" ☆12 · Feb 20, 2023 · Updated 2 years ago
- Code and data repository for "The Mirage of Model Editing: Revisiting Evaluation in the Wild" ☆16 · Aug 27, 2025 · Updated 5 months ago
- ☆17 · Nov 7, 2023 · Updated 2 years ago
- PyTorch and NNsight implementation of AtP* (Kramar et al 2024, DeepMind) ☆20 · Jan 19, 2025 · Updated last year
- [NeurIPS 2024 D&B] Evaluating Copyright Takedown Methods for Language Models ☆17 · Jul 17, 2024 · Updated last year
- ☆57 · Jun 13, 2024 · Updated last year
- ☆25 · Aug 18, 2023 · Updated 2 years ago
- ☆28 · Sep 13, 2024 · Updated last year
- Retired problem sets and lab exercises made available for self-study. ☆16 · Sep 20, 2021 · Updated 4 years ago
- [NeurIPS 2024] Large Language Model Unlearning via Embedding-Corrupted Prompts ☆38 · Sep 26, 2024 · Updated last year
- Official Repository for ACL 2024 Paper SafeDecoding: Defending against Jailbreak Attacks via Safety-Aware Decoding ☆151 · Jul 19, 2024 · Updated last year
- ☆11 · Jun 7, 2023 · Updated 2 years ago
- Introduction to the Random Forest algorithm for classification problems and how to select important features in your dataset. ☆12 · Aug 1, 2020 · Updated 5 years ago
- Official implementation of Privacy Implications of Retrieval-Based Language Models (EMNLP 2023). https://arxiv.org/abs/2305.14888 ☆37 · Jun 10, 2024 · Updated last year
- A conversational recommender system dataset with high-quality explanations. ☆11 · Apr 26, 2023 · Updated 2 years ago
- ☆13 · Apr 10, 2025 · Updated 10 months ago
- ☆14 · Jan 6, 2025 · Updated last year
- Testing DeepSpeed integration in 🤗 Accelerate ☆11 · Jun 28, 2022 · Updated 3 years ago
- A simple and efficient baseline for data attribution ☆11 · Nov 10, 2023 · Updated 2 years ago
- ☆15 · Mar 20, 2025 · Updated 10 months ago
- A very hacky set of functions for getting plotly to do what I want when doing mech interp research, designed to be compatible with PyTorc… ☆12 · Jun 16, 2023 · Updated 2 years ago
- Official Code for What Makes and Breaks Safety Fine-tuning? A Mechanistic Study (NeurIPS 2024) ☆12 · Oct 31, 2024 · Updated last year
- UCPR: User-Centric Path Reasoning towards Explainable Recommendation, SIGIR 2021 ☆12 · Jun 18, 2022 · Updated 3 years ago
- ☆12 · May 2, 2022 · Updated 3 years ago
- AMT-CDR: A Deep Adversarial Multi-channel Transfer Network for Cross-domain Recommendation ☆12 · Nov 2, 2023 · Updated 2 years ago
- DatasetResearch: Benchmarking Agent Systems for Demand-Driven Dataset Discovery ☆20 · Sep 24, 2025 · Updated 4 months ago
- [TMLR 25] An automated method for explaining complex neuron behaviors in deep vision models using large language models ☆10 · Feb 20, 2025 · Updated 11 months ago
- ☆14 · Mar 15, 2025 · Updated 10 months ago
- ☆14 · Jan 17, 2026 · Updated 3 weeks ago
- Official implementation of "Interpreting and Controlling Vision Foundation Models via Text Explanations" ☆14 · May 29, 2024 · Updated last year
- ☆18 · May 30, 2025 · Updated 8 months ago
- Thai word segmentation using deep learning ☆14 · Jul 1, 2019 · Updated 6 years ago
- Code repository of Machine Learning for Quantum Chemistry book ☆10 · Jun 5, 2023 · Updated 2 years ago
- code for unsupervised entity resolution ☆10 · Apr 26, 2019 · Updated 6 years ago
- LLM Unlearning ☆181 · Oct 20, 2023 · Updated 2 years ago
- minimalistic AI library that resembles HF's transformers ☆13 · Dec 31, 2024 · Updated last year
- Top-tier conference papers on out-of-distribution detection ☆11 · Jun 22, 2023 · Updated 2 years ago
- Official repository of Generating Multiple-Length Summaries via Reinforcement Learning for Unsupervised Sentence Summarization [EMNLP'22 … ☆10 · May 20, 2023 · Updated 2 years ago
- Trains Sparse Autoencoders based on outputs from language models ☆11 · Oct 7, 2024 · Updated last year