pietrolesci / memorisation-profiles
This is the official implementation for our ACL 2024 paper: "Causal Estimation of Memorisation Profiles".
☆14 · Updated last month
Related projects
Alternatives and complementary repositories for memorisation-profiles
- A Kernel-Based View of Language Model Fine-Tuning https://arxiv.org/abs/2210.05643 ☆69 · Updated last year
- Influence Experiments ☆35 · Updated last year
- Official Repository for Dataset Inference for LLMs ☆23 · Updated 3 months ago
- EMNLP'2022: BERTScore is Unfair: On Social Bias in Language Model-Based Metrics for Text Generation ☆38 · Updated 2 years ago
- [EMNLP-2022 Findings] Code for paper “ProGen: Progressive Zero-shot Dataset Generation via In-context Feedback”. ☆24 · Updated last year
- [ACL 2023 Findings] What In-Context Learning “Learns” In-Context: Disentangling Task Recognition and Task Learning ☆21 · Updated last year
- [NeurIPS 2022] Generating Training Data with Language Models: Towards Zero-Shot Language Understanding ☆62 · Updated 2 years ago
- Official implementation of Privacy Implications of Retrieval-Based Language Models (EMNLP 2023). https://arxiv.org/abs/2305.14888 ☆36 · Updated 5 months ago
- EMNLP 2022: "MABEL: Attenuating Gender Bias using Textual Entailment Data" https://arxiv.org/abs/2210.14975 ☆37 · Updated 11 months ago
- Methods and evaluation for aligning language models temporally ☆24 · Updated 8 months ago
- TBC ☆26 · Updated 2 years ago
- [EMNLP 2022] Code for our paper “ZeroGen: Efficient Zero-shot Learning via Dataset Generation”. ☆46 · Updated 2 years ago
- Rationales for Sequential Predictions ☆40 · Updated 2 years ago
- Data Valuation on In-Context Examples (ACL23) ☆23 · Updated last month
- Grade-School Math with Irrelevant Context (GSM-IC) benchmark is an arithmetic reasoning dataset built upon GSM8K, by adding irrelevant se… ☆55 · Updated last year
- Code for our paper: "GrIPS: Gradient-free, Edit-based Instruction Search for Prompting Large Language Models" ☆51 · Updated last year
- Min-K%++: Improved baseline for detecting pre-training data of LLMs https://arxiv.org/abs/2404.02936 ☆26 · Updated 5 months ago
- Restore safety in fine-tuned language models through task arithmetic ☆26 · Updated 7 months ago
- Code associated with the paper: "Few-Shot Self-Rationalization with Natural Language Prompts" ☆13 · Updated 2 years ago
- Repository for ACL 2022 paper Mix and Match: Learning-free Controllable Text Generation using Energy Language Models ☆41 · Updated 2 years ago
- Adding new tasks to T0 without catastrophic forgetting ☆30 · Updated 2 years ago
- DEMix Layers for Modular Language Modeling ☆53 · Updated 3 years ago
- Learning adapter weights from task descriptions ☆15 · Updated last year
- Few-shot Learning with Auxiliary Data ☆26 · Updated 11 months ago
- [ACL'24 Oral] Analysing The Impact of Sequence Composition on Language Model Pre-Training ☆18 · Updated 3 months ago
- This repository accompanies our paper “Do Prompt-Based Models Really Understand the Meaning of Their Prompts?” ☆84 · Updated 2 years ago
- [EMNLP 2021] MuVER: Improving First-Stage Entity Retrieval with Multi-View Entity Representations ☆31 · Updated 2 years ago