pietrolesci / memorisation-profilesView external linksLinks
This is the official implementation for our ACL 2024 paper: "Causal Estimation of Memorisation Profiles".
☆24Mar 25, 2025Updated 10 months ago
Alternatives and similar repositories for memorisation-profiles
Users that are interested in memorisation-profiles are comparing it to the libraries listed below
Sorting:
- ☆20Nov 4, 2025Updated 3 months ago
- Resources for Retrieval Augmentation for Commonsense Reasoning: A Unified Approach. EMNLP 2022.☆23Nov 23, 2022Updated 3 years ago
- codes and plots for "Active-Dormant Attention Heads: Mechanistically Demystifying Extreme-Token Phenomena in LLMs"☆10Dec 30, 2024Updated last year
- The repository contains code for Adaptive Data Optimization☆32Dec 9, 2024Updated last year
- Code for the paper "Pretrained Models for Multilingual Federated Learning" at NAACL 2022☆11Aug 9, 2022Updated 3 years ago
- An implementation of online data mixing for the Pile dataset, based on the GPT-NeoX library.☆13Jan 9, 2024Updated 2 years ago
- Code accompanying our paper "Feature Learning in Infinite-Width Neural Networks" (https://arxiv.org/abs/2011.14522)☆62May 11, 2021Updated 4 years ago
- ☆17Feb 4, 2025Updated last year
- Code for ACL 2022 paper "Semi-Supervised Formality Style Transfer with Consistency Training".☆17May 21, 2022Updated 3 years ago
- Official Repository for Dataset Inference for LLMs☆43Jul 25, 2024Updated last year
- ☆15Feb 21, 2024Updated last year
- ☆31Aug 9, 2024Updated last year
- Learning to route instances for Human vs AI Feedback (ACL Main '25)☆26Jul 23, 2025Updated 6 months ago
- ☆25Jun 29, 2025Updated 7 months ago
- ☆58May 30, 2024Updated last year
- [ACL 2023] Gradient Ascent Post-training Enhances Language Model Generalization☆29Sep 12, 2024Updated last year
- Simple and scalable tools for data-driven pretraining data selection.☆29Jun 9, 2025Updated 8 months ago
- ReCross: Unsupervised Cross-Task Generalization via Retrieval Augmentation☆24May 1, 2022Updated 3 years ago
- (NeurIPS 2024) What Makes CLIP More Robust to Long-Tailed Pre-Training Data? A Controlled Study for Transferable Insights☆28Oct 28, 2024Updated last year
- Sparse and discrete interpretability tool for neural networks☆64Feb 12, 2024Updated 2 years ago
- [EMNLP 2024] "Revisiting Who's Harry Potter: Towards Targeted Unlearning from a Causal Intervention Perspective"☆32Jul 22, 2024Updated last year
- Few-shot Learning with Auxiliary Data☆31Dec 8, 2023Updated 2 years ago
- TBC☆28Nov 2, 2022Updated 3 years ago
- Code for "Can Retriever-Augmented Language Models Reason? The Blame Game Between the Retriever and the Language Model", EMNLP Findings 20…☆28Nov 2, 2023Updated 2 years ago
- ☆30Sep 28, 2023Updated 2 years ago
- ☆35Feb 26, 2024Updated last year
- Codebase for fine-tuning Llama2 70B to generate math test questions and answers.☆11Aug 30, 2024Updated last year
- Official repository for MATES: Model-Aware Data Selection for Efficient Pretraining with Data Influence Models [NeurIPS 2024]☆79Nov 14, 2024Updated last year
- Code for NeurIPS 2024 Spotlight: "Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations"☆89Oct 30, 2024Updated last year
- SILO Language Models code repository☆83Feb 23, 2024Updated last year
- The LM Contamination Index is a manually created database of contamination evidences for LMs.☆82Apr 11, 2024Updated last year
- ☆37Dec 6, 2024Updated last year
- An Educational Framework Based on PyTorch for Deep Learning Education and Exploration☆10Dec 24, 2023Updated 2 years ago
- Repo for paper "CODIS: Benchmarking Context-Dependent Visual Comprehension for Multimodal Large Language Models".☆12Oct 14, 2024Updated last year
- A Data-Driven Approach to Predict the Success of Bank Telemarketing☆10Apr 27, 2021Updated 4 years ago
- ☆11Dec 23, 2024Updated last year
- [ICLR 2024] Scaling physics-informed hard constraints with mixture-of-experts.☆38Jun 21, 2024Updated last year
- Code for the paper "Distinguishing the Knowable from the Unknowable with Language Models"☆11Apr 15, 2024Updated last year
- Concurrency library☆16Oct 13, 2024Updated last year