pietrolesci/memorisation-profiles

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/pietrolesci/memorisation-profiles)

pietrolesci / memorisation-profiles

This is the official implementation for our ACL 2024 paper: "Causal Estimation of Memorisation Profiles".

☆25

Alternatives and similar repositories for memorisation-profiles

Users that are interested in memorisation-profiles are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

technion-cs-nlp / parametric-faithfulness
View on GitHub
☆23Aug 30, 2025Updated 10 months ago
wenzhe-li / Self-MoA
View on GitHub
☆17Feb 4, 2025Updated last year
formll / resolving-scaling-law-discrepancies
View on GitHub
☆19Nov 4, 2025Updated 8 months ago
GuoTianYu2000 / Active-Dormant-Attention
View on GitHub
codes and plots for "Active-Dormant Attention Heads: Mechanistically Demystifying Extreme-Token Phenomena in LLMs"
☆11Dec 30, 2024Updated last year
rycolab / prefix-parsing
View on GitHub
☆14Feb 1, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
verypluming / HELP
View on GitHub
HELP: a dataset for Handling Entailments with Lexical and logical Phenomena (Ver.1.0)
☆15Jul 20, 2023Updated 3 years ago
wyu97 / RACo
View on GitHub
Resources for Retrieval Augmentation for Commonsense Reasoning: A Unified Approach. EMNLP 2022.
☆24Nov 23, 2022Updated 3 years ago
babylm / evaluation-pipeline-2025
View on GitHub
☆26Aug 19, 2025Updated 11 months ago
facebookresearch / multiloko
View on GitHub
A benchmark with locally sourced multilingual questions for 31 languages.
☆18May 13, 2026Updated 2 months ago
yangyuan / brown-clustering
View on GitHub
Brown clustering in Python
☆22Dec 12, 2017Updated 8 years ago
orionw / Multilingual-Federated-Learning
View on GitHub
Code for the paper "Pretrained Models for Multilingual Federated Learning" at NAACL 2022
☆11Aug 9, 2022Updated 3 years ago
babylm / babylm.github.io
View on GitHub
☆16Jul 20, 2026Updated last week
jamie-murdoch / ContextualDecomposition
View on GitHub
Demo for method introduced in "Beyond Word Importance: Contextual Decomposition to Extract Interactions from LSTMs"
☆55Jul 23, 2020Updated 6 years ago
kaistAI / GAP
View on GitHub
[ACL 2023] Gradient Ascent Post-training Enhances Language Model Generalization
☆29Sep 12, 2024Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
allenai / hybrid-preferences
View on GitHub
Learning to route instances for Human vs AI Feedback (ACL Main '25)
☆29Jul 23, 2025Updated last year
wwangwitsel / ConfDiff
View on GitHub
[NeurIPS'23] Binary Classification with Confidence Difference
☆10May 13, 2024Updated 2 years ago
cywinski / eliciting-secret-knowledge
View on GitHub
Code repository for "Eliciting Secret Knowledge from Language Models"
☆24Mar 30, 2026Updated 3 months ago
googleinterns / localizing-paragraph-memorization
View on GitHub
☆15Feb 21, 2024Updated 2 years ago
causalNLP / cladder
View on GitHub
We develop benchmarks and analysis tools to evaluate the causal reasoning abilities of LLMs.
☆147May 29, 2024Updated 2 years ago
catherinearnett / morphscore
View on GitHub
This is the repository for MorphScore, a tokenizer evaluation framework for morphological alignment.
☆17Jul 10, 2025Updated last year
pratyushmaini / llm_dataset_inference
View on GitHub
Official Repository for Dataset Inference for LLMs
☆41Jul 25, 2024Updated 2 years ago
Aolius / semi-fst
View on GitHub
Code for ACL 2022 paper "Semi-Supervised Formality Style Transfer with Consistency Training".
☆17May 21, 2022Updated 4 years ago
rosieyzh / openrlhf-pretrain
View on GitHub
Code for "Echo Chamber: RL Post-training Amplifies Behaviors Learned in Pretraining"
☆29Oct 14, 2025Updated 9 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
zepingyu0512 / in-context-mechanism
View on GitHub
code for EMNLP 2024 paper: How do Large Language Models Learn In-Context? Query and Key Matrices of In-Context Heads are Two Towers for M…
☆13Nov 17, 2024Updated last year
revelio-diffusion / revelio
View on GitHub
☆26Jun 29, 2025Updated last year
Kaleidophon / token2index
View on GitHub
A lightweight but powerful library to build token indices for NLP tasks, compatible with major Deep Learning frameworks like PyTorch and …
☆50Dec 6, 2024Updated last year
shangdatalab / Deep-Contam
View on GitHub
Official implementation of Data Contamination Can Cross Language Barriers
☆12Sep 11, 2024Updated last year
KempnerInstitute / llm_uncertainty
View on GitHub
Code for the paper "Distinguishing the Knowable from the Unknowable with Language Models"
☆11Jul 18, 2026Updated last week
INK-USC / ReCross
View on GitHub
ReCross: Unsupervised Cross-Task Generalization via Retrieval Augmentation
☆23May 1, 2022Updated 4 years ago
michahu / pre-pretraining
View on GitHub
Accelerate pretraining by pre-pretraining on formal languages!
☆20Feb 13, 2026Updated 5 months ago
AlexWan0 / Poisoning-Instruction-Tuned-Models
View on GitHub
☆59May 30, 2024Updated 2 years ago
nissymori / remax-rl
View on GitHub
[ICML2026] Official JAX code for Emergence of Exploration in Policy Gradient Reinforcement Learning via Retrying
☆15Jul 3, 2026Updated 3 weeks ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
bond005 / impartial_text_cls
View on GitHub
Text classifier, based on the BERT and a Bayesian neural network, which can train on small labeled texts and doubt its decision.
☆14Mar 24, 2023Updated 3 years ago
INK-USC / DIG
View on GitHub
Discretized Integrated Gradients for Explaining Language Models (EMNLP 2021)
☆27Mar 26, 2022Updated 4 years ago
swairshah / Intensify
View on GitHub
coloring terminal text with intensities (used for plotting probability, entropy with tokens)
☆12Oct 11, 2024Updated last year
yeliu0930 / Knowledge-guided-Open-Attribute-Value-Extraction-with-Reinforcement-Learning
View on GitHub
☆10Oct 19, 2020Updated 5 years ago
Kernel-Machines / kermac
View on GitHub
Pytorch routines for (Ker)nel (Mac)hines
☆12Oct 10, 2025Updated 9 months ago
QxLabIreland / AQP
View on GitHub
☆23Jun 13, 2022Updated 4 years ago
alon-albalak / FLAD
View on GitHub
Few-shot Learning with Auxiliary Data
☆31Dec 8, 2023Updated 2 years ago