This is the official implementation for our ACL 2024 paper: "Causal Estimation of Memorisation Profiles".
☆24Mar 25, 2025Updated last year
Alternatives and similar repositories for memorisation-profiles
Users that are interested in memorisation-profiles are comparing it to the libraries listed below.
- ☆16Feb 4, 2025Updated last year
- ☆14Feb 1, 2024Updated 2 years ago
- Code accompanying our paper "Feature Learning in Infinite-Width Neural Networks" (https://arxiv.org/abs/2011.14522)☆61May 11, 2021Updated 5 years ago
- An implementation of online data mixing for the Pile dataset, based on the GPT-NeoX library.☆14Jan 9, 2024Updated 2 years ago
- The repository contains code for Adaptive Data Optimization☆36Dec 9, 2024Updated last year
- HELP: a dataset for Handling Entailments with Lexical and logical Phenomena (Ver.1.0)☆15Jul 20, 2023Updated 2 years ago
- Resources for Retrieval Augmentation for Commonsense Reasoning: A Unified Approach. EMNLP 2022.☆24Nov 23, 2022Updated 3 years ago
- ☆10Jun 19, 2019Updated 6 years ago
- A list of resources dedicated to compositionality☆14Feb 21, 2019Updated 7 years ago
- The Earleyx parser was originated from Roger Levy's prefix parser, but has evolved significantly. Earleyx can generate Viterbi parses and…☆15Mar 27, 2014Updated 12 years ago
- Brown clustering in Python☆22Dec 12, 2017Updated 8 years ago
- Code for the paper "Pretrained Models for Multilingual Federated Learning" at NAACL 2022☆11Aug 9, 2022Updated 3 years ago
- PhD thesis template with title page according to the University of Amsterdam.☆15Sep 12, 2021Updated 4 years ago
- Demo for method introduced in "Beyond Word Importance: Contextual Decomposition to Extract Interactions from LSTMs"☆55Jul 23, 2020Updated 5 years ago
- Syntactic evaluation sets, attribute-varying grammars, and code for replicating the CLAMS paper. ACL 2020.☆17Nov 26, 2024Updated last year
- FurNet: A Deep-Learning-Based Framework for Removing Furniture Objects in Room Image☆13Nov 22, 2022Updated 3 years ago
- [ACL 2023] Gradient Ascent Post-training Enhances Language Model Generalization☆29Sep 12, 2024Updated last year
- ☆52Apr 7, 2026Updated last month
- Learning to route instances for Human vs AI Feedback (ACL Main '25)☆29Jul 23, 2025Updated 10 months ago
- HITsz Spring 2020 data structures lab reports, including the problems, code, and lab reports; for reference only.☆12Nov 15, 2021Updated 4 years ago
- [NeurIPS'23] Binary Classification with Confidence Difference☆10May 13, 2024Updated 2 years ago
- Official Repository for Dataset Inference for LLMs☆41Jul 25, 2024Updated last year
- ☆27Jun 29, 2025Updated 11 months ago
- Code for the paper "Distinguishing the Knowable from the Unknowable with Language Models"☆11Apr 15, 2024Updated 2 years ago
- code for EMNLP 2024 paper: How do Large Language Models Learn In-Context? Query and Key Matrices of In-Context Heads are Two Towers for M…☆13Nov 17, 2024Updated last year
- A lightweight but powerful library to build token indices for NLP tasks, compatible with major Deep Learning frameworks like PyTorch and …☆50Dec 6, 2024Updated last year
- Can VLMs understand students' hand-drawn math work?☆18Jan 20, 2026Updated 4 months ago
- Official implementation of Data Contamination Can Cross Language Barriers☆12Sep 11, 2024Updated last year
- Pytorch routines for (Ker)nel (Mac)hines☆12Oct 10, 2025Updated 7 months ago
- Simple and scalable tools for data-driven pretraining data selection.☆29Jun 9, 2025Updated 11 months ago
- ☆59May 30, 2024Updated 2 years ago
- Simple MoE - Day 17 of 365 Days of Repos☆19Apr 21, 2026Updated last month
- Text classifier, based on the BERT and a Bayesian neural network, which can train on small labeled texts and doubt its decision.☆14Mar 24, 2023Updated 3 years ago
- A project designed to build and render a full Minecraft crafting tree.☆10Aug 10, 2021Updated 4 years ago
- Official repository for "EduBench: A Comprehensive Benchmarking Dataset for Evaluating Large Language Models in Diverse Educational Scena…☆23May 28, 2025Updated last year
- An implementation of Scalable Evaluation and Improvement of Document Set Expansion via Neural Positive-Unlabeled Learning without AllenNL…☆19Feb 20, 2024Updated 2 years ago
- coloring terminal text with intensities (used for plotting probability, entropy with tokens)☆12Oct 11, 2024Updated last year
- 6,080-param transformer achieving 100% accuracy on 10-digit addition. Trained from scratch in 10 minutes.☆22Feb 19, 2026Updated 3 months ago
- Few-shot Learning with Auxiliary Data☆31Dec 8, 2023Updated 2 years ago