☆44Nov 17, 2024Updated last year
Alternatives and similar repositories for semantic-memorization
Users that are interested in semantic-memorization are comparing it to the libraries listed below
Sorting:
- An Empirical Study of Memorization in NLP (ACL 2022)☆13Jun 22, 2022Updated 3 years ago
- Applying Reinforcement Learning from Human Feedback to language models to teach them to write short story responses to writing prompts.☆14May 5, 2022Updated 3 years ago
- Pile Deduplication Code☆18May 15, 2023Updated 2 years ago
- ☆15Feb 21, 2024Updated 2 years ago
- Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model☆45Oct 1, 2025Updated 5 months ago
- Code of our paper "Method-Level Bug Severity Prediction using Source Code Metrics and LLMs" which is accepted to ISSRE 2023.☆10Nov 12, 2023Updated 2 years ago
- Script for downloading GitHub.☆13Sep 24, 2020Updated 5 years ago
- ☆13Jan 20, 2023Updated 3 years ago
- ☆10Feb 3, 2025Updated last year
- The code and data for "Are Large Pre-Trained Language Models Leaking Your Personal Information?" (Findings of EMNLP '22)☆28Oct 31, 2022Updated 3 years ago
- Fine-Tuning Pre-trained Transformers into Decaying Fast Weights☆19Oct 9, 2022Updated 3 years ago
- ☆23Jan 27, 2026Updated last month
- Building language models to predict more than one token ahead to enable further ahead predictions.☆12May 22, 2025Updated 9 months ago
- ☆16Jul 20, 2023Updated 2 years ago
- Keeping language models honest by directly eliciting knowledge encoded in their activations.☆217Updated this week
- Tools for understanding how transformer predictions are built layer-by-layer☆567Aug 7, 2025Updated 6 months ago
- Engine for collecting, uploading, and downloading model activations☆26Apr 2, 2025Updated 10 months ago
- BigKnow2022: Bringing Language Models Up to Speed☆16Mar 27, 2023Updated 2 years ago
- A library for mechanistic anomaly detection☆22Jan 9, 2025Updated last year
- ☆38Apr 17, 2024Updated last year
- ☆14Feb 24, 2020Updated 6 years ago
- Erasing concepts from neural representations with provable guarantees☆243Jan 27, 2025Updated last year
- Code for safety test in "Keeping LLMs Aligned After Fine-tuning: The Crucial Role of Prompt Templates"☆22Sep 21, 2025Updated 5 months ago
- 👩💻 Code for the ACL paper "Detecting Edit Failures in LLMs: An Improved Specificity Benchmark"☆20Jan 19, 2024Updated 2 years ago
- ☆22Jan 25, 2023Updated 3 years ago
- ☆23Jan 17, 2025Updated last year
- ☆23Jan 27, 2025Updated last year
- Used for adaptive human in the loop evaluation of language and embedding models.☆308Mar 1, 2023Updated 3 years ago
- ☆48Jan 21, 2024Updated 2 years ago
- ☆20Feb 11, 2024Updated 2 years ago
- ☆71Jul 24, 2025Updated 7 months ago
- Google Research☆46Oct 29, 2022Updated 3 years ago
- Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision☆124Sep 9, 2024Updated last year
- Scripts for downloading and pre-processing the `proof-pile`, a high quality dataset of mathematical text and code.☆22Nov 26, 2022Updated 3 years ago
- [NeurIPS 2024] Goldfish Loss: Mitigating Memorization in Generative LLMs☆94Nov 17, 2024Updated last year
- ☆329Jun 7, 2021Updated 4 years ago
- A research repo for experiments about Reinforcement Finetuning☆54Apr 7, 2025Updated 10 months ago
- NusaWrites is an in-depth analysis of corpora collection strategy and a comprehensive language modeling benchmark for underrepresented an…☆28Sep 27, 2024Updated last year
- Aioli: A unified optimization framework for language model data mixing☆32Jan 17, 2025Updated last year