☆44 · Nov 17, 2024 · Updated last year
Alternatives and similar repositories for semantic-memorization
Users who are interested in semantic-memorization are comparing it to the repositories listed below.
- An Empirical Study of Memorization in NLP (ACL 2022) · ☆13 · Jun 22, 2022 · Updated 3 years ago
- Pile Deduplication Code · ☆18 · May 15, 2023 · Updated 2 years ago
- Applying Reinforcement Learning from Human Feedback to language models to teach them to write short story responses to writing prompts. · ☆14 · May 5, 2022 · Updated 3 years ago
- ☆15 · Feb 21, 2024 · Updated 2 years ago
- The code and data for "Are Large Pre-Trained Language Models Leaking Your Personal Information?" (Findings of EMNLP '22) · ☆28 · Oct 31, 2022 · Updated 3 years ago
- ☆14 · Feb 24, 2020 · Updated 6 years ago
- ☆13 · Jan 20, 2023 · Updated 3 years ago
- Used for adaptive human in the loop evaluation of language and embedding models. · ☆307 · Mar 1, 2023 · Updated 3 years ago
- Efficiently computing & storing token n-grams from large corpora · ☆27 · Oct 6, 2024 · Updated last year
- ☆329 · Jun 7, 2021 · Updated 4 years ago
- ☆24 · Jan 27, 2026 · Updated last month
- ☆20 · Oct 15, 2022 · Updated 3 years ago
- 👩‍💻 Code for the ACL paper "Detecting Edit Failures in LLMs: An Improved Specificity Benchmark" · ☆20 · Jan 19, 2024 · Updated 2 years ago
- Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model · ☆45 · Oct 1, 2025 · Updated 5 months ago
- Tools for understanding how transformer predictions are built layer-by-layer · ☆574 · Aug 7, 2025 · Updated 7 months ago
- Building language models to predict more than one token ahead to enable further ahead predictions. · ☆12 · May 22, 2025 · Updated 10 months ago
- ☆11 · Feb 3, 2025 · Updated last year
- ☆23 · Jan 25, 2023 · Updated 3 years ago
- ☆48 · Jan 21, 2024 · Updated 2 years ago
- Keeping language models honest by directly eliciting knowledge encoded in their activations. · ☆217 · Updated this week
- Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision · ☆124 · Sep 9, 2024 · Updated last year
- Engine for collecting, uploading, and downloading model activations · ☆26 · Apr 2, 2025 · Updated 11 months ago
- Erasing concepts from neural representations with provable guarantees · ☆245 · Jan 27, 2025 · Updated last year
- Evals is a framework for evaluating OpenAI models and an open-source registry of benchmarks. · ☆18 · Mar 23, 2023 · Updated 2 years ago
- In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation (ICML 2024) · ☆62 · Mar 30, 2024 · Updated last year
- Aioli: A unified optimization framework for language model data mixing · ☆32 · Jan 17, 2025 · Updated last year
- defaultMODE is a Python framework for creating Discord AI agents with persistent memory and evolving behavior through brain-inspired sele… · ☆13 · Dec 18, 2025 · Updated 3 months ago
- ☆78 · Dec 7, 2023 · Updated 2 years ago
- BigKnow2022: Bringing Language Models Up to Speed · ☆16 · Mar 27, 2023 · Updated 2 years ago
- Code of our paper "Method-Level Bug Severity Prediction using Source Code Metrics and LLMs" which is accepted to ISSRE 2023. · ☆10 · Nov 12, 2023 · Updated 2 years ago
- PAL: Proxy-Guided Black-Box Attack on Large Language Models · ☆56 · Aug 17, 2024 · Updated last year
- Script for downloading GitHub. · ☆13 · Sep 24, 2020 · Updated 5 years ago
- Airlift Challenge starter kit · ☆10 · Apr 18, 2025 · Updated 11 months ago
- ModelDiff: A Framework for Comparing Learning Algorithms · ☆58 · Aug 15, 2023 · Updated 2 years ago
- Google Research · ☆46 · Oct 29, 2022 · Updated 3 years ago
- Code and data to support "Speak, Memory: An Archaeology of Books Known to ChatGPT/GPT-4" · ☆70 · May 2, 2023 · Updated 2 years ago
- ☆16 · Jul 20, 2023 · Updated 2 years ago
- ☆25 · Jan 17, 2025 · Updated last year
- The accompanying code for "Transformer Feed-Forward Layers Are Key-Value Memories". Mor Geva, Roei Schuster, Jonathan Berant, and Omer Le… · ☆100 · Sep 5, 2021 · Updated 4 years ago