☆44Nov 17, 2024Updated last year
Alternatives and similar repositories for semantic-memorization
Users that are interested in semantic-memorization are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Pile Deduplication Code☆18May 15, 2023Updated 3 years ago
- Applying Reinforcement Learning from Human Feedback to language models to teach them to write short story responses to writing prompts.☆13May 5, 2022Updated 4 years ago
- The code and data for "Are Large Pre-Trained Language Models Leaking Your Personal Information?" (Findings of EMNLP '22)☆27Oct 31, 2022Updated 3 years ago
- ☆13Feb 24, 2020Updated 6 years ago
- ☆13Jan 20, 2023Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Used for adaptive human in the loop evaluation of language and embedding models.☆306Mar 1, 2023Updated 3 years ago
- Efficiently computing & storing token n-grams from large corpora☆27Jun 15, 2026Updated 2 weeks ago
- ☆331Jun 7, 2021Updated 5 years ago
- ☆27Apr 1, 2026Updated 2 months ago
- ☆21Oct 15, 2022Updated 3 years ago
- 👩💻 Code for the ACL paper "Detecting Edit Failures in LLMs: An Improved Specificity Benchmark"☆20Jan 19, 2024Updated 2 years ago
- ☆22Mar 18, 2024Updated 2 years ago
- Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model☆45Oct 1, 2025Updated 8 months ago
- Tools for understanding how transformer predictions are built layer-by-layer☆598Aug 7, 2025Updated 10 months ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Building language models to predict more than one token ahead to enable further ahead predictions.☆12May 22, 2025Updated last year
- ☆11Feb 3, 2025Updated last year
- ☆23Jan 25, 2023Updated 3 years ago
- ☆48Jan 21, 2024Updated 2 years ago
- Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision☆124Sep 9, 2024Updated last year
- Keeping language models honest by directly eliciting knowledge encoded in their activations.☆220Jun 22, 2026Updated last week
- Engine for collecting, uploading, and downloading model activations☆29Apr 2, 2025Updated last year
- Erasing concepts from neural representations with provable guarantees☆255Jan 27, 2025Updated last year
- Evals is a framework for evaluating OpenAI models and an open-source registry of benchmarks.☆18Mar 23, 2023Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation (ICML 2024)☆62Mar 30, 2024Updated 2 years ago
- Aioli: A unified optimization framework for language model data mixing☆32Jan 17, 2025Updated last year
- defaultMODE is a Python framework for creating Discord AI agents with persistent memory and evolving behavior through brain-inspired sele…☆13Apr 21, 2026Updated 2 months ago
- ☆306Jun 10, 2026Updated 2 weeks ago
- ☆79Dec 7, 2023Updated 2 years ago
- PAL: Proxy-Guided Black-Box Attack on Large Language Models☆56Aug 17, 2024Updated last year
- Code of our paper "Method-Level Bug Severity Prediction using Source Code Metrics and LLMs" which is accepted to ISSRE 2023.☆10Nov 12, 2023Updated 2 years ago
- Script for downloading GitHub.☆13Sep 24, 2020Updated 5 years ago
- ModelDiff: A Framework for Comparing Learning Algorithms☆59Aug 15, 2023Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Code and data to support "Speak, Memory: An Archaeology of Books Known to ChatGPT/GPT-4"☆68May 2, 2023Updated 3 years ago
- Google Research☆47Oct 29, 2022Updated 3 years ago
- ☆25Jan 17, 2025Updated last year
- The accompanying code for "Transformer Feed-Forward Layers Are Key-Value Memories". Mor Geva, Roei Schuster, Jonathan Berant, and Omer Le…☆103Sep 5, 2021Updated 4 years ago
- analysis of public NLP corpora☆11Feb 9, 2023Updated 3 years ago
- ☆16Jul 10, 2023Updated 2 years ago
- Fork of kingoflolz/mesh-transformer-jax with memory usage optimizations and support for GPT-Neo, GPT-NeoX, BLOOM, OPT and fairseq dense L…☆22Nov 14, 2022Updated 3 years ago