Localizing Memorized Sequences in Language Models
☆22Oct 15, 2025Updated 7 months ago
Alternatives and similar repositories for memorization
Users that are interested in memorization are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [KDD 2023] code for "Test accuracy vs. generalization gap: model selection in NLP without accessing training or testing data" https://arx…☆12Oct 17, 2022Updated 3 years ago
- A Wasserstein Subsequence Kernel for Time Series.☆21Jun 17, 2024Updated last year
- Official repository of paper "Let All be Whitened: Multi-teacher Distillation for Efficient Visual Retrieval"☆11Dec 20, 2023Updated 2 years ago
- Code for Neural Execution Engines: Learning to Execute Subroutines☆18Jan 11, 2021Updated 5 years ago
- [NeurIPS 2024 Spotlight] Code and data for the paper "Finding Transformer Circuits with Edge Pruning".☆68Aug 15, 2025Updated 9 months ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- V-Mapper -☆10Aug 6, 2023Updated 2 years ago
- The code of "dp-promise: Differentially Private Diffusion Probabilistic Models for Image Synthesis"☆23Apr 5, 2024Updated 2 years ago
- PyTorch Implementation of GPT-2☆34Sep 4, 2024Updated last year
- rest2vec: Vectorizing the resting-state functional connectome using graph embedding☆10Jul 6, 2023Updated 2 years ago
- Acquire features from a 3D object using a ray-cast approach.☆22Mar 31, 2025Updated last year
- [NeurIPS 2024] AlphaPruning: Using Heavy-Tailed Self Regularization Theory for Improved Layer-wise Pruning of Large Language Models☆34Jun 9, 2025Updated last year
- ☆10Nov 6, 2024Updated last year
- arXiv plain text extraction☆42Dec 8, 2022Updated 3 years ago
- Implementation of the SPN model and the experiments from the LoG 2022 paper "Shortest Path Networks for Graph Property Prediction".☆25Feb 7, 2023Updated 3 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Towards Better Graph Representation Learning with Parameterized Decomposition & Filtering☆13Aug 22, 2023Updated 2 years ago
- Mapper Interactive is a customizable visualization framework for the analysis and visualization of high-dimensional point cloud data usin…☆26Mar 31, 2026Updated 2 months ago
- 🔋 Utilities for scientific python☆19Oct 16, 2025Updated 7 months ago
- This repo is the implementations of the baselines in SVD dataset.☆25Aug 16, 2020Updated 5 years ago
- ☆12Jul 30, 2025Updated 10 months ago
- mPLM-Sim: Better Cross-Lingual Similarity and Transfer in Multilingual Pretrained Language Models☆11Jan 19, 2024Updated 2 years ago
- ☆22Jul 20, 2022Updated 3 years ago
- My toy model for natural language inference task.☆11Aug 6, 2018Updated 7 years ago
- In-Context Reinforcement Learning for Tool Use in Large Language Models☆48Mar 26, 2026Updated 2 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [ICLR 2024] Beyond Accuracy: Evaluating Self-Consistency of Code Large Language Models with IdentityChain☆11Nov 24, 2025Updated 6 months ago
- Code for Findings of ACL 2023 paper "Improving Zero-shot Multilingual Neural Machine Translation by Leveraging Cross-lingual Consistency …☆10Jul 18, 2023Updated 2 years ago
- Noise Reduction Methods for Distantly Supervised Biomedical Relation Extraction☆11Oct 25, 2017Updated 8 years ago
- Open-source code for ''Individual Fairness for Graph Neural Networks: A Ranking based Approach''.☆11Jul 8, 2022Updated 3 years ago
- ☆40Dec 19, 2024Updated last year
- Manipulate tensors with PackedSequence and CattedSequence☆12Jan 4, 2026Updated 5 months ago
- [NeurIPS 2024] How do Large Language Models Handle Multilingualism?☆52Nov 8, 2024Updated last year
- Concise Reasoning via Reinforcement Learning☆13Apr 16, 2025Updated last year
- Energy Landscape Analysis Toolbox (ELAT) for MATLAB☆34Nov 12, 2024Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Official PyTorch implementation of RadMamba☆23Aug 25, 2025Updated 9 months ago
- Watermarking against model extraction attacks in MLaaS. ACM MM 2021.☆34Jul 15, 2021Updated 4 years ago
- Transfer learning for neural machine translation using cross-lingual word embeddings☆10Dec 17, 2025Updated 5 months ago
- SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields extraction, tokenization, prompting, batchi…☆35May 24, 2024Updated 2 years ago
- Leveraging Local and Global Patterns for Self-Attention Networks☆12Jun 3, 2019Updated 7 years ago
- Code for the EACL 2024 paper: "Small Language Models Improve Giants by Rewriting Their Outputs"☆12Apr 20, 2024Updated 2 years ago
- Python package to augment multilingual data☆15Feb 15, 2023Updated 3 years ago