ahans30/goldfish-loss


[NeurIPS 2024] Goldfish Loss: Mitigating Memorization in Generative LLMs

☆98

Alternatives and similar repositories for goldfish-loss

Users that are interested in goldfish-loss are comparing it to the libraries listed below.


mcleish7 / gemstone-scaling-laws
Gemstones: A Model Suite for Multi-Faceted Scaling Laws (NeurIPS 2025)
☆35 · Sep 28, 2025 · Updated 10 months ago
morse-benchmark / morse-500
☆31 · May 21, 2026 · Updated 2 months ago
hsouri / GDP
Generating Potent Poisons and Backdoors from Scratch with Guided Diffusion
☆11 · Apr 1, 2024 · Updated 2 years ago
hamidkazemi22 / CLIPInversion
What do we learn from inverting CLIP models?
☆58 · Mar 6, 2024 · Updated 2 years ago
facebookresearch / scalable-curvature
Code for Dayal Kalra's research internship on scalable curvature measures for neural networks.
☆29 · Feb 3, 2026 · Updated 5 months ago
devnkong / GOAT
Official implementation of GOAT model (ICML2023)
☆38 · Jul 3, 2023 · Updated 3 years ago
vasusingla / simple-data-attribution
A simple and efficient baseline for data attribution
☆11 · Nov 10, 2023 · Updated 2 years ago
YuxinWenRick / diffusion_memorization
Official repo for Detecting, Explaining, and Mitigating Memorization in Diffusion Models (ICLR 2024)
☆80 · Apr 3, 2024 · Updated 2 years ago
seal-rg / streaming
Code for the paper Multi-Stream LLMs: Unblocking Language Models with Parallel Streams of Thoughts, Inputs and Outputs
☆66 · Jun 23, 2026 · Updated last month
YuxinWenRick / canary-in-a-coalmine
☆33 · Nov 27, 2023 · Updated 2 years ago
hsouri / bob-classification
☆11 · Oct 20, 2023 · Updated 2 years ago
JonasGeiping / dataaugs
☆18 · Oct 12, 2022 · Updated 3 years ago
AminJun / ImageNet1KBoundingBoxes
Pytorch ImageNet1k Loader with Bounding Boxes.
☆13 · Jan 23, 2022 · Updated 4 years ago
dayal-kalra / low-memory-adam
☆14 · Mar 2, 2025 · Updated last year
hsouri / bob-detection
☆12 · Oct 20, 2023 · Updated 2 years ago
neelsjain / baseline-defenses
Official Code for "Baseline Defenses for Adversarial Attacks Against Aligned Language Models"
☆34 · Oct 26, 2023 · Updated 2 years ago
somepago / DCR
Official Pytorch repo of CVPR'23 and NeurIPS'23 papers on understanding replication in diffusion models.
☆113 · Nov 22, 2023 · Updated 2 years ago
songys / huggingface_KoreanDataset
Korean datasets available on Hugging Face
☆37 · Oct 10, 2024 · Updated last year
azshue / AutoPoison
The official repository of the paper "On the Exploitability of Instruction Tuning".
☆70 · Feb 5, 2024 · Updated 2 years ago
arpitbansal297 / Certified_Watermarks
☆16 · Jul 17, 2022 · Updated 4 years ago
neelsjain / BYOD
The Official Repository for "Bring Your Own Data! Self-Supervised Evaluation for Large Language Models"
☆108 · Sep 23, 2023 · Updated 2 years ago
JonasGeiping / carving
Package to optimize Adversarial Attacks against (Large) Language Models with Varied Objectives
☆71 · Feb 22, 2024 · Updated 2 years ago
J-Seo / KoCommonGEN-V2
KoCommonGEN v2: A Benchmark for Navigating Korean Commonsense Reasoning Challenges in Large Language Models
☆25 · Aug 24, 2024 · Updated last year
psandovalsegura / autoregressive-poisoning
Code for the paper "Autoregressive Perturbations for Data Poisoning" (NeurIPS 2022)
☆20 · Sep 9, 2024 · Updated last year
facebookresearch / zero
PyTorch Implementation of Zero-Shot Vision Encoder Grafting via LLM Surrogates [ICCV'25]
☆54 · Jul 10, 2025 · Updated last year
ShanglunFengatETHZ / PrivacyBackdoor
Privacy backdoors
☆50 · Apr 28, 2024 · Updated 2 years ago
aks2203 / easy-to-hard
Official repository for the paper "Can You Learn an Algorithm? Generalizing from Easy to Hard Problems with Recurrent Networks"
☆61 · Mar 1, 2022 · Updated 4 years ago
hpcgroup / loki
Algorithms for approximate attention in LLMs
☆22 · Apr 14, 2025 · Updated last year
goldblum / free-lunch
Implementation of experiments from The No Free Lunch Theorem, Kolmogorov Complexity, and the Role of Inductive Biases in Machine Learning
☆17 · May 14, 2023 · Updated 3 years ago
technion-cs-nlp / hallucination-mitigation
☆23 · Dec 17, 2024 · Updated last year
JonasGeiping / fullbatchtraining
Training vision models with full-batch gradient descent and regularization
☆40 · Feb 14, 2023 · Updated 3 years ago
LeonLixyz / LCLM
latent context language models
☆72 · Jun 9, 2026 · Updated last month
metterian / korean_bert_score
BERT score for text generation
☆12 · Jan 15, 2025 · Updated last year
FLAIROx / cultural-accumulation
☆16 · Jul 16, 2024 · Updated 2 years ago
googleinterns / localizing-paragraph-memorization
☆15 · Feb 21, 2024 · Updated 2 years ago
juzhengz / logit-fusion
Learning from Mixed Rollouts: Logit Fusion as a Bridge Between Imitation and Exploration
☆17 · Feb 24, 2026 · Updated 5 months ago
songmzhang / DSKDv2
The official implementation of the paper "A Dual-Space Framework for General Knowledge Distillation of Large Language Models".
☆18 · Jan 4, 2026 · Updated 6 months ago
yale-nlp / refdpo
☆16 · Jul 23, 2024 · Updated 2 years ago
liuzrcc / ImageShortcutSqueezing
Image Shortcut Squeezing: Countering Perturbative Availability Poisons with Compression
☆14 · Mar 22, 2025 · Updated last year