[NeurIPS 2024] Goldfish Loss: Mitigating Memorization in Generative LLMs
☆98Nov 17, 2024Updated last year
Alternatives and similar repositories for goldfish-loss
Users that are interested in goldfish-loss are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Generating Potent Poisons and Backdoors from Scratch with Guided Diffusion☆11Apr 1, 2024Updated 2 years ago
- A simple and efficient baseline for data attribution☆11Nov 10, 2023Updated 2 years ago
- Official repo for Detecting, Explaining, and Mitigating Memorization in Diffusion Models (ICLR 2024)☆80Apr 3, 2024Updated 2 years ago
- ☆33Nov 27, 2023Updated 2 years ago
- ☆11Oct 20, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆18Oct 12, 2022Updated 3 years ago
- Pytorch ImageNet1k Loader with Bounding Boxes.☆13Jan 23, 2022Updated 4 years ago
- ☆15Mar 2, 2025Updated last year
- Code to reproduce "Transformers Can Do Arithmetic with the Right Embeddings", McLeish et al (NeurIPS 2024)☆201May 28, 2024Updated last year
- ☆31Feb 26, 2026Updated 2 months ago
- ☆12Oct 20, 2023Updated 2 years ago
- Official Code for "Baseline Defenses for Adversarial Attacks Against Aligned Language Models"☆33Oct 26, 2023Updated 2 years ago
- Official Pytorch repo of CVPR'23 and NeurIPS'23 papers on understanding replication in diffusion models.☆113Nov 22, 2023Updated 2 years ago
- ☆211Nov 2, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- The official repository of the paper "On the Exploitability of Instruction Tuning".☆69Feb 5, 2024Updated 2 years ago
- huggingface에 있는 한국어 데이터 세트☆36Oct 10, 2024Updated last year
- ☆16Jul 17, 2022Updated 3 years ago
- Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model☆45Oct 1, 2025Updated 7 months ago
- The Official Repository for "Bring Your Own Data! Self-Supervised Evaluation for Large Language Models"☆108Sep 23, 2023Updated 2 years ago
- Package to optimize Adversarial Attacks against (Large) Language Models with Varied Objectives☆71Feb 22, 2024Updated 2 years ago
- ☆21Feb 10, 2025Updated last year
- [ICML 2024] Binoculars: Zero-Shot Detection of LLM-Generated Text☆369May 14, 2024Updated last year
- Evaluating Multimodal Generative AI with Korean Educational Standards, NAACL 2025.☆26May 15, 2025Updated 11 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Code for the paper "Autoregressive Perturbations for Data Poisoning" (NeurIPS 2022)☆20Sep 9, 2024Updated last year
- Official repository for the paper "Can You Learn an Algorithm? Generalizing from Easy to Hard Problems with Recurrent Networks"☆61Mar 1, 2022Updated 4 years ago
- Privacy backdoors☆50Apr 28, 2024Updated 2 years ago
- Cramming 1568 Tokens into a Single Vector and Back Again: Exploring the Limits of Embedding Space Capacity (ACL 2025, oral)☆33Jun 14, 2025Updated 10 months ago
- ☆23Dec 17, 2024Updated last year
- Implementation of experiments from The No Free Lunch Theorem, Kolmogorov Complexity, and the Role of Inductive Biases in Machine Learning☆17May 14, 2023Updated 2 years ago
- ☆15Feb 21, 2024Updated 2 years ago
- BERT score for text generation☆12Jan 15, 2025Updated last year
- ☆16Jul 23, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents"☆31Oct 12, 2023Updated 2 years ago
- ☆13Jul 2, 2025Updated 10 months ago
- Image Shortcut Squeezing: Countering Perturbative Availability Poisons with Compression☆14Mar 22, 2025Updated last year
- ☆16Jul 16, 2024Updated last year
- The repository contains code for Adaptive Data Optimization☆36Dec 9, 2024Updated last year
- This repository contains data, code and models for contextual noncompliance.☆25Jul 18, 2024Updated last year
- An automated data pipeline scaling RL to pretraining levels☆76Oct 11, 2025Updated 6 months ago