ahans30 / goldfish-lossView external linksLinks
[NeurIPS 2024] Goldfish Loss: Mitigating Memorization in Generative LLMs
☆94Nov 17, 2024Updated last year
Alternatives and similar repositories for goldfish-loss
Users that are interested in goldfish-loss are comparing it to the libraries listed below
Sorting:
- Gemstones: A Model Suite for Multi-Faceted Scaling Laws (NeurIPS 2025)☆33Sep 28, 2025Updated 4 months ago
- Generating Potent Poisons and Backdoors from Scratch with Guided Diffusion☆11Apr 1, 2024Updated last year
- ☆33Nov 27, 2023Updated 2 years ago
- What do we learn from inverting CLIP models?☆58Mar 6, 2024Updated last year
- Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model☆45Oct 1, 2025Updated 4 months ago
- Official implementation of GOAT model (ICML2023)☆38Jul 3, 2023Updated 2 years ago
- ☆15Feb 21, 2024Updated last year
- KV Cache Steering for Inducing Reasoning in Small Language Models☆46Jul 24, 2025Updated 6 months ago
- A simple and efficient baseline for data attribution☆11Nov 10, 2023Updated 2 years ago
- ☆11Oct 20, 2023Updated 2 years ago
- Official repo for Detecting, Explaining, and Mitigating Memorization in Diffusion Models (ICLR 2024)☆77Apr 3, 2024Updated last year
- Pytorch ImageNet1k Loader with Bounding Boxes.☆13Jan 23, 2022Updated 4 years ago
- ☆14Mar 2, 2025Updated 11 months ago
- [ACL 2025] Analyzing LLMs' Multilingual Knowledge Boundary Cognition Across Languages Through the Lens of Internal Representations☆16Oct 18, 2025Updated 3 months ago
- huggingface에 있는 한국어 데이터 세트☆36Oct 10, 2024Updated last year
- Evaluating Multimodal Generative AI with Korean Educational Standards, NAACL 2025.☆24May 15, 2025Updated 9 months ago
- Cramming 1568 Tokens into a Single Vector and Back Again: Exploring the Limits of Embedding Space Capacity (ACL 2025, oral)☆28Jun 14, 2025Updated 8 months ago
- ☆22Dec 17, 2024Updated last year
- ☆16Jul 16, 2024Updated last year
- ☆21Jul 21, 2025Updated 6 months ago
- Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents"☆31Oct 12, 2023Updated 2 years ago
- This repository contains data, code and models for contextual noncompliance.☆25Jul 18, 2024Updated last year
- ☆18Oct 12, 2022Updated 3 years ago
- Vintix: Action Model via In-Context Reinforcement Learning - - — ICML 2025☆45May 23, 2025Updated 8 months ago
- Code repository for the public reproduction of the language modelling experiments on "MatFormer: Nested Transformer for Elastic Inference…☆31Nov 14, 2023Updated 2 years ago
- KoCommonGEN v2: A Benchmark for Navigating Korean Commonsense Reasoning Challenges in Large Language Models☆25Aug 24, 2024Updated last year
- Official codebase for "Sampling For Learnability", published at NeurIPS 2024☆20Oct 21, 2025Updated 3 months ago
- An automated data pipeline scaling RL to pretraining levels☆72Oct 11, 2025Updated 4 months ago
- VC-FB and MC-FB algorithms from "Zero-Shot Reinforcement Learning from Low Quality Data" (NeurIPS 2024)☆22Jan 14, 2025Updated last year
- ☆210Nov 2, 2023Updated 2 years ago
- The original Shared Recurrent Memory Transformer implementation☆33Jul 11, 2025Updated 7 months ago
- Work in progress.☆79Nov 25, 2025Updated 2 months ago
- FastCuRL: Curriculum Reinforcement Learning with Stage-wise Context Scaling for Efficient LLM Reasoning☆57Oct 10, 2025Updated 4 months ago
- GoldFinch and other hybrid transformer components☆45Jul 20, 2024Updated last year
- ☆31Feb 8, 2026Updated last week
- Fork of Flame repo for training of some new stuff in development☆19Jan 5, 2026Updated last month
- ☆16Jul 17, 2022Updated 3 years ago
- Layer-Condensed KV cache w/ 10 times larger batch size, fewer params and less computation. Dramatic speed up with better task performance…☆156Apr 7, 2025Updated 10 months ago
- Official Implementation for "In-Context Reinforcement Learning from Noise Distillation"☆34Sep 18, 2024Updated last year