daeveraert / gradient-information-optimization
Implementation of Gradient Information Optimization (GIO) for effective and scalable training data selection
☆13Updated last year
Alternatives and similar repositories for gradient-information-optimization:
Users that are interested in gradient-information-optimization are comparing it to the libraries listed below
- ☆35Updated last year
- A Kernel-Based View of Language Model Fine-Tuning https://arxiv.org/abs/2210.05643☆76Updated last year
- Codebase for ICML submission "DOGE: Domain Reweighting with Generalization Estimation"☆17Updated last year
- Augmenting Statistical Models with Natural Language Parameters☆26Updated 7 months ago
- Long Is More for Alignment: A Simple but Tough-to-Beat Baseline for Instruction Fine-Tuning [ICML 2024]☆17Updated 11 months ago
- ☆49Updated last year
- Bayesian low-rank adaptation for large language models☆23Updated 11 months ago
- ☆29Updated 11 months ago
- ☆11Updated 2 years ago
- Providing the answer to "How to do patching on all available SAEs on GPT-2?". It is an official repository of the implementation of the p…☆11Updated 3 months ago
- ☆66Updated 3 years ago
- ☆18Updated 9 months ago
- Official repository for ICLR 2024 Spotlight paper "Large Language Models Are Not Robust Multiple Choice Selectors"☆38Updated 10 months ago
- Is In-Context Learning Sufficient for Instruction Following in LLMs? [ICLR 2025]☆29Updated 3 months ago
- ☆12Updated 4 months ago
- Fairer Preferences Elicit Improved Human-Aligned Large Language Model Judgments (Zhou et al., EMNLP 2024)☆13Updated 6 months ago
- ☆28Updated last year
- Code for preprint: Summarizing Differences between Text Distributions with Natural Language☆42Updated 2 years ago
- [NeurIPS'23] Aging with GRACE: Lifelong Model Editing with Discrete Key-Value Adaptors☆75Updated 4 months ago
- Code for "Tracing Knowledge in Language Models Back to the Training Data"☆37Updated 2 years ago
- ☆27Updated last month
- ConceptVectors Benchmark and Code for the paper "Intrinsic Evaluation of Unlearning Using Parametric Knowledge Traces"☆35Updated 2 months ago
- ☆28Updated 2 months ago
- Methods and evaluation for aligning language models temporally☆29Updated last year
- ☆42Updated last year
- ☆14Updated 6 months ago
- [ICLR 2023] "Sparse MoE as the New Dropout: Scaling Dense and Self-Slimmable Transformers" by Tianlong Chen*, Zhenyu Zhang*, Ajay Jaiswal…☆51Updated 2 years ago
- Learning adapter weights from task descriptions☆17Updated last year
- Model zoo for different kinds of uncertainty quantification methods used in Natural Language Processing, implemented in PyTorch.☆53Updated last year
- This is the official implementation for our ACL 2024 paper: "Causal Estimation of Memorisation Profiles".☆21Updated last month