daeveraert / gradient-information-optimization
Implementation of Gradient Information Optimization (GIO) for effective and scalable training data selection
☆11Updated last year
Related projects ⓘ
Alternatives and complementary repositories for gradient-information-optimization
- ☆31Updated last year
- ☆61Updated 2 years ago
- ☆26Updated 6 months ago
- Official repository for ICLR 2024 Spotlight paper "Large Language Models Are Not Robust Multiple Choice Selectors"☆35Updated 5 months ago
- Long Is More for Alignment: A Simple but Tough-to-Beat Baseline for Instruction Fine-Tuning [ICML 2024]☆14Updated 6 months ago
- ☆15Updated last week
- ☆44Updated 10 months ago
- A Kernel-Based View of Language Model Fine-Tuning https://arxiv.org/abs/2210.05643☆69Updated last year
- [NeurIPS'23] Aging with GRACE: Lifelong Model Editing with Discrete Key-Value Adaptors☆69Updated 8 months ago
- AI Logging for Interpretability and Explainability🔬☆88Updated 5 months ago
- [ICLR 2023] "Sparse MoE as the New Dropout: Scaling Dense and Self-Slimmable Transformers" by Tianlong Chen*, Zhenyu Zhang*, Ajay Jaiswal…☆44Updated last year
- Codebase for ICML submission "DOGE: Domain Reweighting with Generalization Estimation"☆14Updated 8 months ago
- ☆27Updated last year
- `dattri` is a PyTorch library for developing, benchmarking, and deploying efficient data attribution algorithms.☆31Updated last week
- AdaMerging: Adaptive Model Merging for Multi-Task Learning. ICLR, 2024.☆49Updated 2 weeks ago
- ☆26Updated 9 months ago
- A Survey of Hallucination in Large Foundation Models☆50Updated 10 months ago
- DataInf: Efficiently Estimating Data Influence in LoRA-tuned LLMs and Diffusion Models (ICLR 2024)☆52Updated last month
- Augmenting Statistical Models with Natural Language Parameters☆17Updated last month
- This is the oficial repository for "Parameter-Efficient Multi-task Tuning via Attentional Mixtures of Soft Prompts" (EMNLP 2022)☆97Updated last year
- Repo accompanying our paper "Do Llamas Work in English? On the Latent Language of Multilingual Transformers".☆57Updated 8 months ago
- Skill-It! A Data-Driven Skills Framework for Understanding and Training Language Models☆41Updated last year
- [ACL 2024] Code and data for "Machine Unlearning of Pre-trained Large Language Models"☆45Updated last month
- This is the official implementation of TAGCOS: Task-agnostic Gradient Clustered Coreset Selection for Instruction Tuning Data☆9Updated 3 months ago
- Code associated with Tuning Language Models by Proxy (Liu et al., 2024)☆96Updated 7 months ago
- Restore safety in fine-tuned language models through task arithmetic☆26Updated 7 months ago
- Official repository of "Localizing Task Information for Improved Model Merging and Compression" [ICML 2024]☆34Updated 2 weeks ago
- LoFiT: Localized Fine-tuning on LLM Representations☆21Updated 4 months ago
- Learning adapter weights from task descriptions☆15Updated last year
- Repository for "Propagating Knowledge Updates to LMs Through Distillation" (NeurIPS 2023).☆24Updated 2 months ago