Implementation of Gradient Information Optimization (GIO) for effective and scalable training data selection
☆14Jun 22, 2023Updated 2 years ago
Alternatives and similar repositories for gradient-information-optimization
Users that are interested in gradient-information-optimization are comparing it to the libraries listed below
Sorting:
- [ICDCS 2023] Evaluation and Optimization of Gradient Compression for Distributed Deep Learning☆10Apr 28, 2023Updated 2 years ago
- Codebase for ICML submission "DOGE: Domain Reweighting with Generalization Estimation"☆21Feb 29, 2024Updated 2 years ago
- Code for paper: “What Data Benefits My Classifier?” Enhancing Model Performance and Interpretability through Influence-Based Data Selecti…☆23May 17, 2024Updated last year
- Fairer Preferences Elicit Improved Human-Aligned Large Language Model Judgments (Zhou et al., EMNLP 2024)☆14Oct 3, 2024Updated last year
- https://footprints.baulab.info☆18Oct 4, 2024Updated last year
- Using self-play to augment multi-turn text-to-SQL datasets☆11Oct 20, 2022Updated 3 years ago
- Code for gradient rollback, which explains predictions of neural matrix factorization models, as for example used for knowledge base comp…☆21Mar 16, 2021Updated 5 years ago
- ☆44Oct 13, 2023Updated 2 years ago
- ☆30Jun 12, 2023Updated 2 years ago
- A simple web demo with minimal framework using PyTorch and Streamlit to showcase an image classification model.☆12Dec 17, 2022Updated 3 years ago
- Skill-It! A Data-Driven Skills Framework for Understanding and Training Language Models☆48Oct 31, 2023Updated 2 years ago
- Few-shot Learning with Auxiliary Data☆31Dec 8, 2023Updated 2 years ago
- Scalable data valuation using optimal transport (ICLR 2025)☆13Jul 15, 2025Updated 8 months ago
- Official repository for MATES: Model-Aware Data Selection for Efficient Pretraining with Data Influence Models [NeurIPS 2024]☆79Nov 14, 2024Updated last year
- Implementation of semi-supervised learning using PyTorch Lightning☆14Jul 25, 2024Updated last year
- Wrapper for Ckmeans.1d.dp.☆13Mar 20, 2025Updated last year
- ☆22Feb 4, 2026Updated last month
- ☆10Jun 19, 2023Updated 2 years ago
- The code for the Network Binarization via Contrastive Learning, which has been accepted to ECCV 2022.☆14Jul 13, 2022Updated 3 years ago
- This is the official implementation of TAGCOS: Task-agnostic Gradient Clustered Coreset Selection for Instruction Tuning Data☆14Jul 21, 2024Updated last year
- ☆10Mar 18, 2022Updated 4 years ago
- ☆16Jan 16, 2024Updated 2 years ago
- This repository presents the original implementation of Pretraining Data Detection for Large Language Models: A Divergence-based Calibrat…☆22May 21, 2025Updated 10 months ago
- ☆17Feb 18, 2026Updated last month
- Ensemble Learning of Foundation Models☆17Aug 29, 2025Updated 6 months ago
- Influence Functions with (Eigenvalue-corrected) Kronecker-Factored Approximate Curvature☆185Jun 24, 2025Updated 8 months ago
- ☆15Oct 4, 2024Updated last year
- [CVPR 2025] LoRA Recycle: Unlocking Tuning-Free Few-Shot Adaptability in Visual Foundation Models by Recycling Pre-Tuned LoRAs☆13Jun 20, 2025Updated 9 months ago
- PyTorch implementation for ACL 2021 paper "Maria: A Visual Experience Powered Conversational Agent".☆23Sep 19, 2021Updated 4 years ago
- This is the pytorch\DGL implementation of the AMIGO paper.☆10Feb 6, 2024Updated 2 years ago
- ☆10Sep 16, 2022Updated 3 years ago
- ☆23Aug 7, 2023Updated 2 years ago
- Examples to control the Opal C1 from within python.☆16May 7, 2023Updated 2 years ago
- Improved interface for the LSF batch job scheduler☆13Apr 11, 2018Updated 7 years ago
- ☆12Apr 22, 2024Updated last year
- ☆12Jun 21, 2022Updated 3 years ago
- Repository for "I am a Strange Dataset: Metalinguistic Tests for Language Models"☆45Jan 11, 2024Updated 2 years ago
- Efficient misspecification uncertainties for linear regression☆16Updated this week
- ☆13Jan 18, 2023Updated 3 years ago