oneHuster / MixupE
Code for "MixupE: Understanding and Improving Mixup from Directional Derivative Perspective" (UAI 2023 Oral)
☆28 Updated last year
Alternatives and similar repositories for MixupE:
Users who are interested in MixupE are comparing it to the libraries listed below.
- Training-free data valuation on deep neural network applications. (ICML-2022) ☆24 Updated 2 years ago
- Welcome to the 'In Context Learning Theory' Reading Group ☆28 Updated 3 months ago
- ☆13 Updated 10 months ago
- ☆14 Updated 11 months ago
- Source code of "Task arithmetic in the tangent space: Improved editing of pre-trained models". ☆95 Updated last year
- ZhiJian: A Unifying and Rapidly Deployable Toolbox for Pre-trained Model Reuse ☆51 Updated last year
- AdaMerging: Adaptive Model Merging for Multi-Task Learning. ICLR, 2024. ☆67 Updated 4 months ago
- Official code for the paper "Probing the Decision Boundaries of In-context Learning in Large Language Models". https://arxiv.org/abs/2406.11233… ☆16 Updated 6 months ago
- `dattri` is a PyTorch library for developing, benchmarking, and deploying efficient data attribution algorithms. ☆65 Updated 3 weeks ago
- [NeurIPS 2023] Code release for "Going Beyond Linear Mode Connectivity: The Layerwise Linear Feature Connectivity" ☆17 Updated last year
- A curated list of Model Merging methods. ☆90 Updated 5 months ago
- ☆85 Updated 2 years ago
- [ICCV2023] Dataset Quantization ☆257 Updated last year
- Efficient and Online Dataset Growth Algorithm (with cleanness and diversity awareness) to deal with growing web data ☆20 Updated 6 months ago
- Discover and Cure: Concept-aware Mitigation of Spurious Correlation (ICML 2023) ☆41 Updated 10 months ago
- Lightweight Adapting for Black-Box Large Language Models ☆20 Updated last year
- A Comprehensive Survey of Forgetting in Deep Learning Beyond Continual Learning. TPAMI, 2024. ☆273 Updated this week
- Metrics for "Beyond neural scaling laws: beating power law scaling via data pruning" (NeurIPS 2022 Outstanding Paper Award) ☆55 Updated last year
- Influence Analysis and Estimation - Survey, Papers, and Taxonomy ☆71 Updated last year
- [ICLR 2025] When Attention Sink Emerges in Language Models: An Empirical View (Spotlight) ☆49 Updated 4 months ago
- ICLR 2024, Towards Lossless Dataset Distillation via Difficulty-Aligned Trajectory Matching ☆101 Updated 9 months ago
- Code repository for the paper "Invariant and Transportable Representations for Anti-Causal Domain Shifts" ☆16 Updated 2 years ago
- ☆142 Updated 5 months ago
- Code for paper "Merging Multi-Task Models via Weight-Ensembling Mixture of Experts" ☆19 Updated 8 months ago
- Codebase for decoding compressed trust. ☆23 Updated 9 months ago
- Less is More: Task-aware Layer-wise Distillation for Language Model Compression (ICML2023) ☆33 Updated last year
- ☆26 Updated 2 years ago
- Official implementation for Yuan & Liu & Zhong et al., KV Cache Compression, But What Must We Give in Return? A Comprehensive Benchmark o… ☆67 Updated this week