jiaconghu / Model-LEGO
Model LEGO: Creating Models Like Disassembling and Assembling Building Blocks
☆17 Updated 11 months ago
Alternatives and similar repositories for Model-LEGO
Users who are interested in Model-LEGO are comparing it to the repositories listed below.
- Transformer Doctor: Diagnosing and Treating Vision Transformers ☆11 Updated 11 months ago
- Repository for research works and resources related to model reprogramming <https://arxiv.org/abs/2202.10629> ☆64 Updated 3 months ago
- Model Doctor: A Simple Gradient Aggregation Strategy for Diagnosing and Treating CNN Classifiers [https://arxiv.org/pdf/2112.04934.pdf] ☆15 Updated 2 years ago
- AdaMerging: Adaptive Model Merging for Multi-Task Learning. ICLR, 2024. ☆97 Updated last year
- Representation Surgery for Multi-Task Model Merging. ICML, 2024. ☆47 Updated last year
- FusionBench: A Comprehensive Benchmark/Toolkit of Deep Model Fusion ☆189 Updated last week
- [NeurIPS 2024 Spotlight] EMR-Merging: Tuning-Free High-Performance Model Merging ☆74 Updated 9 months ago
- [NeurIPS 2023] Understanding and Improving Feature Learning for Out-of-Distribution Generalization ☆29 Updated 6 months ago
- Code for paper "Merging Multi-Task Models via Weight-Ensembling Mixture of Experts" ☆30 Updated last year
- A curated list of Model Merging methods. ☆94 Updated 2 weeks ago
- The code repository for ICML24 paper "Tabular Insights, Visual Impacts: Transferring Expertise from Tables to Images" ☆22 Updated 9 months ago
- Awesome Low-Rank Adaptation ☆57 Updated 4 months ago
- Official repository of "Localizing Task Information for Improved Model Merging and Compression" [ICML 2024] ☆51 Updated last year
- Awesome-Low-Rank-Adaptation ☆124 Updated last year
- Awesome Learn From Model Beyond Fine-Tuning: A Survey ☆80 Updated last year
- [ICLR 2025] Code&Data for the paper "Super(ficial)-alignment: Strong Models May Deceive Weak Models in Weak-to-Strong Generalization" ☆13 Updated last year
- The official implement of paper "Does Federated Learning Really Need Backpropagation?" ☆23 Updated 2 years ago
- SLTrain: a sparse plus low-rank approach for parameter and memory efficient pretraining (NeurIPS 2024) ☆38 Updated last year
- ☆18 Updated last year
- [ICLR 2024] "Data Distillation Can Be Like Vodka: Distilling More Times For Better Quality" by Xuxi Chen*, Yu Yang*, Zhangyang Wang, Baha… ☆15 Updated last year
- [NeurIPS2023] Official code of "Understanding Contrastive Learning via Distributionally Robust Optimization" ☆40 Updated 2 years ago
- Code for "Surgical Fine-Tuning Improves Adaptation to Distribution Shifts" published at ICLR 2023 ☆29 Updated 2 years ago
- Official code for ICLR 2023 paper "ContraNorm: A Contrastive Learning Perspective on Oversmoothing and Beyond" ☆35 Updated 2 years ago
- [ICML 2024] DPZero: Private Fine-Tuning of Language Models without Backpropagation ☆16 Updated last year
- The repo for HiRA paper ☆34 Updated 5 months ago
- Code for paper "Parameter Efficient Multi-task Model Fusion with Partial Linearization" ☆24 Updated last year
- [NeurIPS'23] FedL2P: Federated Learning to Personalize ☆24 Updated last month
- BackTime: Backdoor Attacks on Multivariate Time Series Forecasting ☆29 Updated 8 months ago
- ☆37 Updated last year
- [ICML2025] Test-Time Learning for Large Language Models ☆37 Updated 3 months ago