tanganke / peta
Code for paper "Parameter Efficient Multi-task Model Fusion with Partial Linearization"
☆19Updated 6 months ago
Alternatives and similar repositories for peta:
Users that are interested in peta are comparing it to the libraries listed below
- Official repository of "Localizing Task Information for Improved Model Merging and Compression" [ICML 2024]☆42Updated 5 months ago
- AdaMerging: Adaptive Model Merging for Multi-Task Learning. ICLR, 2024.☆70Updated 4 months ago
- Representation Surgery for Multi-Task Model Merging. ICML, 2024.☆40Updated 5 months ago
- [NeurIPS 2024 Spotlight] EMR-Merging: Tuning-Free High-Performance Model Merging☆52Updated 3 weeks ago
- Source code of "Task arithmetic in the tangent space: Improved editing of pre-trained models".☆97Updated last year
- Localize-and-Stitch: Efficient Model Merging via Sparse Task Arithmetic☆23Updated 2 months ago
- Code for paper "Merging Multi-Task Models via Weight-Ensembling Mixture of Experts"☆21Updated 9 months ago
- Implementaiton of "DiLM: Distilling Dataset into Language Model for Text-level Dataset Distillation" (accepted by NAACL2024 Findings)".☆17Updated last month
- [ICML 2024] Safety Fine-Tuning at (Almost) No Cost: A Baseline for Vision Large Language Models.☆61Updated 2 months ago
- ECSO (Make MLLM safe without neither training nor any external models!) (https://arxiv.org/abs/2403.09572)☆22Updated 4 months ago
- ☆48Updated 4 months ago
- Data distillation benchmark☆58Updated this week
- source code for NeurIPS'24 paper "HaloScope: Harnessing Unlabeled LLM Generations for Hallucination Detection"☆31Updated 2 months ago
- Codes for Merging Large Language Models☆29Updated 7 months ago
- ☆50Updated last year
- This is the official code for the paper "Lazy Safety Alignment for Large Language Models against Harmful Fine-tuning" (NeurIPS2024)☆17Updated 6 months ago
- ☆17Updated 3 months ago
- [ICML 2024] Unveiling and Harnessing Hidden Attention Sinks: Enhancing Large Language Models without Training through Attention Calibrati…☆33Updated 8 months ago
- LCA-on-the-line (ICML 2024 Oral)☆11Updated last month
- A curated list of Model Merging methods.☆91Updated 6 months ago
- ☆21Updated 9 months ago
- Elucidated Dataset Condensation (NeurIPS 2024)☆20Updated 5 months ago
- Official Repository for The Paper: Safety Alignment Should Be Made More Than Just a Few Tokens Deep☆82Updated 8 months ago
- LoRA-XS: Low-Rank Adaptation with Extremely Small Number of Parameters☆31Updated 2 weeks ago
- Analyzing and Reducing Catastrophic Forgetting in Parameter Efficient Tuning☆30Updated 4 months ago
- (ICLR2025 Spotlight) DEEM: Official implementation of Diffusion models serve as the eyes of large language models for image perception.☆26Updated 2 weeks ago
- Awesome-Low-Rank-Adaptation☆83Updated 5 months ago
- ☆11Updated last month
- ☆17Updated last week
- ☆11Updated last year