furiosa-ai / ssm-peft
☆16Updated 6 months ago
Alternatives and similar repositories for ssm-peft
Users that are interested in ssm-peft are comparing it to the libraries listed below
Sorting:
- LoRA-XS: Low-Rank Adaptation with Extremely Small Number of Parameters☆33Updated 2 months ago
- ☆28Updated 3 months ago
- Data distillation benchmark☆59Updated this week
- The this is the official implementation of "DAPE: Data-Adaptive Positional Encoding for Length Extrapolation"☆38Updated 7 months ago
- Intriguing Properties of Data Attribution on Diffusion Models (ICLR 2024)☆28Updated last year
- [NeurIPS 2024 Spotlight] EMR-Merging: Tuning-Free High-Performance Model Merging☆58Updated 2 months ago
- source code for paper "Riemannian Preconditioned LoRA for Fine-Tuning Foundation Models"☆25Updated 10 months ago
- Model Merging with SVD to Tie the KnOTS [ICLR 2025]☆54Updated last month
- Official repo for Detecting, Explaining, and Mitigating Memorization in Diffusion Models (ICLR 2024)☆72Updated last year
- Official repository of "Localizing Task Information for Improved Model Merging and Compression" [ICML 2024]☆44Updated 6 months ago
- [EMNLP 2023, Main Conference] Sparse Low-rank Adaptation of Pre-trained Language Models☆76Updated last year
- ☆28Updated 2 months ago
- Code for the paper - ConceptPrune: Concept Editing in Diffusion Models via Skilled Neuron Pruning☆19Updated 9 months ago
- ☆27Updated 3 weeks ago
- Official repository of "LiNeS: Post-training Layer Scaling Prevents Forgetting and Enhances Model Merging"☆26Updated 6 months ago
- Source code of "Task arithmetic in the tangent space: Improved editing of pre-trained models".☆101Updated last year
- Stick-breaking attention☆53Updated 2 months ago
- Official repository of the "Transformer Fusion with Optimal Transport" paper, published as a conference paper at ICLR 2024.☆27Updated last year
- Recycling diverse models☆44Updated 2 years ago
- Task Singular Vectors: Reducing Task Interference in Model Merging. Merge models avoiding task interference through separable models.☆13Updated last week
- Code for NOLA, an implementation of "nola: Compressing LoRA using Linear Combination of Random Basis"☆54Updated 8 months ago
- ☆71Updated 2 months ago
- ☆35Updated last year
- Code for paper "Parameter Efficient Multi-task Model Fusion with Partial Linearization"☆21Updated 8 months ago
- Towards Meta-Pruning via Optimal Transport, ICLR 2024 (Spotlight)☆16Updated 5 months ago
- Codes for Merging Large Language Models☆29Updated 9 months ago
- Unofficial Implementation of Selective Attention Transformer☆16Updated 6 months ago
- source code for NeurIPS'23 paper "Dream the Impossible: Outlier Imagination with Diffusion Models"☆68Updated 3 weeks ago
- Lowering PyTorch's Memory Consumption for Selective Differentiation☆10Updated 8 months ago
- Code for "A Sober Look at Progress in Language Model Reasoning" paper☆45Updated last week