furiosa-ai / ssm-peftLinks
Parameter-Efficient Fine-Tuning of State Space Models (ICML 2025)
☆17Updated last month
Alternatives and similar repositories for ssm-peft
Users that are interested in ssm-peft are comparing it to the libraries listed below
Sorting:
- Code for visualizing the loss landscape of neural nets☆10Updated 4 years ago
- [ICLR 2024] Dynamic Sparse Training with Structured Sparsity☆18Updated last year
- LoRA-XS: Low-Rank Adaptation with Extremely Small Number of Parameters☆35Updated 4 months ago
- ☆31Updated last month
- source code for paper "Riemannian Preconditioned LoRA for Fine-Tuning Foundation Models"☆27Updated last year
- Model Merging with SVD to Tie the KnOTS [ICLR 2025]☆59Updated 3 months ago
- The official code for "Advancing Multimodal Large Language Models with Quantization-Aware Scale Learning for Efficient Adaptation" | [MM2…☆14Updated 7 months ago
- Code for ECML-PKDD 2022 Paper --- CMG: A Class-Mixed Generation Approach to Out-of-Distribution Detection☆12Updated 2 years ago
- Unofficial Implementation of Selective Attention Transformer☆17Updated 8 months ago
- [CVPR 2025] An Implementation of the paper "Pre-Instruction Data Selection for Visual Instruction Tuning"☆12Updated last month
- Official Repository for "Unbalancedness in Neural Monge Maps Improves Unpaired Domain Translation" [ICLR 2024]☆15Updated last year
- Open source community's implementation of the model from "LANGUAGE MODEL BEATS DIFFUSION — TOKENIZER IS KEY TO VISUAL GENERATION"☆15Updated 8 months ago
- [NeurIPS 2024] VeLoRA : Memory Efficient Training using Rank-1 Sub-Token Projections☆21Updated 9 months ago
- Code for NOLA, an implementation of "nola: Compressing LoRA using Linear Combination of Random Basis"☆56Updated 10 months ago
- ☆16Updated 9 months ago
- ☆33Updated 4 months ago
- ☆30Updated 5 months ago
- ☆14Updated 7 months ago
- Why Do We Need Weight Decay in Modern Deep Learning? [NeurIPS 2024]☆66Updated 9 months ago
- Official Code for Paper "Think While You Generate: Discrete Diffusion with Planned Denoising" [ICLR 2025]☆68Updated 2 months ago
- [CVPR '25] Official implementation of the paper "Rethinking Few-Shot Adaptation of Vision-Language Models in Two Stages", accepted at (an…☆17Updated 3 months ago
- The this is the official implementation of "DAPE: Data-Adaptive Positional Encoding for Length Extrapolation"☆38Updated 9 months ago
- Awesome Large Vision-Language Model: A Curated List of Large Vision-Language Model☆27Updated 9 months ago
- This is a PyTorch implementation of the paperViP A Differentially Private Foundation Model for Computer Vision☆36Updated 2 years ago
- Official Code for ICLR 2024 Paper: Non-negative Contrastive Learning☆45Updated last year
- DeciMamba: Exploring the Length Extrapolation Potential of Mamba (ICLR 2025)☆28Updated 3 months ago
- [CVPR '24] Official implementation of the paper "Multiflow: Shifting Towards Task-Agnostic Vision-Language Pruning".☆23Updated 4 months ago
- Simple Guidance Mechanisms for Discrete Diffusion Models☆46Updated 7 months ago
- Code and benchmark for the paper: "A Practitioner's Guide to Continual Multimodal Pretraining" [NeurIPS'24]☆57Updated 7 months ago
- Task Singular Vectors: Reducing Task Interference in Model Merging. Merge models avoiding task interference through separable models.☆19Updated last month