furiosa-ai / ssm-peft
Parameter-Efficient Fine-Tuning of State Space Models (ICML 2025)
☆18 Updated 3 months ago
Alternatives and similar repositories for ssm-peft
Users who are interested in ssm-peft are comparing it to the libraries listed below.
- LoRA-XS: Low-Rank Adaptation with Extremely Small Number of Parameters ☆37 Updated last month
- A list of diffusion papers in the NLP domain that I have read, mainly on text generation; the table will continue to be updated. ☆62 Updated 5 months ago
- Automatically fetches diffusion NLP papers from arXiv. More paper information can be found in another repository, "Diffusion-LM-Papers". ☆179 Updated this week
- Official PyTorch implementation for ICLR2025 paper "Scaling up Masked Diffusion Models on Text" ☆296 Updated 9 months ago
- [NeurIPS 2024] Simple and Effective Masked Diffusion Language Model ☆503 Updated 3 months ago
- A collection of papers on discrete diffusion models ☆160 Updated 2 months ago
- AnchorAttention: Improved attention for LLMs long-context training ☆212 Updated 8 months ago
- Source code of "Task arithmetic in the tangent space: Improved editing of pre-trained models". ☆104 Updated 2 years ago
- Reproduce ICLR2025 Energy-Based Diffusion Language Models for Text Generation ☆31 Updated 2 months ago
- A curated list of papers with interesting empirical studies and insights on deep learning. Continually updated. ☆363 Updated this week
- A curated list for awesome discrete diffusion models resources. ☆450 Updated last week
- Official repo for Detecting, Explaining, and Mitigating Memorization in Diffusion Models (ICLR 2024) ☆76 Updated last year
- Code accompanying the paper "Massive Activations in Large Language Models" ☆179 Updated last year
- ☆31 Updated 3 months ago
- PiSSA: Principal Singular Values and Singular Vectors Adaptation of Large Language Models (NeurIPS 2024 Spotlight) ☆380 Updated 2 months ago
- Official PyTorch implementation of DistiLLM-2: A Contrastive Approach Boosts the Distillation of LLMs (ICML 2025 Oral) ☆39 Updated 2 months ago
- [ICML'24] Official code for the paper "Revisiting Zeroth-Order Optimization for Memory-Efficient LLM Fine-Tuning: A Benchmark". ☆111 Updated 2 months ago
- Awesome-Low-Rank-Adaptation ☆116 Updated 11 months ago
- The repo for HiRA paper ☆29 Updated 2 months ago
- [ICML 2024 Best Paper] Discrete Diffusion Modeling by Estimating the Ratios of the Data Distribution (https://arxiv.org/abs/2310.16834) ☆637 Updated last year
- Official repository of "Localizing Task Information for Improved Model Merging and Compression" [ICML 2024] ☆49 Updated 10 months ago
- [NeurIPS'24 Oral] HydraLoRA: An Asymmetric LoRA Architecture for Efficient Fine-Tuning ☆222 Updated 9 months ago
- ☆14 Updated last year
- The most open diffusion language model for code generation, releasing pretraining, evaluation, inference, and checkpoints. ☆247 Updated last week
- ☆209 Updated 11 months ago
- [NeurIPS'25] dKV-Cache: The Cache for Diffusion Language Models ☆96 Updated 4 months ago
- Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities. arXiv:2408.07666. ☆534 Updated this week
- ☆58 Updated 9 months ago
- Official Code for Paper "Think While You Generate: Discrete Diffusion with Planned Denoising" [ICLR 2025] ☆74 Updated 4 months ago
- ☆74 Updated 3 years ago