pixas / NoRMLinks
ICLR 2025
☆30Updated 8 months ago
Alternatives and similar repositories for NoRM
Users that are interested in NoRM are comparing it to the libraries listed below
Sorting:
- ☆125Updated last year
- Official code for our paper, "LoRA-Pro: Are Low-Rank Adapters Properly Optimized? "☆143Updated 10 months ago
- CLIP-MoE: Mixture of Experts for CLIP☆55Updated last year
- This repo contains the source code for VB-LoRA: Extreme Parameter Efficient Fine-Tuning with Vector Banks (NeurIPS 2024).☆42Updated last year
- [NeurIPS 2025] Unsupervised Post-Training for Multi-Modal LLM Reasoning via GRPO☆78Updated 3 months ago
- LoRA-One: One-Step Full Gradient Could Suffice for Fine-Tuning Large Language Models, Provably and Efficiently (ICML2025 Oral)☆28Updated 3 months ago
- Representation Surgery for Multi-Task Model Merging. ICML, 2024.☆47Updated last year
- A generalized framework for subspace tuning methods in parameter efficient fine-tuning.☆171Updated last week
- The this is the official implementation of "DAPE: Data-Adaptive Positional Encoding for Length Extrapolation"☆41Updated last year
- (ICLR 2025 Spotlight) DEEM: Official implementation of Diffusion models serve as the eyes of large language models for image perception.☆48Updated 7 months ago
- [NeurIPS 2025] VeriThinker: Learning to Verify Makes Reasoning Model Efficient☆64Updated 4 months ago
- [EMNLP 2023, Main Conference] Sparse Low-rank Adaptation of Pre-trained Language Models☆84Updated last year
- SophiaVL-R1: Reinforcing MLLMs Reasoning with Thinking Reward☆91Updated 6 months ago
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning☆91Updated 11 months ago
- CorDA: Context-Oriented Decomposition Adaptation of Large Language Models for task-aware parameter-efficient fine-tuning(NeurIPS 2024)☆53Updated last year
- Official PyTorch implementation of the paper "Accelerating Diffusion Large Language Models with SlowFast Sampling: The Three Golden Princ…☆40Updated 6 months ago
- [ACL 2023] PuMer: Pruning and Merging Tokens for Efficient Vision Language Models☆36Updated last year
- [NeurIPS 2024] For paper Parameter Competition Balancing for Model Merging☆48Updated last year
- PyTorch implementation of StableMask (ICML'24)☆15Updated last year
- [AAAI 2025] HiRED strategically drops visual tokens in the image encoding stage to improve inference efficiency for High-Resolution Visio…☆44Updated 9 months ago
- [MM2024, oral] "Self-Supervised Visual Preference Alignment" https://arxiv.org/abs/2404.10501☆61Updated last year
- [ICLR 2025] Dynamic Mixture of Experts: An Auto-Tuning Approach for Efficient Transformer Models☆153Updated 7 months ago
- [NeurIPS 2024] A Novel Rank-Based Metric for Evaluating Large Language Models☆57Updated 8 months ago
- WeGeFT: Weight‑Generative Fine‑Tuning for Multi‑Faceted Efficient Adaptation of Large Models☆22Updated 7 months ago
- Empowering Small VLMs to Think with Dynamic Memorization and Exploration☆15Updated 2 months ago
- ☆152Updated last year
- ☆46Updated last year
- Codes for Merging Large Language Models☆35Updated last year
- Data distillation benchmark☆72Updated 7 months ago
- This repository is the implementation of the paper Training Free Pretrained Model Merging (CVPR2024).☆32Updated last year