gstoica27 / KnOTS
Model Merging with SVD to Tie the KnOTS [ICLR 2025]
☆45Updated last month
Alternatives and similar repositories for KnOTS:
Users who are interested in KnOTS are comparing it to the libraries listed below:
- Official repository of "LiNeS: Post-training Layer Scaling Prevents Forgetting and Enhances Model Merging"☆24Updated 4 months ago
- [ICLR 2025] Source code for paper "A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegr…☆68Updated 3 months ago
- Auto Interpretation Pipeline and many other functionalities for Multimodal SAE Analysis.☆112Updated last month
- PyTorch implementation of StableMask (ICML'24)☆12Updated 8 months ago
- [NeurIPS2024] Twin-Merging: Dynamic Integration of Modular Expertise in Model Merging☆51Updated 3 months ago
- [ICLR 2025] When Attention Sink Emerges in Language Models: An Empirical View (Spotlight)☆50Updated 4 months ago
- [NeurIPS 2024 Spotlight] EMR-Merging: Tuning-Free High-Performance Model Merging☆49Updated last week
- Code and benchmark for the paper: "A Practitioner's Guide to Continual Multimodal Pretraining" [NeurIPS'24]☆52Updated 3 months ago
- Unsolvable Problem Detection: Evaluating Trustworthiness of Vision Language Models☆74Updated 5 months ago
- Code for paper "Merging Multi-Task Models via Weight-Ensembling Mixture of Experts"☆20Updated 9 months ago
- Official Pytorch implementation of "Interpreting and Editing Vision-Language Representations to Mitigate Hallucinations" (ICLR '25)☆61Updated last week
- Code for NOLA, an implementation of "NOLA: Compressing LoRA using Linear Combination of Random Basis"☆52Updated 6 months ago
- Code for the paper - ConceptPrune: Concept Editing in Diffusion Models via Skilled Neuron Pruning☆18Updated 6 months ago
- Official implementation of ECCV24 paper: POA☆24Updated 7 months ago
- [NAACL 2025 Oral] Multimodal Needle in a Haystack (MMNeedle): Benchmarking Long-Context Capability of Multimodal Large Language Models☆39Updated this week
- Official pytorch implementation of "Interpreting the Second-Order Effects of Neurons in CLIP"☆34Updated 3 months ago
- NeuMeta transforms neural networks by allowing a single model to adapt on the fly to different sizes, generating the right weights when n…☆39Updated 4 months ago
- Official Pytorch Implementation of "Mix-LN: Unleashing the Power of Deeper Layers by Combining Pre-LN and Post-LN" by Pengxiang Li, Lu Yi…☆18Updated 2 months ago
- [WACV 2025] Official implementation of "Online-LoRA: Task-free Online Continual Learning via Low Rank Adaptation" by Xiwen Wei, Guihong L…☆32Updated 3 months ago
- This is the official implementation of "DAPE: Data-Adaptive Positional Encoding for Length Extrapolation"☆36Updated 5 months ago
- [ICLR 2024 Spotlight] Code for the paper "Merge, Then Compress: Demystify Efficient SMoE with Hints from Its Routing Policy"☆74Updated 9 months ago
- [NeurIPS 2024] A task generation and model evaluation system for multimodal language models.☆65Updated 3 months ago