gstoica27 / KnOTS
Model Merging with SVD to Tie the KnOTS [ICLR 2025]
☆42Updated 3 weeks ago
Alternatives and similar repositories for KnOTS:
Users that are interested in KnOTS are comparing it to the libraries listed below
- Official repository of "LiNeS: Post-training Layer Scaling Prevents Forgetting and Enhances Model Merging"☆24Updated 3 months ago
- [ICLR 2025] Source code for paper "A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegr…☆67Updated 2 months ago
- Code for NOLA, an implementation of "nola: Compressing LoRA using Linear Combination of Random Basis"☆51Updated 5 months ago
- [WACV 2025] Official implementation of "Online-LoRA: Task-free Online Continual Learning via Low Rank Adaptation" by Xiwen Wei, Guihong L…☆31Updated 3 months ago
- Auto Interpretation Pipeline and many other functionalities for Multimodal SAE Analysis.☆104Updated 3 weeks ago
- Unsolvable Problem Detection: Evaluating Trustworthiness of Vision Language Models☆73Updated 5 months ago
- Code and benchmark for the paper: "A Practitioner's Guide to Continual Multimodal Pretraining" [NeurIPS'24]☆51Updated 2 months ago
- Official implementation of ECCV24 paper: POA☆24Updated 6 months ago
- NeuMeta transforms neural networks by allowing a single model to adapt on the fly to different sizes, generating the right weights when n…☆39Updated 3 months ago
- An official implementation of "Catastrophic Failure of LLM Unlearning via Quantization" (ICLR 2025)☆25Updated last week
- ☆26Updated last month
- Official repository for the paper "SwitchHead: Accelerating Transformers with Mixture-of-Experts Attention"☆96Updated 4 months ago
- ☆166Updated last year
- A general framework for inference-time scaling and steering of diffusion models with arbitrary rewards.☆87Updated this week
- [NeurIPS2024] Twin-Merging: Dynamic Integration of Modular Expertise in Model Merging☆48Updated 2 months ago
- Code for the paper - ConceptPrune: Concept Editing in Diffusion Models via Skilled Neuron Pruning☆17Updated 6 months ago
- [NAACL 2025] Multimodal Needle in a Haystack (MMNeedle): Benchmarking Long-Context Capability of Multimodal Large Language Models☆38Updated this week
- [ICLR2025] DiffuGPT and DiffuLLaMA: Scaling Diffusion Language Models via Adaptation from Autoregressive Models☆89Updated 2 months ago
- Official implementation of the paper "MMInA: Benchmarking Multihop Multimodal Internet Agents"☆41Updated last week
- This repo is based on https://github.com/jiaweizzhao/GaLore☆24Updated 5 months ago
- Official pytorch implementation of "Interpreting the Second-Order Effects of Neurons in CLIP"☆33Updated 3 months ago
- Official implementation for Rare-to-Frequent (R2F), ICLR'25, Spotlight☆36Updated last week
- [NeurIPS 2024 Spotlight] EMR-Merging: Tuning-Free High-Performance Model Merging☆44Updated 3 months ago
- Official implementation of the paper The Hidden Language of Diffusion Models☆70Updated last year
- ☆71Updated 6 months ago
- ☆40Updated 7 months ago
- Data distillation benchmark☆52Updated this week
- This is the official repository of our paper "What If We Recaption Billions of Web Images with LLaMA-3 ?"☆128Updated 8 months ago
- ☆21Updated last month