gstoica27 / KnOTS

Model Merging with SVD to Tie the KnOTS [ICLR 2025]

☆59

Alternatives and similar repositories for KnOTS

Users who are interested in KnOTS are comparing it to the repositories listed below.

UCDvision / NOLA
Code for NOLA, an implementation of "NOLA: Compressing LoRA using Linear Combination of Random Basis"
☆56 Updated 10 months ago
wang-kee / LiNeS
Official repository of "LiNeS: Post-training Layer Scaling Prevents Forgetting and Enhances Model Merging"
☆29 Updated 8 months ago
ExplainableML / fomo_in_flux
Code and benchmark for the paper: "A Practitioner's Guide to Continual Multimodal Pretraining" [NeurIPS'24]
☆57 Updated 7 months ago
EvolvingLMMs-Lab / multimodal-sae
[ICCV 2025] Auto Interpretation Pipeline and many other functionalities for Multimodal SAE Analysis.
☆144 Updated this week
Wang-ML-Lab / multimodal-needle-in-a-haystack
[NAACL 2025 Oral] Multimodal Needle in a Haystack (MMNeedle): Benchmarking Long-Context Capability of Multimodal Large Language Models
☆47 Updated 2 months ago
g-luo / vlm_cross_modal_reps
Official PyTorch Implementation for Vision-Language Models Create Cross-Modal Task Representations, ICML 2025
☆27 Updated 2 months ago
mrflogs / LoRA-Pro
Official code for our paper, "LoRA-Pro: Are Low-Rank Adapters Properly Optimized?"
☆126 Updated 3 months ago
AtsuMiyai / UPD
[ACL 2025] Unsolvable Problem Detection: Robust Understanding Evaluation for Large Multimodal Models
☆77 Updated last month
roymiles / VeLoRA
[NeurIPS 2024] VeLoRA: Memory Efficient Training using Rank-1 Sub-Token Projections
☆21 Updated 9 months ago
pixeli99 / MixLN
[ICLR 2025] Official Pytorch Implementation of "Mix-LN: Unleashing the Power of Deeper Layers by Combining Pre-LN and Post-LN" by Pengxia…
☆25 Updated 6 months ago
sail-sg / Attention-Sink
[ICLR 2025] When Attention Sink Emerges in Language Models: An Empirical View (Spotlight)
☆99 Updated last week
NUS-HPC-AI-Lab / DD-Ranking
Data distillation benchmark
☆66 Updated last month
zhixuan-lin / forgetting-transformer
[ICLR 2025] Official PyTorch implementation of "Forgetting Transformer: Softmax Attention with a Forget Gate"
☆116 Updated last week
NUS-HPC-AI-Lab / Recurrent-Parameter-Generation
The official implementation of Recurrent Diffusion for Large-Scale Parameter Generation.
☆57 Updated 5 months ago
mu-cai / matryoshka-mm
Matryoshka Multimodal Models
☆111 Updated 5 months ago
kuleshov-group / remdm
Remasking Discrete Diffusion Models with Inference-Time Scaling
☆34 Updated 4 months ago
TianjinYellow / SPAM-Optimizer
☆33 Updated 4 months ago
chenllliang / DnD-Transformer
[ICLR 2025] Source code for paper "A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegr…
☆76 Updated 7 months ago
Qichuzyy / POA
Official implementation of ECCV24 paper: POA
☆24 Updated 11 months ago
multimodal-interpretability / maia
Official implementation of MAIA, A Multimodal Automated Interpretability Agent
☆82 Updated 3 weeks ago
MikaStars39 / StableMask
PyTorch implementation of StableMask (ICML'24)
☆13 Updated last year
nbasyl / DoRA
Official implementation of "DoRA: Weight-Decomposed Low-Rank Adaptation"
☆124 Updated last year
zju-vipa / training_free_model_merging
This repository is the implementation of the paper Training Free Pretrained Model Merging (CVPR2024).
☆30 Updated last year
Infini-AI-Lab / S2FT
☆18 Updated 6 months ago
visual-haystacks / vhs_benchmark
🔥 [ICLR 2025] Official Benchmark Toolkits for "Visual Haystacks: A Vision-Centric Needle-In-A-Haystack Benchmark"
☆29 Updated 5 months ago
horseee / dKV-Cache
☆88 Updated last month
yossigandelsman / second_order_lens
Official pytorch implementation of "Interpreting the Second-Order Effects of Neurons in CLIP"
☆39 Updated 8 months ago
ByungKwanLee / Phantom
[Under Review] Official PyTorch implementation code for realizing the technical part of Phantom of Latent representing equipped with enla…
☆60 Updated 9 months ago
katiekang1998 / reasoning_generalization
☆33 Updated 6 months ago
shulin16 / MMInA
Official implementation of the paper "MMInA: Benchmarking Multihop Multimodal Internet Agents"
☆46 Updated 4 months ago