pittisl / mPnP-LLM

Code for paper "Modality Plug-and-Play: Elastic Modality Adaptation in Multimodal LLMs for Embodied AI"

☆11

Alternatives and similar repositories for mPnP-LLM

Users who are interested in mPnP-LLM are comparing it to the repositories listed below.


CatworldLee / Gaussian-Mixture-Mask-Attention
☆8 Updated 8 months ago
ethanlshen / HierNet
Code for "Are “Hierarchical” Visual Representations Hierarchical?" in NeurIPS Workshop for Symmetry and Geometry in Neural Representation…
☆21 Updated last year
mbzuai-oryx / Agent-X
Evaluating Deep Multimodal Reasoning in Vision-Centric Agentic Tasks
☆14 Updated last month
tianyi-lab / R2-T2
[ICML 2025] Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts"
☆15 Updated 4 months ago
top-yun / SPARK
A benchmark dataset and simple code examples for measuring the perception and reasoning of multi-sensor Vision Language models.
☆18 Updated 6 months ago
ethanbar11 / ssm_2d
More dimensions = More fun
☆22 Updated 11 months ago
leo-yangli / VB-LoRA
This repo contains the source code for VB-LoRA: Extreme Parameter Efficient Fine-Tuning with Vector Banks (NeurIPS 2024).
☆39 Updated 9 months ago
leaf1170124460 / Mask3D-SHIFT
This repository provides a multi task benchmark for instance segmentation, depth estimation, and 3D object detection.
☆14 Updated last year
ExplainableML / fomo_in_flux
Code and benchmark for the paper: "A Practitioner's Guide to Continual Multimodal Pretraining" [NeurIPS'24]
☆57 Updated 7 months ago
kyegomez / PaLM2-VAdapter
Implementation of "PaLM2-VAdapter" from the multi-modal model paper: "PaLM2-VAdapter: Progressively Aligned Language Model Makes a Stron…
☆16 Updated 8 months ago
kyegomez / Hedgehog
Implementation of the model "Hedgehog" from the paper: "The Hedgehog & the Porcupine: Expressive Linear Attentions with Softmax Mimicry"
☆14 Updated last year
si0wang / ViCrit
☆18 Updated last month
GuochenZhou / World-Model
A paper list of world model
☆28 Updated 3 months ago
AtsuMiyai / rethinking_rotation
[WACV2023] This is the official PyTorch implementation of our paper "Rethinking Rotation in Self-Supervised Contrastive Learning: Adapt…
☆12 Updated 2 years ago
minhoooo1 / CatMAE
CatMAE
☆14 Updated last year
UCDvision / NOLA
Code for NOLA, an implementation of "NOLA: Compressing LoRA using Linear Combination of Random Basis"
☆56 Updated 10 months ago
4m4n5 / CLIP-Lite
PyTorch implementation of CLIP-Lite | Accepted at AISTATS 2023
☆13 Updated 2 years ago
twinkle0331 / LGTM
[ACL 2023] Code for paper “Tailoring Instructions to Student’s Learning Levels Boosts Knowledge Distillation”(https://arxiv.org/abs/2305.…
☆38 Updated 2 years ago
mlvlab / TokenMixup
Official PyTorch implementation of the NeurIPS 2022 paper TokenMixup
☆48 Updated 2 years ago
kyegomez / MAGVIT2
Open source community's implementation of the model from "LANGUAGE MODEL BEATS DIFFUSION — TOKENIZER IS KEY TO VISUAL GENERATION"
☆15 Updated 8 months ago
fistyee / MixPro
🔥MixPro: Data Augmentation with MaskMix and Progressive Attention Labeling for Vision Transformer [Official, ICLR 2023]
☆21 Updated last year
lapisrocks / DiscreteAdversarialDistillation
[NeurIPS 2023] Official repository for "Distilling Out-of-Distribution Robustness from Vision-Language Foundation Models"
☆12 Updated last year
google-research / fooling-feature-visualizations
Code for "Don't trust your eyes: on the (un)reliability of feature visualizations" (ICML 2024)
☆32 Updated last year
vita-epfl / motion-style-transfer
[CoRL22] Motion Style Transfer: Modular Low-Rank Adaptation for Deep Motion Forecasting
☆22 Updated 2 years ago
UCDvision / PRANC
☆23 Updated 2 years ago
Christina200 / Online-LoRA-official
[WACV 2025] Official implementation of "Online-LoRA: Task-free Online Continual Learning via Low Rank Adaptation" by Xiwen Wei, Guihong L…
☆46 Updated 8 months ago
james-oldfield / muMoE
[NeurIPS'24] Multilinear Mixture of Experts: Scalable Expert Specialization through Factorization
☆33 Updated 9 months ago
mshukor / eP-ALM
[ICCV23] Official implementation of eP-ALM: Efficient Perceptual Augmentation of Language Models.
☆27 Updated last year
ExplainableML / sae-for-vlm
Sparse Autoencoders Learn Monosemantic Features in Vision-Language Models
☆24 Updated 3 months ago
PKU-ML / non_neg
Official Code for ICLR 2024 Paper: Non-negative Contrastive Learning
☆45 Updated last year