pittisl / mPnP-LLM
Code for paper "Modality Plug-and-Play: Elastic Modality Adaptation in Multimodal LLMs for Embodied AI"
☆11 Updated last year
Alternatives and similar repositories for mPnP-LLM
Users who are interested in mPnP-LLM are comparing it to the libraries listed below.
- ☆8 Updated 7 months ago
- ☆12 Updated 3 weeks ago
- Evaluating Deep Multimodal Reasoning in Vision-Centric Agentic Tasks ☆14 Updated 2 weeks ago
- This repository provides a multi task benchmark for instance segmentation, depth estimation, and 3D object detection. ☆14 Updated last year
- Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts" ☆15 Updated 3 months ago
- Official implementation of the WACV 2025 paper "3D Part Segmentation via Geometric Aggregation of 2D Visual Features" ☆19 Updated 2 weeks ago
- Hierarchical State Space Models ☆47 Updated last year
- Open source community's implementation of the model from "LANGUAGE MODEL BEATS DIFFUSION — TOKENIZER IS KEY TO VISUAL GENERATION" ☆16 Updated 7 months ago
- Official implementation of RMoE (Layerwise Recurrent Router for Mixture-of-Experts) ☆22 Updated 10 months ago
- [ACL 2023] PuMer: Pruning and Merging Tokens for Efficient Vision Language Models ☆30 Updated 8 months ago
- [ICLR 2023] CoVLM: Composing Visual Entities and Relationships in Large Language Models Via Communicative Decoding ☆45 Updated 2 weeks ago
- HGRN2: Gated Linear RNNs with State Expansion ☆55 Updated 10 months ago
- Implementation of "PaLM2-VAdapter:" from the multi-modal model paper: "PaLM2-VAdapter: Progressively Aligned Language Model Makes a Stron… ☆16 Updated 7 months ago
- The official code of "PixelWorld: Towards Perceiving Everything as Pixels" ☆14 Updated 4 months ago
- More dimensions = More fun ☆22 Updated 11 months ago
- [ACL 2023] Code for paper “Tailoring Instructions to Student’s Learning Levels Boosts Knowledge Distillation” (https://arxiv.org/abs/2305.… ☆38 Updated 2 years ago
- Collect papers about Mamba (a selective state space model). ☆14 Updated 10 months ago
- Implementation of the model: "(MC-ViT)" from the paper: "Memory Consolidation Enables Long-Context Video Understanding" ☆20 Updated 2 months ago
- [ICCV23] Official implementation of eP-ALM: Efficient Perceptual Augmentation of Language Models. ☆27 Updated last year
- Official code for the paper "Attention as a Hypernetwork" ☆39 Updated last year
- [WACV 2025] Official implementation of "Online-LoRA: Task-free Online Continual Learning via Low Rank Adaptation" by Xiwen Wei, Guihong L… ☆41 Updated 7 months ago
- A benchmark dataset and simple code examples for measuring the perception and reasoning of multi-sensor Vision Language models. ☆18 Updated 6 months ago
- Large Language and Sensor Assistant: Multimodal LLM for sensor and human activity interpretation ☆29 Updated last month
- Project for SNARE benchmark ☆11 Updated last year
- Implementation of the model "Hedgehog" from the paper: "The Hedgehog & the Porcupine: Expressive Linear Attentions with Softmax Mimicry" ☆14 Updated last year
- ☆24 Updated last year
- Code repository for IMU2CLIP (https://arxiv.org/pdf/2210.14395.pdf) ☆93 Updated last year
- Code for "Don't trust your eyes: on the (un)reliability of feature visualizations" (ICML 2024) ☆32 Updated last year
- This repository is a collection of research papers on World Models. ☆39 Updated last year
- Pytorch Implementation of CLIP-Lite | Accepted at AISTATS 2023 ☆13 Updated 2 years ago