tianyi-lab / R2-T2
Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts"
☆14Updated 2 weeks ago
Alternatives and similar repositories for R2-T2:
Users that are interested in R2-T2 are comparing it to the libraries listed below
- ☆39Updated 4 months ago
- ☆35Updated 3 weeks ago
- ☆13Updated 2 months ago
- ☆15Updated 8 months ago
- ☆16Updated last month
- Code for T-MARS data filtering☆35Updated last year
- [NeurIPS 2023] Official repository for "Distilling Out-of-Distribution Robustness from Vision-Language Foundation Models"☆12Updated 9 months ago
- Control LLM☆12Updated 2 months ago
- DeepPerception: Advancing R1-like Cognitive Visual Perception in MLLMs for Knowledge-Intensive Visual Grounding☆33Updated last week
- Official repo of Progressive Data Expansion: data, code and evaluation☆28Updated last year
- The repository for our paper: Neighboring Perturbations of Knowledge Editing on Large Language Models☆16Updated 10 months ago
- Official Code Repository for EnvGen: Generating and Adapting Environments via LLMs for Training Embodied Agents (COLM 2024)☆29Updated 8 months ago
- Code for paper "Unraveling Cross-Modality Knowledge Conflicts in Large Vision-Language Models."☆42Updated 5 months ago
- [ICLR 2025] MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation☆41Updated 3 months ago
- Code for "Are “Hierarchical” Visual Representations Hierarchical?" in NeurIPS Workshop for Symmetry and Geometry in Neural Representation…☆20Updated last year
- ☆16Updated 2 months ago
- [TMLR 2024] Official implementation of "Sight Beyond Text: Multi-Modal Training Enhances LLMs in Truthfulness and Ethics"☆19Updated last year
- Generalizing from SIMPLE to HARD Visual Reasoning: Can We Mitigate Modality Imbalance in VLMs?☆13Updated last month
- [ICLR 2025] Official Pytorch Implementation of "Mix-LN: Unleashing the Power of Deeper Layers by Combining Pre-LN and Post-LN" by Pengxia…☆19Updated 3 months ago
- [NAACL 2025 Oral] Multimodal Needle in a Haystack (MMNeedle): Benchmarking Long-Context Capability of Multimodal Large Language Models☆40Updated 2 weeks ago
- We introduce new approach, Token Reduction using CLIP Metric (TRIM), aimed at improving the efficiency of MLLMs without sacrificing their…☆13Updated 3 months ago
- ☆30Updated 2 months ago
- PyTorch implementation of StableMask (ICML'24)☆12Updated 9 months ago
- Beyond KV Caching: Shared Attention for Efficient LLMs☆16Updated 8 months ago
- Official implementation of the paper "MMInA: Benchmarking Multihop Multimodal Internet Agents"☆41Updated last month
- The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling☆23Updated last week
- If CLIP Could Talk: Understanding Vision-Language Model Representations Through Their Preferred Concept Descriptions☆16Updated 11 months ago
- ☆18Updated 4 months ago
- Project for SNARE benchmark☆10Updated 9 months ago
- Official implementation of Bootstrapping Language Models via DPO Implicit Rewards☆43Updated 7 months ago