tianyi-lab/R2-T2

[ICML 2025] Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts"

☆19

Alternatives and similar repositories for R2-T2

Users who are interested in R2-T2 are comparing it to the repositories listed below.

longrongyang / STGC
Solving Token Gradient Conflict in Mixture-of-Experts for Large Vision-Language Model
☆13 · Feb 11, 2025 · Updated last year
tianyi-lab / C3PO
[COLM 2025] "C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing"
☆21 · Apr 9, 2025 · Updated last year
tianyi-lab / CoSTAR
Cost-Sensitive Toolpath Agent for Multi-turn Image Editing
☆31 · Mar 26, 2025 · Updated last year
tianyi-lab / FaSTAR
[ICLR 2026] Fast-Slow Toolpath Agent with Subroutine Mining for Efficient Multi-turn Image Editing
☆33 · May 30, 2026 · Updated last month
MingLiiii / Gradient_Unified
How Instruction and Reasoning Data shape Post-Training: Data Quality through the Lens of Layer-wise Gradients
☆20 · Jun 17, 2025 · Updated last year
qizhou000 / LiveEdit
[CVPR 2025] Lifelong Knowledge Editing for Vision Language Models with Low-Rank Mixture-of-Experts
☆24 · Jun 22, 2025 · Updated last year
tianyi-lab / ColorBench
[NeurIPS'25] ColorBench: Can VLMs See and Understand the Colorful World? A Comprehensive Benchmark for Color Perception, Reasoning, and R…
☆40 · Sep 27, 2025 · Updated 9 months ago
tianyi-lab / RoMA
Code for "Routing Manifold Alignment Improves Generalization of Mixture-of-Experts LLMs"
☆19 · Nov 6, 2025 · Updated 8 months ago
tianyi-lab / DisCL
[ICCV 2025] Diffusion Curriculum (DisCL)
☆18 · Sep 26, 2025 · Updated 9 months ago
tianyi-lab / MoE-Embedding
[ICLR 2025 Oral] "Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free"
☆92 · Oct 15, 2024 · Updated last year
divyakraman / AerialDiffusion
Codebase for the paper Aerial Diffusion: Text Guided Ground-to-Aerial View Translation from a Single Image using Diffusion Models
☆13 · Oct 3, 2023 · Updated 2 years ago
LinxinS97 / NLPBench
NLPBench: Evaluating NLP-Related Problem-solving Ability in Large Language Models
☆10 · Oct 27, 2023 · Updated 2 years ago
kai-wen-yang / IDAA
[ICML2022] "Identity-Disentangled Adversarial Augmentation for Self-Supervised Learning"
☆10 · Jul 24, 2022 · Updated 4 years ago
Bigyehahaha / M4
The code of "M4: Multi-Proxy Multi-Gate Mixture of Experts Network for Multiple Instance Learning in Histopathology Image Analysis"
☆14 · Mar 31, 2025 · Updated last year
ttw1018 / MoPE-DST
The code for "MoPE: Mixture of Prefix Experts for Zero-Shot Dialogue State Tracking"
☆19 · Jan 25, 2025 · Updated last year
tianyi-lab / RuleR
[NAACL'25] RuleR: Improving LLM Controllability by Rule-based Data Recycling
☆14 · Sep 27, 2025 · Updated 9 months ago
hulianyuyy / iLLaVA
iLLaVA: An Image is Worth Fewer Than 1/3 Input Tokens in Large Multimodal Models (ICLR2026)
☆23 · Jun 24, 2026 · Updated last month
zwbx / Chain-of-Action
☆18 · Jul 8, 2025 · Updated last year
wrmedford / moe-scaling
Scaling Laws for Mixture of Experts Models
☆15 · Feb 25, 2025 · Updated last year
zdebruine / MMVAE
Mixture-of-Experts Multimodal Variational Autoencoder
☆15 · Jul 3, 2025 · Updated last year
kyegomez / MHMoE
Community Implementation of the paper: "Multi-Head Mixture-of-Experts" In PyTorch
☆30 · Updated this week
Kwai-Klear / CE-GPPO
CE-GPPO: Controlling Entropy via Gradient-Preserving Clipping Policy Optimization in Reinforcement Learning
☆16 · Jan 23, 2026 · Updated 6 months ago
OpenNLPLab / HGRN2
HGRN2: Gated Linear RNNs with State Expansion
☆58 · Aug 20, 2024 · Updated last year
JarvisPei / CMoE
[ACL 2026 Main] Analytical FFN-to-MoE Restructuring via Activation Pattern Analysis
☆46 · Jun 30, 2026 · Updated 3 weeks ago
eliahuhorwitz / ProbeX
Official PyTorch Implementation for the "Learning on Model Weights using Tree Experts" paper (CVPR 2025).
☆16 · Feb 11, 2026 · Updated 5 months ago
ChenZiHong-Gavin / MoE-Visualizer
MoE-Visualizer is a tool designed to visualize the selection of experts in Mixture-of-Experts (MoE) models.
☆16 · Apr 8, 2025 · Updated last year
xirui-li / MOSSBench
An implementation for MLLM oversensitivity evaluation
☆18 · Nov 16, 2024 · Updated last year
Taishi-N324 / Drop-Upcycling
[ICLR 2025] Drop-Upcycling: Training Sparse Mixture of Experts with Partial Re-initialization
☆25 · Oct 5, 2025 · Updated 9 months ago
zxiangx / LC-R1
Code for paper: Optimizing Length Compression in Large Reasoning Models
☆29 · Oct 20, 2025 · Updated 9 months ago
CyberAgentAILab / regularized-bon
Code of "Regularized Best-of-N Sampling with Minimum Bayes Risk Objective for Language Model Alignment" (2025).
☆14 · Apr 4, 2025 · Updated last year
tianyi-lab / Mosaic-IT
[ACL'25] Mosaic-IT: Cost-Free Compositional Data Synthesis for Instruction Tuning
☆20 · Sep 27, 2025 · Updated 9 months ago
TaiMingLu / know-dont-tell
☆19 · Oct 14, 2024 · Updated last year
HypherX / Evolution-Analysis
☆25 · Dec 13, 2024 · Updated last year
ECNU-ICALK / CL-MoE
[CVPR 2025] CL-MoE: Enhancing Multimodal Large Language Model with Dual Momentum Mixture-of-Experts for Continual Visual Question Answeri…
☆60 · Jun 16, 2025 · Updated last year
wuxiyang1996 / AutoHallusion
AutoHallusion Codebase (EMNLP 2024)
☆23 · Dec 6, 2024 · Updated last year
The-Swarm-Corporation / Mamba-R1
Mamba R1 represents a novel architecture that combines the efficiency of Mamba's state space models with the scalability of Mixture of Ex…
☆25 · Oct 13, 2025 · Updated 9 months ago
NadavSc / Diff-Mamba
☆22 · Jan 23, 2026 · Updated 6 months ago
purbeshmitra / MOTIF
MOTIF: Modular Thinking via Reinforcement Fine-tuning in LLMs
☆17 · Jul 6, 2025 · Updated last year
ExplainableML / fomo_in_flux
Code and benchmark for the paper: "A Practitioner's Guide to Continual Multimodal Pretraining" [NeurIPS'24]
☆62 · Dec 10, 2024 · Updated last year