RoyZry98 / T-REX-PytorchLinks
[Arxiv 2025] Official code for T-REX: Mixture-of-Rank-One-Experts with semantic-aware Intuition for Multi-task Large Language Model Finetuning
☆17Updated 8 months ago
Alternatives and similar repositories for T-REX-Pytorch
Users that are interested in T-REX-Pytorch are comparing it to the libraries listed below
Sorting:
- Official repository for VisionZip (CVPR 2025)☆405Updated 6 months ago
- [ICML'25] Official implementation of paper "SparseVLM: Visual Token Sparsification for Efficient Vision-Language Model Inference" and "Sp…☆237Updated last month
- [CVPR 2025] Official implementation of paper "MoVE-KD: Knowledge Distillation for VLMs with Mixture of Visual Encoders".☆48Updated 8 months ago
- A paper list of some recent works about Token Compress for Vit and VLM☆824Updated last week
- [ICLR 2025] VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation☆418Updated 9 months ago
- A collection of token reduction (token pruning, merging, clustering, etc.) techniques for ML/AI☆315Updated this week
- [NeurIPS 2025] Official code for paper: Beyond Attention or Similarity: Maximizing Conditional Diversity for Token Pruning in MLLMs.☆86Updated 4 months ago
- [TMLR 2025🔥] A survey for the autoregressive models in vision.☆787Updated 3 months ago
- 📖 This is a repository for organizing papers, codes, and other resources related to unified multimodal models.☆349Updated last month
- a brief repo about paper research☆15Updated last year
- Give us minutes, we give back a faster Mamba. The official implementation of "Faster Vision Mamba is Rebuilt in Minutes via Merged Token …☆40Updated last year
- [CVPR 2025] 🔥 Official impl. of "TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation".☆433Updated 6 months ago
- Long-RL: Scaling RL to Long Sequences (NeurIPS 2025)☆691Updated 4 months ago
- Official repository of Vision Test-Time Training☆49Updated 2 months ago
- This is a repo to track the latest autoregressive visual generation papers.☆429Updated 7 months ago
- [ECCV 2024 Oral] Code for paper: An Image is Worth 1/2 Tokens After Layer 2: Plug-and-Play Inference Acceleration for Large Vision-Langua…☆553Updated last year
- [AAAI-2025] The offical code for SiTo (Similarity-based Token Pruning for Stable Diffusion Models)☆43Updated 8 months ago
- 📚 Collection of token-level model compression resources.☆190Updated 5 months ago
- Project Page For "Seg-Zero: Reasoning-Chain Guided Segmentation via Cognitive Reinforcement"☆597Updated 3 weeks ago
- [Neurips'24 Spotlight] Visual CoT: Advancing Multi-Modal Language Models with a Comprehensive Dataset and Benchmark for Chain-of-Thought …☆424Updated last year
- [NeurIPS'24]Efficient and accurate memory saving method towards W4A4 large multi-modal models.☆95Updated last year
- [TMLR 2026] Survey: https://arxiv.org/pdf/2507.20198☆299Updated this week
- ☆41Updated 10 months ago
- [NeurIPS 24] MoE Jetpack: From Dense Checkpoints to Adaptive Mixture of Experts for Vision Tasks☆134Updated last year
- The first decoder-only multimodal state space model☆100Updated 8 months ago
- [ICLR 2025] The offical implementation of "PSEC: Skill Expansion and Composition in Parameter Space", a new framework designed to facilit…☆63Updated 11 months ago
- 📚 Collection of awesome generation acceleration resources.☆387Updated 7 months ago
- [ICLR & NeurIPS 2025] Repository for Show-o series, One Single Transformer to Unify Multimodal Understanding and Generation.☆1,865Updated last month
- ☆20Updated last week
- [CVPR 2025] The First Investigation of CoT Reasoning (RL, TTS, Reflection) in Image Generation☆856Updated 8 months ago