ambisinister / lossfreebalance
toy reproduction of Auxiliary-Loss-Free Load Balancing Strategy for Mixture-of-Experts
☆13Updated 8 months ago
Alternatives and similar repositories for lossfreebalance
Users that are interested in lossfreebalance are comparing it to the libraries listed below
Sorting:
- Adapting LLaMA Decoder to Vision Transformer☆28Updated 11 months ago
- [MM2024, oral] "Self-Supervised Visual Preference Alignment" https://arxiv.org/abs/2404.10501☆55Updated 9 months ago
- [CVPR 2025] CoDe: Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient☆100Updated last month
- The official implementation for MTLoRA: A Low-Rank Adaptation Approach for Efficient Multi-Task Learning (CVPR '24)☆48Updated 2 months ago
- Mixture of Attention Heads☆44Updated 2 years ago
- Official code for our paper, "LoRA-Pro: Are Low-Rank Adapters Properly Optimized? "☆116Updated last month
- CLIP-MoE: Mixture of Experts for CLIP☆34Updated 7 months ago
- Official implementation of Next Block Prediction: Video Generation via Semi-Autoregressive Modeling☆31Updated 3 months ago
- [CVPR 2025] RAP: Retrieval-Augmented Personalization☆51Updated last month
- [Preprint] GMem: A Modular Approach for Ultra-Efficient Generative Models☆34Updated 2 months ago
- [NeurIPS'24] Official PyTorch Implementation of Seeing the Image: Prioritizing Visual Correlation by Contrastive Alignment☆57Updated 7 months ago
- [ICLR2025] γ -MOD: Mixture-of-Depth Adaptation for Multimodal Large Language Models☆36Updated 3 months ago
- Multi-Stage Vision Token Dropping: Towards Efficient Multimodal Large Language Model☆28Updated 4 months ago
- ☆101Updated 10 months ago
- A generalized framework for subspace tuning methods in parameter efficient fine-tuning.☆141Updated 3 months ago
- [CVPR2024] ModaVerse: Efficiently Transforming Modalities with LLMs☆29Updated 10 months ago
- Recent Advances on MLLM's Reasoning Ability☆25Updated last month
- [NeurIPS 2024] Learning-to-Cache: Accelerating Diffusion Transformer via Layer Caching☆102Updated 10 months ago
- This is the official PyTorch implementation of "ZipAR: Accelerating Auto-regressive Image Generation through Spatial Locality"☆47Updated last month
- TokLIP: Marry Visual Tokens to CLIP for Multimodal Comprehension and Generation☆20Updated last week
- ☆44Updated last week
- [ICML 2024] Memory-Space Visual Prompting for Efficient Vision-Language Fine-Tuning☆49Updated last year
- LLaVA-PruMerge: Adaptive Token Reduction for Efficient Large Multimodal Models☆127Updated last year
- [ACL 2023] PuMer: Pruning and Merging Tokens for Efficient Vision Language Models☆29Updated 7 months ago
- A paper list about Token Merge, Reduce, Resample, Drop for MLLMs.☆54Updated 4 months ago
- The this is the official implementation of "DAPE: Data-Adaptive Positional Encoding for Length Extrapolation"☆38Updated 7 months ago
- ☆54Updated last year
- [ICLR 2025] The official pytorch implement of "Dynamic-LLaVA: Efficient Multimodal Large Language Models via Dynamic Vision-language Cont…☆36Updated 5 months ago
- Curated list of methods that focuses on improving the efficiency of diffusion models☆44Updated 10 months ago
- Codebase for the paper-Elucidating the design space of language models for image generation☆45Updated 6 months ago