ambisinister / lossfreebalanceLinks

toy reproduction of Auxiliary-Loss-Free Load Balancing Strategy for Mixture-of-Experts

☆15

Alternatives and similar repositories for lossfreebalance

Users that are interested in lossfreebalance are comparing it to the libraries listed below

Sorting:

ML-GSAI / Diffusion-LLM-Papers
A Collection of Papers on Diffusion Language Models
☆81Updated this week
wutaiqiang / MoSLoRA
☆105Updated 11 months ago
zhijie-group / Orthus
☆37Updated last month
techmonsterwang / iLLaMA
Adapting LLaMA Decoder to Vision Transformer
☆28Updated last year
yuecao0119 / MMInstruct
[SCIS 2024] The official implementation of the paper "MMInstruct: A High-Quality Multi-Modal Instruction Tuning Dataset with Extensive Di…
☆54Updated 7 months ago
Kevinz-code / SeVa
[MM2024, oral] "Self-Supervised Visual Preference Alignment" https://arxiv.org/abs/2404.10501
☆55Updated 10 months ago
mrflogs / LoRA-Pro
Official code for our paper, "LoRA-Pro: Are Low-Rank Adapters Properly Optimized? "
☆120Updated 2 months ago
ywh187 / FitPrune
☆48Updated last month
OpenSparseLLMs / Skip-DiT
✈️ Towards Stabilized and Efficient Diffusion Transformers through Long-Skip-Connections with Spectral Constraints
☆69Updated 2 months ago
DavidFanzz / SCMoE
☆26Updated last year
SUSTechBruce / LOOK-M
[EMNLP 2024 Findings🔥] Official implementation of ": LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context In…
☆97Updated 7 months ago
OpenGVLab / Mono-InternVL
[CVPR 2025] Mono-InternVL: Pushing the Boundaries of Monolithic Multimodal Large Language Models with Endogenous Visual Pre-training
☆46Updated 2 months ago
ThisisBillhe / ZipAR
[ICML 2025] This is the official PyTorch implementation of "ZipAR: Accelerating Auto-regressive Image Generation through Spatial Locality…
☆49Updated 2 months ago
ApexGen-X / MergeVQ
[CVPR] MergeVQ: A Unified Framework for Visual Generation and Representation with Token Merging and Quantization
☆31Updated last week
OpenSparseLLMs / MoM
☆84Updated 2 months ago
Chongjie-Si / Subspace-Tuning
A generalized framework for subspace tuning methods in parameter efficient fine-tuning.
☆142Updated 4 months ago
chuanyang-Zheng / DAPE
The this is the official implementation of "DAPE: Data-Adaptive Positional Encoding for Length Extrapolation"
☆38Updated 8 months ago
JinXins / Awesome-Token-Merge-for-MLLMs
A paper list about Token Merge, Reduce, Resample, Drop for MLLMs.
☆62Updated 5 months ago
NUS-HPC-AI-Lab / Dynamic-Tuning
The official implementation of "2024NeurIPS Dynamic Tuning Towards Parameter and Inference Efficiency for ViT Adaptation"
☆46Updated 5 months ago
TencentARC / GVT
Official code for "What Makes for Good Visual Tokenizers for Large Language Models?".
☆58Updated last year
ShadeCloak / ADORA
☆46Updated 2 months ago
OpenGVLab / PVC
[CVPR 2025] PVC: Progressive Visual Token Compression for Unified Image and Video Processing in Large Vision-Language Models
☆42Updated last week
RifleZhang / LLaVA-Reasoner-DPO
☆78Updated 5 months ago
lzhxmu / VTW
Code release for VTW (AAAI 2025) Oral
☆43Updated 5 months ago
NUS-TRAIL / NoisyRollout
NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation
☆67Updated 2 weeks ago
UMass-Embodied-AGI / Mod-Squad
☆90Updated 2 years ago
findalexli / mllm-dpo
[ACL 2024] Multi-modal preference alignment remedies regression of visual instruction tuning on language model
☆46Updated 7 months ago
horseee / dKV-Cache
☆82Updated last month
TencentARC / pi-Tuning
Official code for "pi-Tuning: Transferring Multimodal Foundation Models with Optimal Multi-task Interpolation", ICML 2023.
☆33Updated last year
JieShibo / MemVP
[ICML 2024] Memory-Space Visual Prompting for Efficient Vision-Language Fine-Tuning
☆49Updated last year