Oliver-FutureAI / Awesome-MoELinks
Awesome list of Mixture-of-Experts (MoE)
☆24Updated last year
Alternatives and similar repositories for Awesome-MoE
Users that are interested in Awesome-MoE are comparing it to the libraries listed below
Sorting:
- [CVPR2024] Efficient Dataset Distillation via Minimax Diffusion☆98Updated last year
- Code for our ICML'24 on multimodal dataset distillation☆41Updated last year
- Official implementation for 'Class-Balancing Diffusion Models'☆54Updated last year
- ☆30Updated 2 years ago
- [ICML 2023] On Pitfalls of Test-Time Adaptation☆124Updated last year
- [TPAMI 2024] Probabilistic Contrastive Learning for Long-Tailed Visual Recognition☆87Updated last year
- [NeurIPS 2023] Generalized Logit Adjustment☆39Updated last year
- Diffusion-TTA improves pre-trained discriminative models such as image classifiers or segmentors using pre-trained generative models.☆78Updated last year
- [CVPR2024 highlight] Generalized Large-Scale Data Condensation via Various Backbone and Statistical Matching (G-VBSM)☆28Updated last year
- [ICLR 2024 (Spotlight)] "Frozen Transformers in Language Models are Effective Visual Encoder Layers"☆245Updated last year
- ☆113Updated last year
- [ICLR 2025] "Noisy Test-Time Adaptation in Vision-Language Models"☆17Updated 9 months ago
- [CVPR 2024] On the Diversity and Realism of Distilled Dataset: An Efficient Dataset Distillation Paradigm☆77Updated 9 months ago
- Official repo of M$^2$PT: Multimodal Prompt Tuning for Zero-shot Instruction Learning☆27Updated 8 months ago
- ☆22Updated last year
- ICLR 2024, Towards Lossless Dataset Distillation via Difficulty-Aligned Trajectory Matching☆104Updated last year
- Code for ICML 2024 paper (Oral) — Test-Time Model Adaptation with Only Forward Passes☆91Updated last year
- [NeurIPS'23] DropPos: Pre-Training Vision Transformers by Reconstructing Dropped Positions☆62Updated last year
- [ICLR 2024] SemiReward: A General Reward Model for Semi-supervised Learning☆74Updated 3 weeks ago
- ☆16Updated last year
- This is the official PyTorch Implementation of "SoTTA: Robust Test-Time Adaptation on Noisy Data Streams (NeurIPS '23)" by Taesik Gong*, …☆22Updated last year
- The official implementation of "2024NeurIPS Dynamic Tuning Towards Parameter and Inference Efficiency for ViT Adaptation"☆51Updated 11 months ago
- A pytorch implementation of CVPR24 paper "D4M: Dataset Distillation via Disentangled Diffusion Model"☆36Updated last year
- (NeurIPS 2023 spotlight) Large-scale Dataset Distillation/Condensation, 50 IPC (Images Per Class) achieves the highest 60.8% on original …☆131Updated last year
- Efficient Dataset Distillation by Representative Matching☆113Updated last year
- Hierarchical Decomposition of Prompt-Based Continual Learning: Rethinking Obscured Sub-optimality (NeurIPS 2023, Spotlight)☆88Updated last year
- ☆91Updated 2 years ago
- [CVPR 2025 (Oral)] Mitigating Hallucinations in Large Vision-Language Models via DPO: On-Policy Data Hold the Key☆87Updated 2 months ago
- Official implementation of Dancing with Still Images: Video Distillation via Static-Dynamic Disentanglement.☆30Updated last year
- Instruction Tuning in Continual Learning paradigm☆66Updated 9 months ago