Oliver-FutureAI / Awesome-MoE
Awesome list of Mixture-of-Experts (MoE)
☆18Updated 9 months ago
Alternatives and similar repositories for Awesome-MoE:
Users that are interested in Awesome-MoE are comparing it to the libraries listed below
- Official implementation for 'Class-Balancing Diffusion Models'☆52Updated 10 months ago
- A pytorch implementation of CVPR24 paper "D4M: Dataset Distillation via Disentangled Diffusion Model"☆28Updated 6 months ago
- [CVPR2024 highlight] Generalized Large-Scale Data Condensation via Various Backbone and Statistical Matching (G-VBSM)☆27Updated 5 months ago
- ☆10Updated 2 months ago
- The official implementation of "2024NeurIPS Dynamic Tuning Towards Parameter and Inference Efficiency for ViT Adaptation"☆43Updated 2 months ago
- source code for NeurIPS'23 paper "Dream the Impossible: Outlier Imagination with Diffusion Models"☆66Updated 2 months ago
- ☆106Updated last year
- [CVPR2024] Efficient Dataset Distillation via Minimax Diffusion☆91Updated last year
- Code for our ICML'24 on multimodal dataset distillation☆36Updated 5 months ago
- Give us minutes, we give back a faster Mamba. The official implementation of "Faster Vision Mamba is Rebuilt in Minutes via Merged Token …☆39Updated 3 months ago
- ICLR 2024, Towards Lossless Dataset Distillation via Difficulty-Aligned Trajectory Matching☆102Updated 10 months ago
- [NeurIPS'24 Oral] HydraLoRA: An Asymmetric LoRA Architecture for Efficient Fine-Tuning☆175Updated 3 months ago
- Data distillation benchmark☆58Updated this week
- [NeurIPS'23] DropPos: Pre-Training Vision Transformers by Reconstructing Dropped Positions☆61Updated 10 months ago
- (NeurIPS 2023 spotlight) Large-scale Dataset Distillation/Condensation, 50 IPC (Images Per Class) achieves the highest 60.8% on original …☆125Updated 4 months ago
- ☆42Updated 2 years ago
- ☆15Updated 9 months ago
- [ICLR 2025] SiMHand: Mining Similar Hands for Large-Scale 3D Hand Pose Pre-training☆16Updated last week
- [ICCV 2023] A Unified Continual Learning Framework with General Parameter-Efficient Tuning☆78Updated 5 months ago
- The official github repo for "Test-Time Training with Masked Autoencoders"☆80Updated last year
- Distilling Large Vision-Language Model with Out-of-Distribution Generalizability (ICCV 2023)☆56Updated 11 months ago
- ☆42Updated last year
- This is the official PyTorch Implementation of "SoTTA: Robust Test-Time Adaptation on Noisy Data Streams (NeurIPS '23)" by Taesik Gong*, …☆20Updated last year
- NeurIPS'2022: Pluralistic Image Completion with Gaussian Mixture Models☆14Updated 2 years ago
- ☆127Updated 9 months ago
- Code release for Deep Incubation (https://arxiv.org/abs/2212.04129)☆90Updated 2 years ago
- [CVPR 2024] On the Diversity and Realism of Distilled Dataset: An Efficient Dataset Distillation Paradigm☆64Updated last month
- Diffusion-TTA improves pre-trained discriminative models such as image classifiers or segmentors using pre-trained generative models.☆69Updated last year
- ☆28Updated 2 years ago
- Code for "Training on Thin Air: Improve Image Classification with Generated Data"☆45Updated last year