Oliver-FutureAI / Awesome-MoELinks
Awesome list of Mixture-of-Experts (MoE)
☆21Updated last year
Alternatives and similar repositories for Awesome-MoE
Users that are interested in Awesome-MoE are comparing it to the libraries listed below
Sorting:
- [CVPR2024] Efficient Dataset Distillation via Minimax Diffusion☆94Updated last year
- Code for our ICML'24 on multimodal dataset distillation☆37Updated 9 months ago
- Code for ICML 2024 paper (Oral) — Test-Time Model Adaptation with Only Forward Passes☆81Updated 10 months ago
- [ICLR 2024 (Spotlight)] "Frozen Transformers in Language Models are Effective Visual Encoder Layers"☆241Updated last year
- A pytorch implementation of CVPR24 paper "D4M: Dataset Distillation via Disentangled Diffusion Model"☆32Updated 10 months ago
- ICLR 2024, Towards Lossless Dataset Distillation via Difficulty-Aligned Trajectory Matching☆101Updated last year
- [ICML 2023] On Pitfalls of Test-Time Adaptation☆120Updated last year
- (NeurIPS 2023 spotlight) Large-scale Dataset Distillation/Condensation, 50 IPC (Images Per Class) achieves the highest 60.8% on original …☆129Updated 8 months ago
- ☆29Updated 2 years ago
- Code for ICLR 2023 paper (Oral) — Towards Stable Test-Time Adaptation in Dynamic Wild World☆191Updated last year
- Official implementation for 'Class-Balancing Diffusion Models'☆55Updated last year
- [CVPR 2025 (Oral)] Mitigating Hallucinations in Large Vision-Language Models via DPO: On-Policy Data Hold the Key☆64Updated last month
- [ICLR 2025 Oral🔥] SD-LoRA: Scalable Decoupled Low-Rank Adaptation for Class Incremental Learning☆44Updated 3 weeks ago
- Efficient Dataset Distillation by Representative Matching☆112Updated last year
- [CVPR 2024] On the Diversity and Realism of Distilled Dataset: An Efficient Dataset Distillation Paradigm☆72Updated 4 months ago
- The official github repo for "Test-Time Training with Masked Autoencoders"☆86Updated last year
- ☆91Updated 2 years ago
- [TPAMI 2024] Probabilistic Contrastive Learning for Long-Tailed Visual Recognition☆82Updated 9 months ago
- source code for NeurIPS'23 paper "Dream the Impossible: Outlier Imagination with Diffusion Models"☆69Updated 3 months ago
- Code for ICML 2022 paper — Efficient Test-Time Model Adaptation without Forgetting☆128Updated 2 years ago
- Distilling Dataset into Generative Models☆54Updated 2 years ago
- ☆111Updated last year
- [CVPR2024 highlight] Generalized Large-Scale Data Condensation via Various Backbone and Statistical Matching (G-VBSM)☆28Updated 9 months ago
- ☆16Updated last year
- ☆114Updated 2 years ago
- [ICCV 2023 Oral] Official Implementation of "Denoising Diffusion Autoencoders are Unified Self-supervised Learners"☆171Updated last year
- ☆39Updated 8 months ago
- AAAI 2024, M3D: Dataset Condensation by Minimizing Maximum Mean Discrepancy☆25Updated last year
- [NeurIPS'23] DropPos: Pre-Training Vision Transformers by Reconstructing Dropped Positions☆61Updated last year
- [NeurIPS'24 Oral] HydraLoRA: An Asymmetric LoRA Architecture for Efficient Fine-Tuning☆216Updated 7 months ago