AutoMoE: Neural Architecture Search for Efficient Sparsely Activated Transformers
☆48Oct 21, 2022Updated 3 years ago
Alternatives and similar repositories for AutoMoE
Users that are interested in AutoMoE are comparing it to the libraries listed below
Sorting:
- ☆27Jul 25, 2023Updated 2 years ago
- C++ functions for reading and writing .pfm images to and from opencv Mat object☆12Jul 9, 2015Updated 10 years ago
- ☆27Jan 14, 2025Updated last year
- Efficient-Tensor-Management-on-HM-for-Deep-Learning☆10Nov 15, 2021Updated 4 years ago
- Android Native API's For Android JS☆10Oct 14, 2022Updated 3 years ago
- Example of binding a TF32 CUTLASS GEMM kernel to PyTorch☆12Jun 7, 2024Updated last year
- [ICML 2022] "Linearity Grafting: Relaxed Neuron Pruning Helps Certifiable Robustness" by Tianlong Chen*, Huan Zhang*, Zhenyu Zhang, Shiyu…☆17Jun 22, 2022Updated 3 years ago
- Official code for "Too Brittle To Touch: Comparing the Stability of Quantization and Distillation Towards Developing Lightweight Low-Reso…☆18Oct 9, 2025Updated 5 months ago
- A lightweight reimplementation of Adversarially Trained Actor Critic☆19Sep 11, 2023Updated 2 years ago
- ☆18Nov 6, 2019Updated 6 years ago
- [SDM'23] ML4C: Seeing Causality Through Latent Vicinity☆14Nov 9, 2022Updated 3 years ago
- A self-supervised learning approach based on extremely large masking☆31Dec 19, 2022Updated 3 years ago
- Trajectory tracking control for wheeled mobile robots in a robot soccer field using Fuzzy Logic.☆12Jul 18, 2021Updated 4 years ago
- ☆19Sep 15, 2022Updated 3 years ago
- This is the implementation code for the WWW2021 paper "Variation Control and Evaluation for Generative Slate Recommendation"☆15Jun 7, 2021Updated 4 years ago
- A Cluster-Wide Model Manager to Accelerate DNN Training via Automated Training Warmup☆36Jan 9, 2023Updated 3 years ago
- Repository that contains the code for the paper titled, 'Unifying Distillation with Personalization in Federated Learning'.☆13May 31, 2021Updated 4 years ago
- Extracting Cultural Commonsense Knowledge at Scale (WWW 2023)☆11Feb 15, 2024Updated 2 years ago
- [ICLR 2025] Drop-Upcycling: Training Sparse Mixture of Experts with Partial Re-initialization☆25Oct 5, 2025Updated 5 months ago
- ☆14Feb 24, 2026Updated 3 weeks ago
- ATC23 AE☆46May 11, 2023Updated 2 years ago
- A pytorch implementation of focal loss☆10Jan 9, 2020Updated 6 years ago
- Navigation Turing Test (NTT): Learning to Evaluate Human-Like Navigation [ICML 2021]☆15Jul 17, 2025Updated 8 months ago
- Transformer-based Long Document Classification☆17Nov 2, 2022Updated 3 years ago
- Code and data for Veridicality classifier on Twitter☆11May 23, 2018Updated 7 years ago
- Imitation learning from multiple experts☆13Aug 29, 2022Updated 3 years ago
- ☆10Nov 21, 2023Updated 2 years ago
- Lottery Tickets in Evolutionary Optimization (Lange & Sprekeler, ICML 2023)☆17Jun 2, 2023Updated 2 years ago
- Continual Multi-agent Reinforcement Learning in Dynamic Environments☆11Jul 1, 2021Updated 4 years ago
- Source code to reproduce experiments from Mendez et al., ICLR '22☆22Jul 29, 2022Updated 3 years ago
- Data-driven offline simulation for online reinforcement learning: benchmark and baselines☆31Jul 25, 2024Updated last year
- Runtime for deep learning workload☆21May 24, 2022Updated 3 years ago
- ☆18Jun 5, 2024Updated last year
- Edutainment game teaching players concepts around machine learning☆15Feb 18, 2020Updated 6 years ago
- CSuite: A Suite of Benchmark Datasets for Causality☆82May 9, 2023Updated 2 years ago
- GitHub action to update the artifact of a plan within the Azure partner center offer.☆21Feb 8, 2025Updated last year
- This is the implementation of the TextNAS algorithm proposed in the paper TextNAS: A Neural Architecture Search Space tailored for Text R…☆15Nov 28, 2022Updated 3 years ago
- Quick useful examples of data science & ML & big data☆15Jun 12, 2023Updated 2 years ago
- Python library for real-time control of a robotic manipulator☆21Feb 7, 2023Updated 3 years ago