AutoMoE: Neural Architecture Search for Efficient Sparsely Activated Transformers
☆48Oct 21, 2022Updated 3 years ago
Alternatives and similar repositories for AutoMoE
Users that are interested in AutoMoE are comparing it to the libraries listed below
Sorting:
- [NeurIPS2023] "Selectivity Drives Productivity: Efficient Dataset Pruning for Enhanced Transfer Learning" by Yihua Zhang*, Yimeng Zhang*,…☆14Oct 12, 2023Updated 2 years ago
- [ACL'25] Mosaic-IT: Cost-Free Compositional Data Synthesis for Instruction Tuning☆20Sep 27, 2025Updated 5 months ago
- [ICML 2022] "Linearity Grafting: Relaxed Neuron Pruning Helps Certifiable Robustness" by Tianlong Chen*, Huan Zhang*, Zhenyu Zhang, Shiyu…☆17Jun 22, 2022Updated 3 years ago
- [SDM'23] ML4C: Seeing Causality Through Latent Vicinity☆14Nov 9, 2022Updated 3 years ago
- Trajectory tracking control for wheeled mobile robots in a robot soccer field using Fuzzy Logic.☆12Jul 18, 2021Updated 4 years ago
- The official repository for the experiments included in the paper titled "Patch-level Routing in Mixture-of-Experts is Provably Sample-ef…☆13Feb 12, 2026Updated 2 weeks ago
- Boosting Natural Language Generation from Instructions with Meta-Learning☆11Dec 20, 2022Updated 3 years ago
- Continual Multi-agent Reinforcement Learning in Dynamic Environments☆11Jul 1, 2021Updated 4 years ago
- Official code for "Too Brittle To Touch: Comparing the Stability of Quantization and Distillation Towards Developing Lightweight Low-Reso…☆18Oct 9, 2025Updated 4 months ago
- Fine-tuning, DPO, RLHF, RLAIF on LLMs - Qwen3, Zephyr 7B GPTQ with 4-Bit Quantization, Mistral-7B-GPTQ☆15Jul 5, 2025Updated 7 months ago
- microsoft / Azure-Databricks-Solution-Accelerator-Financial-Analytics-Customer-Revenue-Growth-Factor☆14Nov 4, 2022Updated 3 years ago
- This repository contains the resources used for presentation/discussion in weekly iRE Lab meetings.☆14Sep 8, 2017Updated 8 years ago
- Azure Object Detection Accelerator. A repo for quickly and easily setting up a sample object detection project with training, labelling, …☆20May 23, 2023Updated 2 years ago
- Lottery Tickets in Evolutionary Optimization (Lange & Sprekeler, ICML 2023)☆17Jun 2, 2023Updated 2 years ago
- ☆19Sep 15, 2022Updated 3 years ago
- This repository contains the publishable code for CVPR 2021 paper TransNAS-Bench-101: Improving Transferrability and Generalizability of …☆24Apr 11, 2023Updated 2 years ago
- ☆18Nov 6, 2019Updated 6 years ago
- Federated Bilevel Optimization☆16Jun 23, 2022Updated 3 years ago
- Code release for "Supported Policy Optimization for Offline Reinforcement Learning" (NeurIPS 2022), https://arxiv.org/abs/2202.06239☆23Jun 24, 2023Updated 2 years ago
- GitHub action to update the artifact of a plan within the Azure partner center offer.☆21Feb 8, 2025Updated last year
- PyTorch implementation for "Long Horizon Temperature Scaling", ICML 2023☆20May 31, 2023Updated 2 years ago
- 🔥MixPro: Data Augmentation with MaskMix and Progressive Attention Labeling for Vision Transformer [Official, ICLR 2023]☆22Nov 3, 2023Updated 2 years ago
- A TensorFlow implementation of SOFA, the Simulator for OFfline LeArning and evaluation.☆21Nov 29, 2020Updated 5 years ago
- Source code to reproduce experiments from Mendez et al., ICLR '22☆22Jul 29, 2022Updated 3 years ago
- ☆23Jun 7, 2023Updated 2 years ago
- The collection of the research works about Automatic Reinforcement Learning in Microsoft Research Asia.☆61Jul 22, 2025Updated 7 months ago
- Fault-aware neural code rankers☆32Dec 9, 2022Updated 3 years ago
- A self-supervised learning approach based on extremely large masking☆31Dec 19, 2022Updated 3 years ago
- Representation Learning and Pairwise Ranking for Implicit Feedback in Top-N Item Recommendation☆23Dec 26, 2017Updated 8 years ago
- ☆26Mar 17, 2023Updated 2 years ago
- Lite Self-Training☆30Jul 25, 2023Updated 2 years ago
- Official PyTorch implementation of "Meta-prediction Model for Distillation-Aware NAS on Unseen Datasets" (ICLR 2023 notable top 25%)☆26Mar 18, 2024Updated last year
- A block pruning framework for LLMs.☆28May 17, 2025Updated 9 months ago
- ☆30Jun 22, 2020Updated 5 years ago
- This repo is the official implementation of "Mask-based Latent Reconstruction for Reinforcement Learning" (NeurIPS 2022).☆29Jul 6, 2023Updated 2 years ago
- Authors' PyTorch implementation of 'Recomposing the Reinforcement Learning Building-Blocks with Hypernetworks' (HypeRL)☆26Jun 9, 2021Updated 4 years ago
- ☆31Jun 28, 2022Updated 3 years ago
- Dataset with coverage annotations for HumanEval dataset☆24Aug 17, 2023Updated 2 years ago
- Factorized Neural Layers☆31Jul 11, 2023Updated 2 years ago