VITA-Group / UMECLinks
[ICLR 2021] "UMEC: Unified Model and Embedding Compression for Efficient Recommendation Systems" by Jiayi Shen, Haotao Wang*, Shupeng Gui*, Jianchao Tan, Zhangyang Wang, and Ji Liu
☆39Updated 3 years ago
Alternatives and similar repositories for UMEC
Users that are interested in UMEC are comparing it to the libraries listed below
Sorting:
- Zero-Shot Knowledge Distillation in Deep Networks☆67Updated 3 years ago
- Code for paper "Continual and Multi-Task Architecture Search (ACL 2019)"☆41Updated 6 years ago
- ☆52Updated 4 years ago
- The official implementation of You Only Compress Once: Towards Effective and Elastic BERT Compression via Exploit-Explore Stochastic Natu…☆48Updated 4 years ago
- Unsupervised Data Augmentation experiments in PyTorch☆60Updated 6 years ago
- Zero-Shot Knowledge Distillation in Deep Networks in ICML2019☆49Updated 6 years ago
- Implementing SYNTHESIZER: Rethinking Self-Attention in Transformer Models using Pytorch☆70Updated 5 years ago
- NeurIPS'19: Meta-Weight-Net: Learning an Explicit Mapping For Sample Weighting (Pytorch implementation for class imbalance).☆34Updated 5 years ago
- Code for SelfAugment☆27Updated 4 years ago
- [ICML 2020] code for "PowerNorm: Rethinking Batch Normalization in Transformers" https://arxiv.org/abs/2003.07845☆120Updated 4 years ago
- NeurIPS'18: Masking: A New Perspective of Noisy Supervision☆54Updated 6 years ago
- Full implementation of the paper "Rethinking Softmax with Cross-Entropy: Neural Network Classifier as Mutual Information Estimator".☆102Updated 5 years ago
- (ICCV 2019 Oral) Many Task Learning With Task Routing http://openaccess.thecvf.com/content_ICCV_2019/html/Strezoski_Many_Task_Learning_Wi…☆64Updated 5 years ago
- [ACL-IJCNLP 2021] "EarlyBERT: Efficient BERT Training via Early-bird Lottery Tickets" by Xiaohan Chen, Yu Cheng, Shuohang Wang, Zhe Gan, …☆18Updated 3 years ago
- ☆42Updated 4 years ago
- Implementation of soft parameter sharing for neural networks☆69Updated 4 years ago
- official pytorch implementation of "Deep Metric Learning with Spherical Embedding", NeurIPS 2020☆41Updated 4 years ago
- (CVPR 2020) This repo contains code for "PADS: Policy-Adapted Sampling for Visual Similarity Learning", which proposes learnable triplet …☆61Updated 5 years ago
- Role-Wise Data Augmentation for Knowledge Distillation☆19Updated 2 years ago
- [CVPR 2021] "The Lottery Tickets Hypothesis for Supervised and Self-supervised Pre-training in Computer Vision Models" Tianlong Chen, Jon…☆69Updated 2 years ago
- Improving generalization by controlling label-noise information in neural network weights.☆40Updated 4 years ago
- Code for NeurIPS 2019 Paper, "L_DMI: An Information-theoretic Noise-robust Loss Function"☆119Updated 2 years ago
- Gradients as Features for Deep Representation Learning☆43Updated 5 years ago
- Learning To Stop While Learning To Predict☆34Updated 2 years ago
- [ICML 2020] code for the flooding regularizer proposed in "Do We Need Zero Training Loss After Achieving Zero Training Error?"☆92Updated 2 years ago
- Unofficial pytorch implementation of Born-Again Neural Networks.☆55Updated 4 years ago
- The implementation of multi-branch attentive Transformer (MAT).☆33Updated 4 years ago
- Code for paper 'Minimizing FLOPs to Learn Efficient Sparse Representations' published at ICLR 2020☆20Updated 5 years ago
- Knowledge Transfer via Distillation of Activation Boundaries Formed by Hidden Neurons (AAAI 2019)☆105Updated 5 years ago
- [CVPR 2020] Code for paper "AdversarialNAS: Adversarial Neural Architecture Search for GANs".☆71Updated last year