choosewhatulike / sparse-sharing
Codes for "Learning Sparse Sharing Architectures for Multiple Tasks"
☆93Updated 4 years ago
Alternatives and similar repositories for sparse-sharing:
Users that are interested in sparse-sharing are comparing it to the libraries listed below
- pytorch gpu memory check☆50Updated 5 years ago
- A Tensorflow implementation of the paper arXiv:1604.03539☆130Updated 6 years ago
- The project including MMOE, SNR_trans, SNR_avg, PLE, etc implemented by pytorch.☆134Updated 4 years ago
- The most complete list of AI top meetings☆87Updated 5 years ago
- The code of Encoding Word Order in Complex-valued Embedding☆42Updated 5 years ago
- Pytorch implementation of the GradNorm. GradNorm addresses the problem of balancing multiple losses for multi-task learning by learning a…☆264Updated 2 years ago
- KDD Cup 2020 Challenges for Modern E-Commerce Platform: Multimodalities Recall☆63Updated 4 years ago
- Code and dataset of AAAI2019 paper Hybrid Attention-Based Prototypical Networks for Noisy Few-Shot Relation Classification☆189Updated 6 years ago
- ☆19Updated 2 years ago
- KDD Cup 2020 Challenges for Modern E-Commerce Platform: Multimodalities Recall first place☆191Updated 4 years ago
- UDA(Unsupervised Data Augmentation) implemented by pytorch☆276Updated 5 years ago
- This is our solution for KDD Cup 2020. We implemented a very neat and simple neural ranking model based on siamese BERT which ranked firs…☆71Updated 4 years ago
- Official implementation of AAAI-21 paper "Label Confusion Learning to Enhance Text Classification Models"☆115Updated 2 years ago
- bert annotation, input and output for people from scratch, 代码注释, 有每一步的输入和输出, 适合初学者☆93Updated 2 years ago
- pytorch implementation for Patient Knowledge Distillation for BERT Model Compression☆202Updated 5 years ago
- This is a repository for Multi-task learning with toy data in Pytorch and Tensorflow☆136Updated 6 years ago
- MSc group project: Reproduction of 'Multi-Task Learning using Uncertainty to Weigh Losses for Scene Geometry and Semantics'; A. Kendall, …☆89Updated 5 years ago
- This in my Demo of Chen et al. "GradNorm: Gradient Normalization for Adaptive Loss Balancing in Deep Multitask Networks" ICML 2018☆176Updated 3 years ago
- ☆20Updated 4 years ago
- ☆90Updated 3 years ago
- ☆56Updated 2 years ago
- 2021 huawei DIGIX competition baseline☆69Updated 3 years ago
- Adversarial Training for Natural Language Understanding☆252Updated last year
- MetaBalance algorithm for multi-task learning☆58Updated 3 years ago
- multi task mode for esmm and mmoe☆145Updated 3 years ago
- 注意力机制on自然语言处理文章整理笔记☆170Updated 6 years ago
- ☆83Updated 5 years ago
- A general framework for knowledge distillation☆54Updated 4 years ago
- ☆9Updated 4 years ago
- 论文阅读以及笔记☆31Updated 4 years ago