eminorhan / mixture-of-experts
Mixture of experts layers for Keras
☆94Updated 6 years ago
Alternatives and similar repositories for mixture-of-experts:
Users who are interested in mixture-of-experts are comparing it to the libraries listed below
- Multi-Task Learning package built with tensorflow 2 (Multi-Gate Mixture of Experts, Cross-Stitch, Uncertainty Weighting)☆51Updated 5 years ago
- Simple Tensorflow implementation of "On the Convergence of Adam and Beyond" (ICLR 2018)☆104Updated 5 years ago
- [Code] Deep Multi-task Representation Learning: A Tensor Factorisation Approach☆58Updated 7 years ago
- Implementation of soft parameter sharing for neural networks☆69Updated 4 years ago
- A PyTorch implementation of Ranking Distillation☆88Updated 4 years ago
- Code release for "Learning Multiple Tasks with Multilinear Relationship Networks" (NIPS 2017)☆70Updated 7 years ago
- Multi Task Learning Implementation with Homoscedastic Uncertainty in Tensorflow☆52Updated 6 years ago
- Code for paper "Exploration in Online Advertising Systems with Deep Uncertainty-Aware Learning"☆63Updated last year
- Official MXNet code for 'Collaborative Deep Learning for Recommender Systems' - SIGKDD☆53Updated 3 years ago
- Custom Optimizer in TensorFlow (define your own Tensorflow Optimizer)☆66Updated 5 years ago
- 6️⃣6️⃣6️⃣ Reproduce ICLR '18 under-reviewed paper "MULTI-TASK LEARNING ON MNIST IMAGE DATASETS"☆41Updated 6 years ago
- A Tensorflow implementation of the paper arXiv:1604.03539☆127Updated 6 years ago
- Multi heads attention for image classification☆81Updated 6 years ago
- Policy based Active Learning with DQN (EMNLP-2017)☆88Updated 6 years ago
- ZForcing Repo☆40Updated 7 years ago
- Lifelong sequential modeling for user response prediction. A comprehensive evaluation framework for our SIGIR 2019 paper.☆102Updated 4 years ago
- A Tensorflow implementation of the Deep Listwise Context Model (DLCM) for ranking refinement.☆136Updated 2 years ago
- Efficient Transformers for research, PyTorch and Tensorflow using Locality Sensitive Hashing☆93Updated 5 years ago
- pytorch neural network attention mechanism☆147Updated 5 years ago
- ReGAN: Sequence GAN using RE[INFORCE|LAX|BAR] based PG estimators☆40Updated 6 years ago
- ☆61Updated last year
- Tensorflow Implementation of One-Shot Learning with Memory Augmented Neural Network☆47Updated 7 years ago
- Minimal Tensorflow implementation of the paper "Neural Architecture Search With Reinforcement Learning" presented at ICLR 2017☆41Updated 7 years ago
- Pytorch implementation of λOpt: Learn to Regularize Recommender Models in Finer Levels, KDD 2019☆53Updated 4 years ago
- A PyTorch implementation of the blocks from the _A Simple Neural Attentive Meta-Learner_ paper☆98Updated 6 years ago
- Cyclic learning rate TensorFlow implementation.☆66Updated 5 years ago
- Implementation in Keras of: Snapshot Ensembles: Train 1, get M for free (https://arxiv.org/abs/1704.00109)☆25Updated 6 years ago
- Code for "Gradient Surgery for Multi-Task Learning"☆311Updated 4 years ago
- Implementation of Ladder Network in PyTorch.☆45Updated 7 years ago
- Matching Networks for one-shot learning in tensorflow (NIPS'16)☆56Updated 6 years ago