eminorhan / mixture-of-experts
Mixture of experts layers for Keras
☆94Updated 6 years ago
Alternatives and similar repositories for mixture-of-experts:
Users that are interested in mixture-of-experts are comparing it to the libraries listed below
- [Code] Deep Multi-task Representation Learning: A Tensor Factorisation Approach☆58Updated 7 years ago
- Multi-Task Learning package built with tensorflow 2 (Multi-Gate Mixture of Experts, Cross-Stitch, Ucertainty Weighting)☆51Updated 5 years ago
- Multi Task Learning Implementation with Homoscedastic Uncertainty in Tensorflow☆53Updated 6 years ago
- ZForcing Repo☆40Updated 7 years ago
- Implementation of soft parameter sharing for neural networks☆69Updated 4 years ago
- Code release for "Learning Multiple Tasks with Multilinear Relationship Networks" (NIPS 2017)☆70Updated 7 years ago
- Code for "Active One-shot Learning"☆33Updated 6 years ago
- pytorch neural network attention mechanism☆147Updated 6 years ago
- ☆61Updated 2 years ago
- Custom Optimizer in TensorFlow(定义你自己的Tensorflow Optimizer)☆66Updated 5 years ago
- Code for paper "Exploration in Online Advertising Systems with Deep Uncertainty-Aware Learning"☆63Updated last year
- Pytorch implementation of λOpt: Learn to Regularize Recommender Models in Finer Levels, KDD 2019☆53Updated 4 years ago
- A PyTorch implementation of Ranking Distillation☆89Updated 4 years ago
- Latent Alignment and Variational Attention☆327Updated 6 years ago
- Tensorflow Implementation of One-Shot Learning with Memory Augmented Neural Network☆47Updated 7 years ago
- ☆25Updated 6 years ago
- Collection of TensorFlow Examples☆37Updated 6 years ago
- 6️⃣6️⃣6️⃣ Reproduce ICLR '18 under-reviewed paper "MULTI-TASK LEARNING ON MNIST IMAGE DATASETS"☆41Updated 6 years ago
- Simple Tensorflow implementation of "On the Convergence of Adam and Beyond" (ICLR 2018)☆104Updated 6 years ago
- Replication of Semi-Supervised Learning with Deep Generative Models☆98Updated 8 years ago
- Tensorflow implementation of "Representation Learning with Contrastive Predictive Coding"☆64Updated 6 years ago
- Multi heads attention for image classification☆81Updated 6 years ago
- Official MXNet code for 'Collaborative Deep Learning for Recommender Systems' - SIGKDD☆53Updated 3 years ago
- Keras Implementation for "Deep & Cross Network for Ad Click Predictions"☆93Updated 3 years ago
- Code for "Gradient Surgery for Multi-Task Learning"☆315Updated 4 years ago
- Cyclic learning rate TensorFlow implementation.☆66Updated 6 years ago
- Re-implementation of the Noise Contrastive Estimation algorithm for pyTorch, following "Noise-contrastive estimation: A new estimation pr…☆45Updated 5 years ago
- Lifelong sequential modeling for user response prediction. A comprehensive evaluation framework for our SIGIR 2019 paper.☆102Updated 4 years ago
- Multi-Task Learning in NLP☆94Updated 7 years ago
- Efficient Neural Interaction Functions Search for Collaborative Filtering☆18Updated 5 years ago