eminorhan / mixture-of-experts
Mixture of experts layers for Keras
☆94Updated 6 years ago
Alternatives and similar repositories for mixture-of-experts
Users that are interested in mixture-of-experts are comparing it to the libraries listed below
- Multi-Task Learning package built with TensorFlow 2 (Multi-Gate Mixture of Experts, Cross-Stitch, Uncertainty Weighting)☆52Updated 5 years ago
- Collection of TensorFlow Examples☆37Updated 6 years ago
- Custom Optimizer in TensorFlow (define your own TensorFlow Optimizer)☆66Updated 5 years ago
- [Code] Deep Multi-task Representation Learning: A Tensor Factorisation Approach☆58Updated 7 years ago
- Simple Tensorflow implementation of "On the Convergence of Adam and Beyond" (ICLR 2018)☆104Updated 6 years ago
- Code for paper "Exploration in Online Advertising Systems with Deep Uncertainty-Aware Learning"☆63Updated last year
- A PyTorch implementation of Ranking Distillation☆90Updated 4 years ago
- Pytorch implementation of λOpt: Learn to Regularize Recommender Models in Finer Levels, KDD 2019☆53Updated 4 years ago
- Cyclic learning rate TensorFlow implementation.☆66Updated 6 years ago
- Multi Task Learning Implementation with Homoscedastic Uncertainty in Tensorflow☆53Updated 6 years ago
- Official MXNet code for 'Collaborative Deep Learning for Recommender Systems' - SIGKDD☆53Updated 3 years ago
- Implementation of soft parameter sharing for neural networks☆69Updated 4 years ago
- Tensorflow implementation of conditional variational auto-encoder for MNIST☆150Updated 8 years ago
- Keras Implementation for "Deep & Cross Network for Ad Click Predictions"☆93Updated 3 years ago
- Multi heads attention for image classification☆80Updated 7 years ago
- Implementation of Ladder Network in PyTorch.☆45Updated 7 years ago
- A Tensorflow implementation of the paper arXiv:1604.03539☆130Updated 6 years ago
- A chainer implementation of Memory Augmented Neural Network☆48Updated 8 years ago
- ☆81Updated 6 years ago
- Auto Encoders in PyTorch☆63Updated 7 years ago
- Lifelong sequential modeling for user response prediction. A comprehensive evaluation framework for our SIGIR 2019 paper.☆102Updated 4 years ago
- Code for "Active One-shot Learning"☆33Updated 6 years ago
- Accelerating Inference for Recommendation Systems (WSDM'21)☆112Updated 4 years ago
- Tensorflow Implementation of One-Shot Learning with Memory Augmented Neural Network☆47Updated 7 years ago
- This is my demo of Chen et al. "GradNorm: Gradient Normalization for Adaptive Loss Balancing in Deep Multitask Networks" ICML 2018☆178Updated 3 years ago
- Code for "Gradient Surgery for Multi-Task Learning"☆323Updated 5 years ago
- Some CTR models implemented in PyTorch, such as Factorization Machines, Field-aware Factorization Machines, DeepFM, xDeepFM, Deep Interes…☆70Updated 6 years ago
- Implementation in Keras of: Snapshot Ensembles: Train 1, get M for free (https://arxiv.org/abs/1704.00109)☆26Updated 6 years ago
- ☆18Updated 4 years ago
- Re-implementation of the Noise Contrastive Estimation algorithm for pyTorch, following "Noise-contrastive estimation: A new estimation pr…☆45Updated 5 years ago