eminorhan / mixture-of-experts
Mixture of experts layers for Keras
☆94Updated 6 years ago
Alternatives and similar repositories for mixture-of-experts
Users that are interested in mixture-of-experts are comparing it to the libraries listed below
Sorting:
- Multi-Task Learning package built with tensorflow 2 (Multi-Gate Mixture of Experts, Cross-Stitch, Ucertainty Weighting)☆52Updated 5 years ago
- [Code] Deep Multi-task Representation Learning: A Tensor Factorisation Approach☆58Updated 7 years ago
- Website for CBVRP Grand Challenge in ACM Multimedia 2019☆32Updated 5 years ago
- Multi Task Learning Implementation with Homoscedastic Uncertainty in Tensorflow☆53Updated 6 years ago
- Simple Tensorflow implementation of "On the Convergence of Adam and Beyond" (ICLR 2018)☆104Updated 6 years ago
- Cyclic learning rate TensorFlow implementation.☆66Updated 6 years ago
- A PyTorch implementation of Ranking Distillation☆90Updated 4 years ago
- Multi-Task Learning Framework on PyTorch. State-of-the-art methods are implemented to effectively train models on multiple tasks.☆149Updated 5 years ago
- PyTorch implementations of LSTM Variants (Dropout + Layer Norm)☆136Updated 4 years ago
- Stochastic Weight Averaging - TensorFlow implementation☆33Updated 6 years ago
- Code for "Active One-shot Learning"☆33Updated 6 years ago
- Code release for "Learning Multiple Tasks with Multilinear Relationship Networks" (NIPS 2017)☆70Updated 7 years ago
- ☆81Updated 6 years ago
- Code for paper "Exploration in Online Advertising Systems with Deep Uncertainty-Aware Learning"☆63Updated last year
- Official MXNet code for 'Collaborative Deep Learning for Recommender Systems' - SIGKDD☆53Updated 3 years ago
- Custom Optimizer in TensorFlow(定义你自己的Tensorflow Optimizer)☆66Updated 5 years ago
- Lifelong sequential modeling for user response prediction. A comprehensive evaluation framework for our SIGIR 2019 paper.☆102Updated 4 years ago
- Implementation of soft parameter sharing for neural networks☆69Updated 4 years ago
- One-Shot Learning using Nearest-Neighbor Search (NNS) and Locality-Sensitive Hashing LSH☆74Updated 6 years ago
- Implementation of Ladder Network in PyTorch.☆45Updated 7 years ago
- Pytorch implementation of λOpt: Learn to Regularize Recommender Models in Finer Levels, KDD 2019☆53Updated 4 years ago
- A Tensorflow implementation of the paper arXiv:1604.03539☆130Updated 6 years ago
- Keras callback function for stochastic weight averaging☆56Updated 2 years ago
- This in my Demo of Chen et al. "GradNorm: Gradient Normalization for Adaptive Loss Balancing in Deep Multitask Networks" ICML 2018☆178Updated 3 years ago
- Code for "Gradient Surgery for Multi-Task Learning"☆321Updated 5 years ago
- ☆61Updated 2 years ago
- train models in pytorch, Learn to Rank, Collaborative Filter, Heterogeneous Treatment Effect, Uplift Modeling, etc☆182Updated 3 months ago
- Accelerating Inference for Recommendation Systems (WSDM'21)☆112Updated 4 years ago
- Models and examples built with Chainer☆119Updated 2 years ago
- some ctr model, implemented by PyTorch, such as Factorization Machines, Field-aware Factorization Machines, DeepFM, xDeepFM, Deep Interes…☆70Updated 6 years ago