eminorhan / mixture-of-experts

Mixture of experts layers for Keras

☆95

Alternatives and similar repositories for mixture-of-experts

Users who are interested in mixture-of-experts are comparing it to the libraries listed below.

johnsmithm / multi-heads-attention-image-classification
Multi heads attention for image classification
☆80 Updated 7 years ago
benoitdescamps / Neural-Tree
Tensorflow implementation of a Tree
☆36 Updated 6 years ago
AmazaspShumik / mtlearn
Multi-Task Learning package built with tensorflow 2 (Multi-Gate Mixture of Experts, Cross-Stitch, Uncertainty Weighting)
☆52 Updated 5 years ago
hardianlawi / MTL-Homoscedastic-Uncertainty
Multi Task Learning Implementation with Homoscedastic Uncertainty in Tensorflow
☆53 Updated 6 years ago
mhmoodlan / cyclic-learning-rate
Cyclic learning rate TensorFlow implementation.
☆66 Updated 6 years ago
duchao0726 / DUAL
Code for paper "Exploration in Online Advertising Systems with Deep Uncertainty-Aware Learning"
☆64 Updated 2 years ago
thomlake / pytorch-attention
pytorch neural network attention mechanism
☆147 Updated 6 years ago
graytowne / rank_distill
A PyTorch implementation of Ranking Distillation
☆90 Updated 4 years ago
taki0112 / AMSGrad-Tensorflow
Simple Tensorflow implementation of "On the Convergence of Adam and Beyond" (ICLR 2018)
☆104 Updated 6 years ago
wOOL / DMTRL
[Code] Deep Multi-task Representation Learning: A Tensor Factorisation Approach
☆58 Updated 8 years ago
lolemacs / soft-sharing
Implementation of soft parameter sharing for neural networks
☆69 Updated 4 years ago
hav4ik / Hydra
Multi-Task Learning Framework on PyTorch. State-of-the-art methods are implemented to effectively train models on multiple tasks.
☆149 Updated 6 years ago
Kyubyong / label_smoothing
Corrupted labels and label smoothing
☆129 Updated 7 years ago
yihong-chen / lambda-opt
Pytorch implementation of λOpt: Learn to Regularize Recommender Models in Finer Levels, KDD 2019
☆53 Updated 5 years ago
arthurdouillard / keras-snapshot_ensembles
Implementation in Keras of: Snapshot Ensembles: Train 1, get M for free (https://arxiv.org/abs/1704.00109)
☆26 Updated 6 years ago
MattKleinsmith / pbt
Population Based Training (in PyTorch with sqlite3). Status: Unsupported
☆162 Updated 7 years ago
js05212 / MXNet-for-CDL
Official MXNet code for 'Collaborative Deep Learning for Recommender Systems' - SIGKDD
☆53 Updated 3 years ago
collinprather / SlateQ
A comparison of Google SlateQ algorithm with traditional Reinforcement Learning algorithms
☆37 Updated 2 years ago
siavash-khodadadeh / MetaLearning-TF2.0
Meta learning framework with Tensorflow 2.0
☆118 Updated 2 years ago
nw2190 / TensorFlow_Examples
Collection of TensorFlow Examples
☆37 Updated 7 years ago
luochuwei / Custom-Optimizer-in-TensorFlow
Custom Optimizer in TensorFlow (define your own TensorFlow Optimizer)
☆66 Updated 5 years ago
JGuillaumin / swa-tf
Stochastic Weight Averaging - TensorFlow implementation
☆33 Updated 6 years ago
WayneDW / DeepLight_Deep-Lightweight-Feature-Interactions
Accelerating Inference for Recommendation Systems (WSDM'21)
☆112 Updated 4 years ago
zhengying-liu / autodl_starting_kit_stable
Starting kit for AutoCV/AutoDL challenge (https://autodl.chalearn.org)
☆40 Updated 5 years ago
gaohuang / SnapshotEnsemble
Snapshot Ensembles in Torch (Snapshot Ensembles: Train 1, Get M for Free)
☆189 Updated 8 years ago
jingxil / Neural-Decision-Forests
An implementation of the Deep Neural Decision Forests in PyTorch
☆163 Updated 6 years ago
taki0112 / RAdam-Tensorflow
Simple Tensorflow implementation of "On The Variance Of The Adaptive Learning Rate And Beyond"
☆97 Updated 5 years ago
cbvrp-acmmm-2019 / cbvrp-acmmm-2019
Website for CBVRP Grand Challenge in ACM Multimedia 2019
☆32 Updated 5 years ago
mengf1 / PAL
Policy based Active Learning with DQN (EMNLP-2017)
☆90 Updated 7 years ago
hosseinshn / GradNorm
This is my demo of Chen et al. "GradNorm: Gradient Normalization for Adaptive Loss Balancing in Deep Multitask Networks" ICML 2018
☆179 Updated 3 years ago