eng-amrahmed / reptile-tf2Links
A TensorFlow 2.0 with eager execution implementation of Pytorch OpenAI few-shot regression toy example
☆16Updated 6 years ago
Alternatives and similar repositories for reptile-tf2
Users that are interested in reptile-tf2 are comparing it to the libraries listed below
Sorting:
- TensorFlow 2.0 implementation of MAML.☆83Updated 6 years ago
- Feature selection for maximizing expected cumulative reward☆30Updated 7 years ago
- References at the Intersection of Causality and Reinforcement Learning☆89Updated 5 years ago
- Efficient Exploration through Bayesian Deep Q-Networks☆37Updated 7 years ago
- Code for AAAI 2018 accepted paper: "Beyond Sparsity: Tree Regularization of Deep Models for Interpretability"☆78Updated 7 years ago
- State Space Models for Reinforcement Learning in Tensorflow☆19Updated 6 years ago
- ☆29Updated 5 years ago
- Online Deep Learning: Learning Deep Neural Networks on the Fly / Non-linear Contextual Bandit Algorithm (ONN_THS)☆187Updated 5 years ago
- In this notebook several classes of multi-armed bandits are implemented. This includes epsilon greedy, UCB, Linear UCB (Contextual bandit…☆88Updated 4 years ago
- Adaptive Neural Trees☆155Updated 6 years ago
- Maximum Causal Entropy Inverse Reinforcement Learning☆48Updated 6 years ago
- Code associated with the NeurIPS19 paper "Weighted Linear Bandits in Non-Stationary Environments"☆17Updated 5 years ago
- Implementation of Deep Temporal Clustering.☆74Updated 2 years ago
- ☆53Updated 5 years ago
- Reinforcement Learning implementations and research prototyping in TensorFlow☆82Updated 6 years ago
- Thompson Sampling Tutorial☆54Updated 6 years ago
- Gated Recurrent Unit with a Decay mechanism for Multivariate Time Series with Missing Values☆119Updated 6 years ago
- Code for performing 3 multitask machine learning methods: deep neural networks, Multitask Multi-kernel Learning (MTMKL), and a hierarchic…☆132Updated 3 years ago
- Kernel Change-point Detection with Auxiliary Deep Generative Models (ICLR 2019 paper)☆59Updated 2 years ago
- Reimplementation of simple policy gradient algorithms such as REINFORCE and Actor-Critic methods.☆13Updated 2 years ago
- Greedy Gaussian Segmentation☆100Updated 2 years ago
- Study NeuralUCB and regret analysis for contextual bandit with neural decision☆96Updated 3 years ago
- Deep Neural Network Ensembles for Time Series Classification☆111Updated 2 years ago
- ☆83Updated 6 years ago
- Implementation of "A Simple Neural Attentive Meta-Learner" (SNAIL, https://arxiv.org/pdf/1707.03141.pdf) in PyTorch☆148Updated 6 years ago
- Modular PyTorch implementation of policy gradient methods☆25Updated 6 years ago
- Code of ICML-2020 paper Dynamic Knapsack Optimization Towards Efficient Multi-Channel Sequential Advertising☆26Updated 5 years ago
- A modular toolbox for meta-learning research with a focus on speed and reproducibility.☆125Updated 2 years ago
- This repository contains code for the paper: https://arxiv.org/abs/1905.03806. It also contains scripts to reproduce the results in the p…☆168Updated 5 years ago
- ☆185Updated 7 years ago