j2kun / ucb1Links

The code for the post "Optimism in the Face of Uncertainty: the UCB1 Algorithm"

☆37

Alternatives and similar repositories for ucb1

Users that are interested in ucb1 are comparing it to the libraries listed below

Sorting:

sisl / Chimp
General purpose framework for deep reinforcement learning
☆71Updated 8 years ago
arnomoonens / yarll
Combining deep learning and reinforcement learning.
☆80Updated 3 years ago
devsisters / neural-combinatorial-rl-tensorflow
in progress
☆108Updated 8 years ago
eparisotto / ActorMimic
Train an RL agent to play multiple Atari games at once
☆69Updated 9 years ago
tor / libbandit
Library for Multi-Armed Bandit Algorithms
☆58Updated 8 years ago
dustinvtran / bayesrl
A Python library for reinforcement learning using Bayesian approaches
☆54Updated 10 years ago
shakedshammah / failures_of_DL
☆90Updated 7 years ago
5vision / DARQN
Deep Attention Recurrent Q-Network
☆115Updated 9 years ago
moskomule / pytorch.rl.learning
for learning reinforcement learning using PyTorch.
☆64Updated 5 years ago
mhauskn / dqn-hfo
☆79Updated 6 years ago
tanmayshankar / RCNN_MDP
Code base for solving Markov Decision Processes and Reinforcement Learning problems using Recurrent Convolutional Neural Networks.
☆69Updated 7 years ago
nutszebra / neural_architecture_search_with_reinforcement_learning_appendix_a
Implementation of Appendix A (Neural Architecture Search with Reinforcement Learning: https://arxiv.org/abs/1611.01578) by chainer
☆55Updated 6 years ago
Islandman93 / reinforcepy
Collection of reinforcement learners implemented in python. Mainly including DQN and its variants
☆54Updated 8 years ago
renmengye / tensorflow-forward-ad
Forward-mode Automatic Differentiation for TensorFlow
☆139Updated 7 years ago
Breakend / DeepReinforcementLearningThatMatters
Accompanying code for "Deep Reinforcement Learning that Matters"
☆152Updated 7 years ago
itaicaspi / keras-dqn-doom
Keras implementation of DQN on ViZDoom environment
☆54Updated 8 years ago
deepsense-ai / Distributed-BA3C
☆56Updated 2 years ago
openai / baselines-results
☆117Updated 4 years ago
ehknight / natural-gradient-deep-q-learning
☆22Updated 6 years ago
unixpickle / anyrl-py
A reinforcement learning framework
☆155Updated 6 years ago
bigaidream-projects / drmad
DrMAD
☆107Updated 7 years ago
jn2clark / ReinforcementLearning
☆88Updated 8 years ago
siemens / policy_search_bb-alpha
☆69Updated 7 years ago
jonathonbyrd / deep_rl_ale
An implementation of Deep Reinforcement Learning / Deep Q-Networks for Atari games in TensorFlow
☆74Updated 8 years ago
Zeta36 / Asynchronous-Methods-for-Deep-Reinforcement-Learning
Using a paper from Google DeepMind I've developed a new version of the DQN using threads exploration instead of memory replay as explain …
☆84Updated 9 years ago
uvadlc / uvadlc_practicals_2016
Repository for practical assignments for UvA Deep Learning Course 2016
☆51Updated 7 years ago
Kaixhin / NoisyNet-A3C
Noisy Networks for Exploration
☆186Updated 7 years ago
go2sea / C51DQN
A TensorFlow implementation of DeepMind's A Distributional Perspective on Reinforcement Learning.(C51-DQN)
☆57Updated 7 years ago
runopti / Learning-To-Learn
TensorFlow implementation of the paper "Learning to learn by gradient descent by gradient descent ( https://arxiv.org/abs/1606.04474 )"
☆84Updated 8 years ago
rgilman33 / baselines-A2C
[DEPRECATED] Advantage Actor Critic model in PyTorch inspired by OpenAI baselines TensorFlow implementation
☆53Updated 5 years ago