YunqiuXu / my_notes
Are you afraid :D
☆22Updated 5 years ago
Alternatives and similar repositories for my_notes:
Users that are interested in my_notes are comparing it to the libraries listed below
- simple reinforcement learning example for the minecraft☆9Updated 6 years ago
- This is a tutorial written for Caffe2 which mocks google AlphaGo Fan and AlphaGo Zero.☆8Updated 6 years ago
- ☆8Updated 8 years ago
- tensorflow_serving inception gRPC client☆12Updated 8 years ago
- Ranking Policy Gradient☆23Updated 5 years ago
- LIBBLE by Parameter Server☆17Updated 6 years ago
- TensorFlow and deep learning without a PhD, translated to Chinese☆17Updated 8 years ago
- AISTATS 2019: Reference-based Adversarial Sampling & Its applications to Soft Q-learning☆15Updated 6 years ago
- A2C for GVG-AI☆21Updated 6 years ago
- Robust policy search algorithms which train on model ensembles☆28Updated 8 years ago
- Hybrid Linear UCB Multi-arm Bandit library☆14Updated 8 years ago
- Repo containing to-dos and instructions for DRL in POMDPs.jl☆11Updated 8 years ago
- Implementation of Counterfactual risk minimization☆26Updated 7 years ago
- Pytorch-based python library for continuous reinforcement learning and imitation learning [superseded by @osudrl/apex]☆13Updated 5 years ago
- Code to reproduce Supervised Policy Update (ICLR 2019)☆17Updated 2 years ago
- Actor Critic using Kronecker-Factored Trust Region☆19Updated 6 years ago
- Full Chainer implementation of OpenAI's Reinforcement Learning using Random Network Distillation☆31Updated 5 years ago
- tag doc using topN words with lda☆10Updated 9 years ago
- Distributed Tensorflow Implementation of Asynchronous Methods for Deep Reinforcement Learning☆29Updated 7 years ago
- Reinforcement learning in 3D.☆21Updated 8 years ago
- reinforcement learning. policy gradient. PCL☆37Updated 7 years ago
- Favorite AI papers☆16Updated 7 years ago
- A training and testing framework supporting experiments in CIKM 2016 paper "User Response Learning for Directly Optimizing Campaign Perfo…☆25Updated 6 years ago
- ☆53Updated 8 years ago
- Imitation Learning Homework 1☆36Updated 7 years ago
- Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees☆93Updated 5 years ago
- Amazon Recommendation System build on BPR TensorFlow implementation☆16Updated 7 years ago
- This is a paper list for recent studies on optimization algorithms.☆12Updated 6 years ago
- Predict and recommend the news articles, user is most likely to click in real time.☆30Updated 7 years ago
- From Word Embeddings to Item Recommendation☆10Updated last year