XiaoxiaoGuo/atari_uct

Upper Confidence Tree Planner for ATARI games

☆19

Alternatives and similar repositories for atari_uct

Users who are interested in atari_uct are comparing it to the libraries listed below.

Spider-scnu / Monte-Carlo-tree-search-for-TSP
This is the source code for solving the Traveling Salesman Problem (TSP) using Monte Carlo tree search (MCTS).
☆35 · Sep 25, 2019 · Updated 6 years ago
mihahauke / deep_rl_vizdoom
Deep reinforcement learning in ViZDoom (using TensorFlow)
☆19 · Jan 25, 2018 · Updated 8 years ago
jeanharb / a2oc_delib
A3C-style Option-Critic with deliberation cost
☆40 · Jan 9, 2018 · Updated 8 years ago
google-research / policy-learning-landscape
Explore the optimization landscape for direct policy learning reinforcement learning.
☆51 · Jan 16, 2019 · Updated 7 years ago
mwydmuch / PyOblige
PyOblige is a Python wrapper for OBLIGE, a random level generator for Doom
☆11 · Jul 2, 2018 · Updated 8 years ago
mudassirej / DQN-with-tensorflow-in-Gazebo
Autonomous visual navigation using depth images
☆11 · Aug 15, 2019 · Updated 6 years ago
koulanurag / dream-and-search
Code for "Dream and Search to Control: Latent Space Planning for Continuous Control"
☆12 · Jul 12, 2021 · Updated 5 years ago
wingsweihua / hellosumo
Very, very simple run on SUMO
☆13 · May 14, 2018 · Updated 8 years ago
JuliaPOMDP / TabularTDLearning.jl
Julia implementations of temporal difference Reinforcement Learning algorithms like Q-Learning and SARSA
☆12 · Nov 16, 2025 · Updated 8 months ago
glassices / sokoban_planner
☆10 · Jun 29, 2021 · Updated 5 years ago
junhyukoh / nips2015-action-conditional-video-prediction
Implementation of "Action-Conditional Video Prediction using Deep Networks in Atari Games"
☆114 · Feb 8, 2016 · Updated 10 years ago
CatherineMeng / FGYM-user-demo
Demonstrating the usage of FGYM: A Toolkit for benchmarking FPGA-accelerated Reinforcement Learning
☆14 · Aug 12, 2021 · Updated 4 years ago
tianbingsz / SVRG
Stochastic Variance Reduction Policy Gradient Estimation
☆11 · Nov 6, 2018 · Updated 7 years ago
musyoku / double-dqn
Chainer implementation of Double Deep Q-Network (Double DQN)
☆27 · Mar 30, 2016 · Updated 10 years ago
rajammanabrolu / KG-A2C
Goal-driven language generation using knowledge graph A2C agents
☆62 · Mar 3, 2020 · Updated 6 years ago
ibrahim-elshar / gym-windy-gridworlds
Windy GridWorlds environments compatible with OpenAI Gym.
☆15 · Jul 8, 2022 · Updated 4 years ago
ricardoGrando / hydrone_deep_rl_jint
☆14 · May 10, 2021 · Updated 5 years ago
mihahauke / VDAIC2017
Helpful files for Visual Doom AI Competition 2017
☆44 · Jun 21, 2018 · Updated 8 years ago
sammiekatt / fba-pomdp
Factored model-based Bayesian Reinforcement Learning framework
☆21 · Nov 23, 2022 · Updated 3 years ago
dorlivne / PoPS
PoPS algorithm
☆15 · Dec 8, 2022 · Updated 3 years ago
sudeepraja / Model-Free-Episodic-Control
Implementation of the Model-Free Episodic Control paper by DeepMind: http://arxiv.org/abs/1606.04460
☆52 · Jul 25, 2016 · Updated 10 years ago
rangl-labs / netzerotc
☆11 · Jul 15, 2022 · Updated 4 years ago
liuanji / WU-UCT
A novel parallel UCT algorithm with linear speedup and negligible performance loss.
☆124 · Apr 26, 2021 · Updated 5 years ago
wwxFromTju / sc2-101-zh
Just for fun
☆22 · Sep 10, 2017 · Updated 8 years ago
h4ri98 / UAV-Path-planning-python
A* and RRT implementations using matplotlib
☆10 · May 24, 2020 · Updated 6 years ago
KornbergFresnel / CommNet
An implementation of CommNet
☆35 · Nov 14, 2017 · Updated 8 years ago
zzyunzhi / asynch-mb
(CoRL 2019 Spotlight) Asynchronous Methods for Model-Based Reinforcement Learning
☆14 · Dec 27, 2022 · Updated 3 years ago
illidanlab / rpg
Ranking Policy Gradient
☆23 · Nov 27, 2019 · Updated 6 years ago
oxwhirl / treeqn
☆93 · Nov 15, 2019 · Updated 6 years ago
annieyan / Bandits-using-UCB-algorithm
Thompson Sampling for Bandits using UCB policy
☆10 · Jul 29, 2017 · Updated 8 years ago
gliese581gg / batch-A3C_tensorflow
Modified TensorFlow implementation of 'Asynchronous Methods for Deep Reinforcement Learning'
☆21 · Dec 15, 2016 · Updated 9 years ago
igsor / HDPy
Heuristic Dynamic Programming with Python
☆14 · Jul 28, 2014 · Updated 11 years ago
philipjhj / TDP-MSRC-AI-Challenge
☆20 · May 24, 2017 · Updated 9 years ago
jeanharb / option_critic
Implementation of the Option-Critic Architecture on the Atari (ALE) environment
☆183 · Sep 21, 2017 · Updated 8 years ago
thomas-schillaci / SimPLe
PyTorch implementation of SimPLe (Simulated Policy Learning) on the Atari 100k benchmark.
☆17 · Dec 7, 2022 · Updated 3 years ago
sii-yingwen / rommeo
IJCAI 2019 - Regularized Opponent Model with Maximum Entropy Objective (ROMMEO)
☆23 · Dec 8, 2022 · Updated 3 years ago
mklissa / PPOC
Proximal Policy Option-Critic
☆26 · Jan 4, 2019 · Updated 7 years ago
nips2018axiomatic / Mapping-Images-to-Scene-Graphs-master
Mapping Images to Scene Graphs with Permutation-Invariant Structured Prediction
☆12 · Aug 1, 2018 · Updated 7 years ago
BigBayes / SGMCMC.jl
Stochastic Gradient Markov Chain Monte Carlo and Optimisation
☆17 · Mar 21, 2017 · Updated 9 years ago