openai / gym-wikinavLinks

Wikipedia navigation environment for OpenAI Gym

☆40

Alternatives and similar repositories for gym-wikinav

Users that are interested in gym-wikinav are comparing it to the libraries listed below

Sorting:

openai / neural-gpu
Code for the Neural GPU model originally described in "Neural GPUs Learn Algorithms"
☆144Updated 6 years ago
google-deepmind / spaceship_dataset
Dataset for the spaceship task from "Metacontrol for Adaptive Imagination-Based Optimization"
☆56Updated 8 years ago
openai / doom-py
ViZDoom Python wrapper
☆74Updated 2 years ago
openai / retro-contest
OpenAI Retro Contest
☆65Updated 2 years ago
openai / gym-recording
Add-on package to gym, to record sequences of actions, observations, and rewards
☆74Updated 2 years ago
openai / sonic-on-ray
Training Sonic with RLlib
☆59Updated 2 years ago
openai / pachi-py
Python wrappers for Pachi. Contains a modified version of the bleeding-edge Pachi source code.
☆40Updated 2 years ago
matpalm / malmomo
tensorflow deep RL hacking on minecraft with malmo
☆54Updated 8 years ago
bbitmaster / ale_python_interface
A Python Interface for the Arcade Learning Environment (Shared Object)
☆128Updated 4 years ago
google-deepmind / xitari
This is the 0.4 release of the Arcade Learning Environment (ALE), a platform designed for AI research. ALE is based on Stella, an Atari 2…
☆160Updated 7 years ago
openai / rosbridge
[deprecated] Bridge from Gym to ROS robots
☆73Updated 2 years ago
matpalm / cartpoleplusplus
3d cartpole gym env using bullet physics trained from pixels with tensorflow LRPG, DDPG & NAF
☆58Updated 8 years ago
flyyufelix / Direct-Future-Prediction-Keras
Direct Future Prediction (DFP ) in Keras
☆109Updated 7 years ago
kvfrans / parallel-trpo
A parallel version of Trust Region Policy Optimization
☆65Updated 8 years ago
ilyasu123 / trpo
☆19Updated 9 years ago
wojzaremba / trpo_rnn
☆20Updated 9 years ago
karpathy / tf-agent
tensorflow reinforcement learning agents for OpenAI gym environments
☆117Updated 8 years ago
tambetm / gymexperiments
☆28Updated 6 years ago
georgesung / deep_rl_acrobot
TensorFlow A2C to solve Acrobot, with synchronized parallel environments
☆35Updated 7 years ago
openai / pyconfigatron
a python3 compatible pyconfigatron
☆8Updated 8 years ago
google-deepmind / unsup-queries-data
Unsupervised Data Generated for GeoQuery and SAIL Datasets
☆46Updated 8 years ago
awjuliani / RL-CC
Web-based Reinforcement Learning Control Center
☆64Updated 8 years ago
wojzaremba / trpo
☆101Updated 8 years ago
jsikyoon / programmable-agents_tensorflow
Tensorflow Implementation of Programmable Agents
☆35Updated 7 years ago
microsoft / malmo-challenge
Task and example code for the Malmo Collaborative AI Challenge
☆154Updated 2 years ago
chrodan / tdlearn
some common TD Learning algorithms
☆66Updated 5 years ago
cosmoharrigan / neuroevolution
Neuroevolution as a direct policy search deep reinforcement learning method, implemented using Keras and DEAP.
☆71Updated 4 years ago
rll / deeprlhw2
☆24Updated 9 years ago
dbobrenko / async-deeprl
Playing Atari games with TensorFlow implementation of Asynchronous Deep Q-Learning
☆42Updated 7 years ago
openai / EPG
Code for the paper "Evolved Policy Gradients"
☆250Updated 6 years ago