openai / gym-wikinav
Wikipedia navigation environment for OpenAI Gym
☆40Updated 2 years ago
Alternatives and similar repositories for gym-wikinav:
Users that are interested in gym-wikinav are comparing it to the libraries listed below
- Training Sonic with RLlib☆59Updated 2 years ago
- Add-on package to gym, to record sequences of actions, observations, and rewards☆72Updated 2 years ago
- OpenAI Retro Contest☆65Updated 2 years ago
- [deprecated] Bridge from Gym to ROS robots☆73Updated 2 years ago
- ViZDoom Python wrapper☆74Updated 2 years ago
- Code for the Neural GPU model originally described in "Neural GPUs Learn Algorithms"☆141Updated 6 years ago
- Add-on for OpenAI Gym that supports automatic downloading of user environments.☆45Updated 7 years ago
- Dataset for the spaceship task from "Metacontrol for Adaptive Imagination-Based Optimization"☆56Updated 8 years ago
- Python wrappers for Pachi. Contains a modified version of the bleeding-edge Pachi source code.☆41Updated 2 years ago
- Code for the paper "World of Bits: An Open-Domain Platform for Web-Based Agents"☆30Updated 6 years ago
- Backprop training of recurrent neural networks with Hebbian plastic connections☆20Updated 3 years ago
- ☆19Updated 9 years ago
- tensorflow deep RL hacking on minecraft with malmo☆54Updated 8 years ago
- A lua wrapper for the Arcade Learning Environment/xitari.☆34Updated 8 years ago
- 3d cartpole gym env using bullet physics trained from pixels with tensorflow LRPG, DDPG & NAF☆58Updated 8 years ago
- A platform of grid world that supports up to 1 million reinforcement-learning agents.☆69Updated 7 years ago
- ☆24Updated 9 years ago
- Code for Emergent Translation in Multi-Agent Communication☆80Updated 6 years ago
- ☆117Updated 4 years ago
- Unsupervised Data Generated for GeoQuery and SAIL Datasets☆46Updated 8 years ago
- Code for the blog post "Learning Montezuma’s Revenge from a Single Demonstration"☆33Updated 6 years ago
- Asynchronous Advantage Actor Critic☆20Updated 8 years ago
- Publicly releasable baselines for the Retro contest☆127Updated 6 years ago
- TensorFlow implementation of Value Iteration Networks (VIN): Clean, Simple and Modular☆52Updated 8 years ago
- a python3 compatible pyconfigatron☆8Updated 8 years ago
- D-NTM paper repo☆25Updated 7 years ago
- A parallel version of Trust Region Policy Optimization☆65Updated 8 years ago
- Universal library for deep reinforcement learning.☆38Updated 9 years ago
- Surprise-based intrinsic motivation for deep reinforcement learning☆20Updated 8 years ago
- Tensorflow Implementation of Programmable Agents☆35Updated 7 years ago