openai / gym-wikinavLinks
Wikipedia navigation environment for OpenAI Gym
☆42Updated 2 years ago
Alternatives and similar repositories for gym-wikinav
Users that are interested in gym-wikinav are comparing it to the libraries listed below
Sorting:
- ViZDoom Python wrapper☆76Updated 2 years ago
- Code for the Neural GPU model originally described in "Neural GPUs Learn Algorithms"☆145Updated 6 years ago
- Dataset for the spaceship task from "Metacontrol for Adaptive Imagination-Based Optimization"☆56Updated 8 years ago
- Python wrappers for Pachi. Contains a modified version of the bleeding-edge Pachi source code.☆42Updated 2 years ago
- Add-on package to gym, to record sequences of actions, observations, and rewards☆76Updated 2 years ago
- OpenAI Retro Contest☆67Updated 2 years ago
- Training Sonic with RLlib☆61Updated 2 years ago
- tensorflow deep RL hacking on minecraft with malmo☆54Updated 8 years ago
- This is the 0.4 release of the Arcade Learning Environment (ALE), a platform designed for AI research. ALE is based on Stella, an Atari 2…☆160Updated 8 years ago
- A Python Interface for the Arcade Learning Environment (Shared Object)☆129Updated 5 years ago
- Task and example code for the Malmo Collaborative AI Challenge☆154Updated 3 years ago
- [deprecated] Bridge from Gym to ROS robots☆74Updated 2 years ago
- Direct Future Prediction (DFP ) in Keras☆109Updated 8 years ago
- A parallel version of Trust Region Policy Optimization☆65Updated 8 years ago
- A lua wrapper for the Arcade Learning Environment/xitari.☆34Updated 9 years ago
- Web-based Reinforcement Learning Control Center☆65Updated 9 years ago
- a python3 compatible pyconfigatron☆10Updated 9 years ago
- ☆120Updated 5 years ago
- Gym - Doom environments based on VizDoom.☆103Updated 8 years ago
- ☆38Updated 9 years ago
- 3d cartpole gym env using bullet physics trained from pixels with tensorflow LRPG, DDPG & NAF☆58Updated 8 years ago
- Add-on for OpenAI Gym that supports automatic downloading of user environments.☆45Updated 8 years ago
- Code for the paper "Continuous Adaptation via Meta-Learning in Nonstationary and Competitive Environments"☆310Updated 2 years ago
- Publicly releasable baselines for the Retro contest☆130Updated 6 years ago
- ☆101Updated 9 years ago
- Code for the blog post "Learning Montezuma’s Revenge from a Single Demonstration"☆34Updated 6 years ago
- Python implementation of tabular asynchronous actor critic☆11Updated 9 years ago
- Code for the paper "Curiosity-driven Exploration in Deep Reinforcement Learning via Bayesian Neural Networks"☆350Updated 6 years ago
- Code for the paper "Evolved Policy Gradients"☆254Updated 6 years ago
- TensorFlow A2C to solve Acrobot, with synchronized parallel environments☆35Updated 7 years ago