openai / gym-wikinav
Wikipedia navigation environment for OpenAI Gym
☆40Updated last year
Alternatives and similar repositories for gym-wikinav:
Users that are interested in gym-wikinav are comparing it to the libraries listed below
- Training Sonic with RLlib☆59Updated last year
- ViZDoom Python wrapper☆74Updated last year
- OpenAI Retro Contest☆65Updated last year
- Add-on package to gym, to record sequences of actions, observations, and rewards☆72Updated last year
- Code for the Neural GPU model originally described in "Neural GPUs Learn Algorithms"☆138Updated 6 years ago
- [deprecated] Bridge from Gym to ROS robots☆74Updated last year
- Code for the blog post "Learning Montezuma’s Revenge from a Single Demonstration"☆32Updated 6 years ago
- Dataset for the spaceship task from "Metacontrol for Adaptive Imagination-Based Optimization"☆56Updated 7 years ago
- Add-on for OpenAI Gym that supports automatic downloading of user environments.☆45Updated 7 years ago
- Code for the paper "World of Bits: An Open-Domain Platform for Web-Based Agents"☆29Updated 6 years ago
- A Python Interface for the Arcade Learning Environment (Shared Object)☆126Updated 4 years ago
- Python wrappers for Pachi. Contains a modified version of the bleeding-edge Pachi source code.☆41Updated last year
- tensorflow deep RL hacking on minecraft with malmo☆54Updated 8 years ago
- A lua wrapper for the Arcade Learning Environment/xitari.☆34Updated 8 years ago
- ☆27Updated 6 years ago
- ☆24Updated 9 years ago
- ☆28Updated 5 years ago
- ☆13Updated 9 years ago
- A parallel version of Trust Region Policy Optimization☆65Updated 8 years ago
- Retask is a simple task queue implementation written for human beings. It provides generic solution to create and manage task queues.☆16Updated 8 years ago
- Our NIPS 2017: Learning to Run source code☆55Updated last year
- ☆19Updated 8 years ago
- Playing Atari games with TensorFlow implementation of Asynchronous Deep Q-Learning☆42Updated 6 years ago
- tensorflow reinforcement learning agents for OpenAI gym environments☆113Updated 7 years ago
- This is the 0.4 release of the Arcade Learning Environment (ALE), a platform designed for AI research. ALE is based on Stella, an Atari 2…☆160Updated 7 years ago
- TensorFlow A2C to solve Acrobot, with synchronized parallel environments☆35Updated 6 years ago
- Python implementation of tabular asynchronous actor critic☆11Updated 8 years ago
- A collection of reading material for the Workshop on "Structure & Priors in Reinforcement Learning" (SPiRL) at ICLR 2019.☆13Updated 3 years ago
- A2C for GVG-AI☆21Updated 6 years ago
- Unsupervised Data Generated for GeoQuery and SAIL Datasets☆46Updated 8 years ago