Variation of "Asynchronous Methods for Deep Reinforcement Learning" with multiple processes generating experience for agent (Keras + Theano + OpenAI Gym)[1-step Q-learning, n-step Q-learning, A3C]
☆44Feb 27, 2018Updated 8 years ago
Alternatives and similar repositories for async-rl
Users that are interested in async-rl are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Asynchronous Advantage Actor Critic☆20Aug 15, 2016Updated 9 years ago
- A3C tensorflow implementation☆11Jul 22, 2018Updated 7 years ago
- Implementation for ACER in tensorflow and sonnet by deepmind☆11Aug 28, 2017Updated 8 years ago
- PyOblige is Python wrapper for OBLIGE - random level generator for Doom☆11Jul 2, 2018Updated 7 years ago
- Using a paper from Google DeepMind I've developed a new version of the DQN using threads exploration instead of memory replay as explain …☆84Mar 4, 2016Updated 10 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Model-Free Episodic Control☆14Jan 12, 2017Updated 9 years ago
- Duel_DDQN (Dueling Network Architectures + Double DQN) using Keras☆30Jun 26, 2016Updated 9 years ago
- Asynchronous Methods for Deep Reinforcement Learning☆588Aug 9, 2018Updated 7 years ago
- A3C-LSTM algorithm tested on CartPole OpenAI Gym environment☆48Jul 4, 2018Updated 7 years ago
- A Tensorflow based implementation of "Asynchronous Methods for Deep Reinforcement Learning": https://arxiv.org/abs/1602.01783☆68Oct 28, 2016Updated 9 years ago
- Accompanying repository for Let's make a DQN / A3C series.☆394Sep 4, 2018Updated 7 years ago
- KEras Reinforcement Learning gYM agents☆291Jul 8, 2017Updated 8 years ago
- Avoiding catastrophic failures in reinforcement learning by learning to shape rewards.☆10Nov 13, 2017Updated 8 years ago
- Python implementation of tabular asynchronous actor critic☆11May 3, 2016Updated 10 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- IJCAI 2019 - Regularized Opponent Model with Maximum Entropy Objective (ROMMEO)☆23Dec 8, 2022Updated 3 years ago
- Open AI Gym version of Berkeley AI Pacman with images as states☆13May 4, 2018Updated 8 years ago
- Python based Deep CNN Q-Learner for FOREX☆26Oct 29, 2015Updated 10 years ago
- Tensorflow Implementation of Programmable Agents☆35Sep 25, 2017Updated 8 years ago
- A python class to extract current and historical data from famous Yahoo Finance API☆12Feb 28, 2019Updated 7 years ago
- Reinforcement learning with unsupervised auxiliary tasks☆422Feb 13, 2019Updated 7 years ago
- reinforcement learning. policy gradient. PCL☆37Apr 25, 2017Updated 9 years ago
- Estimating stock price correlations using Wikipedia☆25May 11, 2016Updated 10 years ago
- Multiagent Cooperation and Competition with Deep Reinforcement Learning☆123Nov 26, 2015Updated 10 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- A3C style Option-Critic with deliberation cost☆40Jan 9, 2018Updated 8 years ago
- Gym - 32 levels of original Super Mario Bros☆290Dec 21, 2018Updated 7 years ago
- Advantage async actor-critic Algorithms (A3C) and Progressive Neural Network implemented by tensorflow.☆121Oct 12, 2016Updated 9 years ago
- Tensorflow + Keras + OpenAI Gym implementation of 1-step Q Learning from "Asynchronous Methods for Deep Reinforcement Learning"☆1,006Mar 18, 2018Updated 8 years ago
- Deep Learning library for Python. Convnets, recurrent neural networks, and more. Runs on Theano or TensorFlow.☆12Dec 24, 2016Updated 9 years ago
- SCAN: Learning Abstract Hierarchical Compositional Visual Concepts☆53Oct 29, 2017Updated 8 years ago
- A TypeSpec Emitter creating Typescript from Models and generating a structured routes object for HTTP APIs.☆18Jan 30, 2026Updated 3 months ago
- Basic DQN implementation☆226Dec 28, 2017Updated 8 years ago
- Use Asynchronous advantage actor-critic algorithm (A3C) to play Flappy Bird using Keras☆39Aug 26, 2017Updated 8 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- A starter agent that can solve a number of universe environments.☆1,103Apr 7, 2018Updated 8 years ago
- Implementation of Multi-Agent Deep Deterministic Policy Gradients☆39Mar 28, 2018Updated 8 years ago
- An attempt at implementing ideas in "Learning to Transduce with Unbounded Memory" (http://arxiv.org/abs/1506.02516)☆11Jul 27, 2016Updated 9 years ago
- TensorFlow A2C to solve Acrobot, with synchronized parallel environments☆35Apr 21, 2018Updated 8 years ago
- This is an implimentation of Value Iteration Networks (NIPS2016 best paper) in keras☆17Jan 6, 2018Updated 8 years ago
- Github Action to scrape an RSS feed to display on a Github Pages website☆12May 1, 2025Updated last year
- "Continuous Deep Q-Learning with Model-based Acceleration" in TensorFlow☆192Jul 20, 2018Updated 7 years ago