Variation of "Asynchronous Methods for Deep Reinforcement Learning" with multiple processes generating experience for agent (Keras + Theano + OpenAI Gym)[1-step Q-learning, n-step Q-learning, A3C]
☆44Feb 27, 2018Updated 8 years ago
Alternatives and similar repositories for async-rl
Users that are interested in async-rl are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Using Pilco algorithm to find a controller for few robotic problems☆43Jul 31, 2015Updated 10 years ago
- PyOblige is Python wrapper for OBLIGE - random level generator for Doom☆11Jul 2, 2018Updated 7 years ago
- Model-Free Episodic Control☆14Jan 12, 2017Updated 9 years ago
- Duel_DDQN (Dueling Network Architectures + Double DQN) using Keras☆30Jun 26, 2016Updated 9 years ago
- Asynchronous Methods for Deep Reinforcement Learning☆588Aug 9, 2018Updated 7 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- A3C-LSTM algorithm tested on CartPole OpenAI Gym environment☆48Jul 4, 2018Updated 7 years ago
- A Tensorflow based implementation of "Asynchronous Methods for Deep Reinforcement Learning": https://arxiv.org/abs/1602.01783☆68Oct 28, 2016Updated 9 years ago
- Accompanying repository for Let's make a DQN / A3C series.☆394Sep 4, 2018Updated 7 years ago
- Avoiding catastrophic failures in reinforcement learning by learning to shape rewards.☆10Nov 13, 2017Updated 8 years ago
- Python implementation of tabular asynchronous actor critic☆11May 3, 2016Updated 10 years ago
- IJCAI 2019 - Regularized Opponent Model with Maximum Entropy Objective (ROMMEO)☆23Dec 8, 2022Updated 3 years ago
- Python based Deep CNN Q-Learner for FOREX☆26Oct 29, 2015Updated 10 years ago
- Deep Reinforcement Learning for Multi Agent Soccer☆16Dec 15, 2016Updated 9 years ago
- Tensorflow Implementation of Programmable Agents☆35Sep 25, 2017Updated 8 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- an implementation of reinforcement learning problem, stock prices☆10Dec 26, 2016Updated 9 years ago
- Reinforcement learning with unsupervised auxiliary tasks☆422Feb 13, 2019Updated 7 years ago
- Multiagent Cooperation and Competition with Deep Reinforcement Learning☆123Nov 26, 2015Updated 10 years ago
- starter kit for vizdoom2018-singleplayer track☆28Jul 29, 2018Updated 7 years ago
- A3C style Option-Critic with deliberation cost☆40Jan 9, 2018Updated 8 years ago
- Gym - 32 levels of original Super Mario Bros☆290Dec 21, 2018Updated 7 years ago
- Advantage async actor-critic Algorithms (A3C) and Progressive Neural Network implemented by tensorflow.☆121Oct 12, 2016Updated 9 years ago
- Tensorflow + Keras + OpenAI Gym implementation of 1-step Q Learning from "Asynchronous Methods for Deep Reinforcement Learning"☆1,006Mar 18, 2018Updated 8 years ago
- SCAN: Learning Abstract Hierarchical Compositional Visual Concepts☆53Oct 29, 2017Updated 8 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Use tensorflow2 achieve PPO to play atari game☆13Oct 25, 2019Updated 6 years ago
- Use Asynchronous advantage actor-critic algorithm (A3C) to play Flappy Bird using Keras☆39Aug 26, 2017Updated 8 years ago
- A starter agent that can solve a number of universe environments.☆1,101Apr 7, 2018Updated 8 years ago
- Implementation of Multi-Agent Deep Deterministic Policy Gradients☆39Mar 28, 2018Updated 8 years ago
- An attempt at implementing ideas in "Learning to Transduce with Unbounded Memory" (http://arxiv.org/abs/1506.02516)☆11Jul 27, 2016Updated 9 years ago
- TensorFlow A2C to solve Acrobot, with synchronized parallel environments☆35Apr 21, 2018Updated 8 years ago
- This is an implimentation of Value Iteration Networks (NIPS2016 best paper) in keras☆17Jan 6, 2018Updated 8 years ago
- A framework designed to facilitate statistics and AI research in the game of Blackjack.☆13Mar 20, 2011Updated 15 years ago
- "Continuous Deep Q-Learning with Model-based Acceleration" in TensorFlow☆192Jul 20, 2018Updated 7 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Models built with TensorFlow☆26Dec 5, 2018Updated 7 years ago
- FLUIDS is a lightweight driving simulator for benchmarking Deep Reinforcement and Imitation learning algorithms.☆24May 3, 2019Updated 7 years ago
- A TensorFlow based implementation of the DeepMind Atari playing "Deep Q Learning" agent that works reasonably well☆93Jun 21, 2017Updated 8 years ago
- List Decoder for the Polarization Weight family of Quantum Polar Code.☆12Feb 3, 2025Updated last year
- A multi-agent soccer simulator in a grid-world environment, with agents implementing different reinforcement learning algorithms☆13Jun 4, 2017Updated 9 years ago
- ☆28Apr 28, 2019Updated 7 years ago
- Bayesian Uncertainty Exploration in Deep Reinforcement Learning☆18Jul 12, 2017Updated 8 years ago