An implementation of the A3C deep reinforcement learning method using a LSTM layer. Created with Tensorflow.
☆29Oct 18, 2017Updated 8 years ago
Alternatives and similar repositories for A3C-LSTM-with-Tensorflow
Users that are interested in A3C-LSTM-with-Tensorflow are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Gym wrapper for Vizdoom environments☆12Dec 14, 2018Updated 7 years ago
- Avoiding catastrophic failures in reinforcement learning by learning to shape rewards.☆10Nov 13, 2017Updated 8 years ago
- Tensorflow implementation for "Noisy network for exploration"☆19Aug 2, 2017Updated 8 years ago
- Training Reinforcement Learning agent using derivative of Generative Recurrent Neural Network model of environment☆24Nov 11, 2016Updated 9 years ago
- Implementation for ACER in tensorflow and sonnet by deepmind☆11Aug 28, 2017Updated 8 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Tensorflow implementation of the Differentiable Neural Computer☆12Feb 15, 2019Updated 7 years ago
- ppo-lstm-parallel☆49Mar 26, 2019Updated 7 years ago
- HAProxy combined with confd for HTTP load balancing with SSL offloading☆10Feb 5, 2017Updated 9 years ago
- A3C tensorflow implementation☆11Jul 22, 2018Updated 7 years ago
- Neural machine translation with Recurrent Deterministic Policy Gradient☆10Aug 18, 2016Updated 9 years ago
- Modified tensorflow implementation of 'Asynchronous Methods for Deep Reinforcement Learning'☆21Dec 15, 2016Updated 9 years ago
- Attentional Mechanism incorporated in Asynchronous Advantage Actor Critic a3c/a2c deep mind☆10Jan 9, 2018Updated 8 years ago
- Reinforcement Learning framework to facilitate development and use of scalable RL algorithms and applications☆61Mar 29, 2018Updated 8 years ago
- Neural network reinforcement Q-learning for an avoidance game☆10Aug 21, 2017Updated 8 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Starter code and instructions for participating in MultiON Challenge 2021.☆12Jun 12, 2024Updated 2 years ago
- Helpful files for Visual Doom AI Competition 2017☆44Jun 21, 2018Updated 7 years ago
- ☆11Jan 3, 2023Updated 3 years ago
- ☆10Sep 20, 2018Updated 7 years ago
- nd009-cn-advanced-p5,针对Udacity CN MLND P5项目☆14Jun 27, 2022Updated 3 years ago
- Asynchronous Advantage Actor Critic☆20Aug 15, 2016Updated 9 years ago
- PySC2 OpenAI Gym Environments☆49Jan 23, 2019Updated 7 years ago
- Reinforcement learning with unsupervised auxiliary tasks☆422Feb 13, 2019Updated 7 years ago
- Asynchronous Methods for Deep Reinforcement Learning☆588Aug 9, 2018Updated 7 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- CS234 Reinforcement Learning: Keras implementation of Recurrent Deterministic Policy Gradient (https://arxiv.org/abs/1512.04455)☆10Jun 10, 2017Updated 9 years ago
- Monte Carlo Tree Search (MCTS) ,realize using python☆12Mar 10, 2016Updated 10 years ago
- Gathers machine learning and deep learning models for Reinforcement Learning☆10Sep 8, 2018Updated 7 years ago
- ☆14Oct 5, 2017Updated 8 years ago
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023) 足球游戏智能体☆14May 25, 2023Updated 3 years ago
- Using Pilco algorithm to find a controller for few robotic problems☆43Jul 31, 2015Updated 10 years ago
- Recurrent Network-based Deterministic Policy Gradient for Solving Bipedal Walking Challenge on Rugged Terrains☆12Oct 16, 2017Updated 8 years ago
- paper implementation for the Machine Learning for Computer Vision lecture☆34Mar 15, 2018Updated 8 years ago
- ☆10Nov 12, 2020Updated 5 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Customised UITextField and UITextView with HintLabel, ErrorLabel, Divider and validations☆10May 20, 2016Updated 10 years ago
- ☆13Jan 23, 2021Updated 5 years ago
- Decoupling Dynamics and Reward for Transfer Learning☆16Sep 7, 2018Updated 7 years ago
- Accelerated Methods for Deep Reinforcement Learning☆49Mar 20, 2019Updated 7 years ago
- The Winning Solution for the Learning To Run Challenge 2017☆60Jul 4, 2018Updated 7 years ago
- TensorFlow & Keras implementation of DQN with HER (Hindsight Experience Replay)☆40Jul 31, 2020Updated 5 years ago
- ☆10Sep 19, 2019Updated 6 years ago