An implementation of the A3C deep reinforcement learning method using an LSTM layer. Created with TensorFlow.
☆29Oct 18, 2017Updated 8 years ago
Alternatives and similar repositories for A3C-LSTM-with-Tensorflow
Users that are interested in A3C-LSTM-with-Tensorflow are comparing it to the libraries listed below.
- Gym wrapper for Vizdoom environments☆12Dec 14, 2018Updated 7 years ago
- Avoiding catastrophic failures in reinforcement learning by learning to shape rewards.☆10Nov 13, 2017Updated 8 years ago
- Tensorflow implementation for "Noisy network for exploration"☆19Aug 2, 2017Updated 8 years ago
- Tensorflow DQN and DRQN agent playing doom☆35May 5, 2017Updated 9 years ago
- Distributed Tensorflow Implementation of Asynchronous Methods for Deep Reinforcement Learning☆29Dec 26, 2017Updated 8 years ago
- Implementation for ACER in tensorflow and sonnet by deepmind☆11Aug 28, 2017Updated 8 years ago
- reinforcement learning. policy gradient. PCL☆37Apr 25, 2017Updated 9 years ago
- ppo-lstm-parallel☆49Mar 26, 2019Updated 7 years ago
- HAProxy combined with confd for HTTP load balancing with SSL offloading☆10Feb 5, 2017Updated 9 years ago
- A3C tensorflow implementation☆11Jul 22, 2018Updated 7 years ago
- Neural machine translation with Recurrent Deterministic Policy Gradient☆10Aug 18, 2016Updated 9 years ago
- Combining deep learning and reinforcement learning.☆81Apr 22, 2026Updated 2 weeks ago
- Attentional Mechanism incorporated in Asynchronous Advantage Actor Critic a3c/a2c deep mind☆10Jan 9, 2018Updated 8 years ago
- Reinforcement Learning framework to facilitate development and use of scalable RL algorithms and applications☆61Mar 29, 2018Updated 8 years ago
- Neural network reinforcement Q-learning for an avoidance game☆10Aug 21, 2017Updated 8 years ago
- Helpful files for Visual Doom AI Competition 2017☆45Jun 21, 2018Updated 7 years ago
- A stock trading system that can sell and buy stocks through Tonghuashun platform, mostly for testing strategy in real time using simulati…☆12Feb 8, 2017Updated 9 years ago
- ☆10Sep 20, 2018Updated 7 years ago
- Collection of reinforcement learners implemented in python. Mainly including DQN and its variants☆54Apr 23, 2017Updated 9 years ago
- Fast domain-aware neural network emulation of a planetary boundary layer parameterization in a numerical weather forecast model☆12Mar 26, 2019Updated 7 years ago
- Polish stopwords collection☆15Mar 5, 2020Updated 6 years ago
- Asynchronous Advantage Actor Critic☆20Aug 15, 2016Updated 9 years ago
- PySC2 OpenAI Gym Environments☆49Jan 23, 2019Updated 7 years ago
- Reinforcement learning with unsupervised auxiliary tasks☆423Feb 13, 2019Updated 7 years ago
- Asynchronous Methods for Deep Reinforcement Learning☆588Aug 9, 2018Updated 7 years ago
- ☆13Nov 17, 2015Updated 10 years ago
- Gathers machine learning and deep learning models for Reinforcement Learning☆10Sep 8, 2018Updated 7 years ago
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023). Football game agent.☆14May 25, 2023Updated 2 years ago
- Using Pilco algorithm to find a controller for few robotic problems☆43Jul 31, 2015Updated 10 years ago
- ☆11Aug 13, 2020Updated 5 years ago
- battery-trading-benchmark is an open source tool to determine the optimal value an Energy Storage System (ESS) can earn on a specific ele…☆10Aug 21, 2024Updated last year
- SPM Cluster Size Threshold estimation☆13Feb 6, 2023Updated 3 years ago
- 臸娥粂陆亩竟☆10May 11, 2024Updated last year
- ☆10Nov 12, 2020Updated 5 years ago
- Customised UITextField and UITextView with HintLabel, ErrorLabel, Divider and validations☆10May 20, 2016Updated 9 years ago
- Decoupling Dynamics and Reward for Transfer Learning☆16Sep 7, 2018Updated 7 years ago
- Accelerated Methods for Deep Reinforcement Learning☆49Mar 20, 2019Updated 7 years ago
- The Winning Solution for the Learning To Run Challenge 2017☆60Jul 4, 2018Updated 7 years ago
- 2 algorithms of optimal trade execution: 1) Dynamic Programming 2) Frank-Wolfe Algorithm (Python & C++)☆19Dec 11, 2019Updated 6 years ago