My reproduction of various reinforcement learning algorithms (DQN variants, A3C, DPPO, RND with PPO) in Tensorflow.
☆37Mar 24, 2023Updated 3 years ago
Alternatives and similar repositories for reinforcement_learning
Users that are interested in reinforcement_learning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Using N-step dueling DDQN with PER for playing Pacman game☆22Oct 27, 2019Updated 6 years ago
- A custom MARL (multi-agent reinforcement learning) environment where multiple agents trade against one another (self-play) in a zero-sum …☆153Mar 12, 2026Updated 3 months ago
- Keras implementation of DQN for the MsPacman-v0 OpenAI Gym environment.☆36Dec 8, 2022Updated 3 years ago
- Playing Mountain-Car without reward engineering, by combining DQN and Random Network Distillation (RND)☆41Jan 28, 2019Updated 7 years ago
- 学习DRL CNN -> DQN -> LSTM☆13Oct 7, 2018Updated 7 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A3C tensorflow implementation☆11Jul 22, 2018Updated 7 years ago
- Deep Reinforcement Learning for Dynamic Multicahnnel Access in Wireless Networks☆14Oct 1, 2017Updated 8 years ago
- Dockerfiles for OpenAI's Gym with Tensorflow☆18Jul 25, 2018Updated 7 years ago
- Q learning and DQN☆10Mar 14, 2022Updated 4 years ago
- Simple Example A3C Reinforcement Learning Algorithm in Tensorflow☆13May 23, 2017Updated 9 years ago
- Code for VIREL: A Variational Inference Framework for Reinforcement Learning☆14Dec 1, 2019Updated 6 years ago
- ☆15Dec 14, 2024Updated last year
- ☆14Dec 4, 2018Updated 7 years ago
- Source code for the papers "Deep-reinforcement learning for fair distributed dynamic spectrum access in wireless networks" and "Deep‐rein…☆14Oct 12, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- The workshop is designed to foster an enabling environment for individuals to build competence in the Edge Computing space.☆14Jun 13, 2023Updated 3 years ago
- Deep Reinforcement Learning by using Proximal Policy Optimization and Random Network Distillation in Tensorflow 2 and Pytorch with some e…☆57Nov 10, 2025Updated 7 months ago
- Google AI Research☆10Mar 11, 2020Updated 6 years ago
- Welcome to 6.86x Machine Learning with Python–From Linear Models to Deep Learning. Machine learning methods are commonly used across eng…☆13Nov 16, 2020Updated 5 years ago
- ☆12Jan 3, 2022Updated 4 years ago
- ☆12Apr 25, 2022Updated 4 years ago
- A Mobile edge computing server placement algorithm, written from scratch for 5g server placement depending upon various KPIs across a ar…☆12Sep 14, 2022Updated 3 years ago
- Full Chainer implementation of OpenAI's Reinforcement Learning using Random Network Distillation☆32Apr 15, 2019Updated 7 years ago
- Bridging OptiTrack (MoCap) pose measurements to PX4 through ROS 2☆14Oct 10, 2025Updated 8 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Proximal Policy Optimization (Continuous Version) in PyTorch.☆27May 12, 2025Updated last year
- ☆18Oct 4, 2024Updated last year
- WLAN channel access through Multi-Agent Reinforcement Learning (MARL)☆11Mar 2, 2022Updated 4 years ago
- ☆17Dec 12, 2022Updated 3 years ago
- Implementation of deep reinforcement learning for optimizing the beams and predicting the blockage events☆17Nov 8, 2020Updated 5 years ago
- TensorFlow implementation of asynchronous advantage actor-critic (A3C)☆38Oct 20, 2021Updated 4 years ago
- ☆13Feb 5, 2023Updated 3 years ago
- A part of C++ optimization lib based on armadillo. It just implements one of the frequently used functions fmincon().☆16Jul 19, 2022Updated 3 years ago
- Asilomar 2020 code for Deep Actor-Critic Learning for Distributed Power Control in Wireless Mobile Networks☆41Jul 27, 2020Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆17Mar 13, 2021Updated 5 years ago
- 深度学习500问,以问答形式对常用的概率知识、线性代数、机器学习、深度学习、计算机视觉等热点问题进行阐述,以帮助自己及有需要的读者。 全书分为18个章节,50余万字。由于水平有限,书中不妥之处恳请广大读者批评指正。 未完待续............ 如有意合作,联系sc…☆12Jul 5, 2019Updated 6 years ago
- 3rd placed submission to the NeurIPS MineRL competition 2019☆10Mar 24, 2023Updated 3 years ago
- Projectwork of a mini-drone offboard application using PX4-ros2☆16Jan 25, 2024Updated 2 years ago
- Predicts the CAISO day-ahead market hourly prices using different forecasting methods including ARIMA and LSTM.☆26Jun 17, 2020Updated 6 years ago
- Dueling Double Deep Q Network with Prioritized Experience Replay Memory☆10Aug 19, 2022Updated 3 years ago
- N-Layered FeUdal Networks based on FeUdal Networks adapted to suit PySC2 observations☆19Sep 17, 2019Updated 6 years ago