My reproduction of various reinforcement learning algorithms (DQN variants, A3C, DPPO, RND with PPO) in Tensorflow.
☆37Mar 24, 2023Updated 3 years ago
Alternatives and similar repositories for reinforcement_learning
Users that are interested in reinforcement_learning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Using N-step dueling DDQN with PER for playing Pacman game☆22Oct 27, 2019Updated 6 years ago
- Keras implementation of DQN for the MsPacman-v0 OpenAI Gym environment.☆36Dec 8, 2022Updated 3 years ago
- Playing Mountain-Car without reward engineering, by combining DQN and Random Network Distillation (RND)☆41Jan 28, 2019Updated 7 years ago
- 学习DRL CNN -> DQN -> LSTM☆13Oct 7, 2018Updated 7 years ago
- A3C tensorflow implementation☆11Jul 22, 2018Updated 7 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Deep Reinforcement Learning for Dynamic Multicahnnel Access in Wireless Networks☆14Oct 1, 2017Updated 8 years ago
- Dockerfiles for OpenAI's Gym with Tensorflow☆18Jul 25, 2018Updated 7 years ago
- Simple Example A3C Reinforcement Learning Algorithm in Tensorflow☆13May 23, 2017Updated 8 years ago
- Code for VIREL: A Variational Inference Framework for Reinforcement Learning☆14Dec 1, 2019Updated 6 years ago
- ☆14Dec 4, 2018Updated 7 years ago
- Source code for the papers "Deep-reinforcement learning for fair distributed dynamic spectrum access in wireless networks" and "Deep‐rein…☆13Oct 12, 2022Updated 3 years ago
- The workshop is designed to foster an enabling environment for individuals to build competence in the Edge Computing space.☆14Jun 13, 2023Updated 2 years ago
- Deep Reinforcement Learning by using Proximal Policy Optimization and Random Network Distillation in Tensorflow 2 and Pytorch with some e…☆57Nov 10, 2025Updated 5 months ago
- Flexible resource allocation for edge cloud computing with reinforcement learning☆38May 29, 2020Updated 5 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Google AI Research☆10Mar 11, 2020Updated 6 years ago
- Welcome to 6.86x Machine Learning with Python–From Linear Models to Deep Learning. Machine learning methods are commonly used across eng…☆13Nov 16, 2020Updated 5 years ago
- ☆12Jan 3, 2022Updated 4 years ago
- ☆12Apr 25, 2022Updated 3 years ago
- A Mobile edge computing server placement algorithm, written from scratch for 5g server placement depending upon various KPIs across a ar…☆12Sep 14, 2022Updated 3 years ago
- Full Chainer implementation of OpenAI's Reinforcement Learning using Random Network Distillation☆32Apr 15, 2019Updated 7 years ago
- Proactive Content Caching with Deep Learning☆14Oct 17, 2022Updated 3 years ago
- Cythonized versions of the OpenAI Gym classic control environments.☆12Apr 7, 2020Updated 6 years ago
- Website for the ICML 2021 tutorial on Random Matrix Theory and Machine Learning☆16Dec 8, 2021Updated 4 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- This is a project based on OpenAI's multi-agent-emergence-environments (Emergent Tool Use from Multi-Agent Autocurricula, Baker et al.), …☆13Jan 5, 2021Updated 5 years ago
- Proximal Policy Optimization (Continuous Version) in PyTorch.☆29May 12, 2025Updated 11 months ago
- Python code for interactive parallel coordinates visualization on jupyter notebook.☆12Sep 8, 2019Updated 6 years ago
- ☆18Oct 4, 2024Updated last year
- WLAN channel access through Multi-Agent Reinforcement Learning (MARL)☆11Mar 2, 2022Updated 4 years ago
- ☆17Dec 12, 2022Updated 3 years ago
- This repo contains the code demonstrated in the Analytics Vidhya article about PyWebIO usage and the ML model prediction code.☆11Apr 22, 2021Updated 4 years ago
- Implementation of deep reinforcement learning for optimizing the beams and predicting the blockage events☆17Nov 8, 2020Updated 5 years ago
- ☆10Sep 25, 2019Updated 6 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- TensorFlow implementation of asynchronous advantage actor-critic (A3C)☆38Oct 20, 2021Updated 4 years ago
- 记录自己学习Deep Learning的笔记和代码☆16Sep 23, 2020Updated 5 years ago
- ☆13Feb 5, 2023Updated 3 years ago
- Asilomar 2020 code for Deep Actor-Critic Learning for Distributed Power Control in Wireless Mobile Networks☆41Jul 27, 2020Updated 5 years ago
- ☆17Mar 13, 2021Updated 5 years ago
- 深度学习500问,以问答形式对常用的概率知识、线性代数、机器学习、深度学习、计算机视觉等热点问题进行阐述,以帮助自己及有需要的读者。 全书分为18个章节,50余万字。由于水平有限,书中不妥之处恳请广大读者批评指正。 未完待续............ 如有意合作,联系sc…☆12Jul 5, 2019Updated 6 years ago
- 3rd placed submission to the NeurIPS MineRL competition 2019☆10Mar 24, 2023Updated 3 years ago