My reproduction of various reinforcement learning algorithms (DQN variants, A3C, DPPO, RND with PPO) in Tensorflow.
☆37Mar 24, 2023Updated 3 years ago
Alternatives and similar repositories for reinforcement_learning
Users that are interested in reinforcement_learning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Keras implementation of DQN for the MsPacman-v0 OpenAI Gym environment.☆36Dec 8, 2022Updated 3 years ago
- Playing Mountain-Car without reward engineering, by combining DQN and Random Network Distillation (RND)☆41Jan 28, 2019Updated 7 years ago
- 学习DRL CNN -> DQN -> LSTM☆13Oct 7, 2018Updated 7 years ago
- Dockerfiles for OpenAI's Gym with Tensorflow☆18Jul 25, 2018Updated 7 years ago
- Q learning and DQN☆10Mar 14, 2022Updated 4 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- A tool for experimenting with evolutionary optimization methods for machine learning algorithms, by distributing the workload over a larg…☆14Dec 19, 2018Updated 7 years ago
- Simple Example A3C Reinforcement Learning Algorithm in Tensorflow☆13May 23, 2017Updated 8 years ago
- Code for VIREL: A Variational Inference Framework for Reinforcement Learning☆14Dec 1, 2019Updated 6 years ago
- ☆14Dec 14, 2024Updated last year
- Source code for the papers "Deep-reinforcement learning for fair distributed dynamic spectrum access in wireless networks" and "Deep‐rein…☆13Oct 12, 2022Updated 3 years ago
- Deep Reinforcement Learning by using Proximal Policy Optimization and Random Network Distillation in Tensorflow 2 and Pytorch with some e…☆57Nov 10, 2025Updated 5 months ago
- Welcome to 6.86x Machine Learning with Python–From Linear Models to Deep Learning. Machine learning methods are commonly used across eng…☆13Nov 16, 2020Updated 5 years ago
- ☆12Jan 3, 2022Updated 4 years ago
- A Mobile edge computing server placement algorithm, written from scratch for 5g server placement depending upon various KPIs across a ar…☆12Sep 14, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Dynamic channel allocation in cellular networks by reinforcement learning☆18May 25, 2022Updated 3 years ago
- Website for the ICML 2021 tutorial on Random Matrix Theory and Machine Learning☆16Dec 8, 2021Updated 4 years ago
- This is a project based on OpenAI's multi-agent-emergence-environments (Emergent Tool Use from Multi-Agent Autocurricula, Baker et al.), …☆13Jan 5, 2021Updated 5 years ago
- Implementation of sliding mode controllers and differentiator of Matlab☆15Nov 15, 2016Updated 9 years ago
- This repository is a SIMULINK simulation of sliding mode control with classic and optimized methods. See the references for more informat…☆14Jan 5, 2018Updated 8 years ago
- Proximal Policy Optimization (Continuous Version) in PyTorch.☆28May 12, 2025Updated 11 months ago
- WLAN channel access through Multi-Agent Reinforcement Learning (MARL)☆11Mar 2, 2022Updated 4 years ago
- ☆17Dec 12, 2022Updated 3 years ago
- Code for our ICRA 2024 paper on learning diverse skills☆26Apr 6, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- TensorFlow implementation of asynchronous advantage actor-critic (A3C)☆38Oct 20, 2021Updated 4 years ago
- ☆13Feb 5, 2023Updated 3 years ago
- A part of C++ optimization lib based on armadillo. It just implements one of the frequently used functions fmincon().☆16Jul 19, 2022Updated 3 years ago
- Asilomar 2020 code for Deep Actor-Critic Learning for Distributed Power Control in Wireless Mobile Networks☆41Jul 27, 2020Updated 5 years ago
- ☆17Mar 13, 2021Updated 5 years ago
- 深度学习500问,以问答形式对常用的概率知识、线性代数、机器学习、深度学习、计算机视觉等热点问题进行阐述,以帮助自己及有需要的读者。 全书分为18个章节,50余万字。由于水平有限,书中不妥之处恳请广大读者批评指正。 未完待续............ 如有意合作,联系sc…☆12Jul 5, 2019Updated 6 years ago
- 3rd placed submission to the NeurIPS MineRL competition 2019☆10Mar 24, 2023Updated 3 years ago
- Projectwork of a mini-drone offboard application using PX4-ros2☆16Jan 25, 2024Updated 2 years ago
- N-Layered FeUdal Networks based on FeUdal Networks adapted to suit PySC2 observations☆19Sep 17, 2019Updated 6 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Random Network Distillation pytorch☆261Mar 4, 2019Updated 7 years ago
- Series of deep reinforcement learning algorithms 🤖☆29Jun 19, 2021Updated 4 years ago
- ☆14Mar 8, 2026Updated 2 months ago
- [RSS 2026] The first framework enabling humanoid robots to learn whole-body loco-manipulation from egocentric human demos☆113Apr 10, 2026Updated 3 weeks ago
- Deep Reinforcement Learning by using an on-policy adaptation of Maximum a Posteriori Policy Optimization (MPO)☆16Oct 23, 2021Updated 4 years ago
- Clean implementation of Multi-Agent Reinforcement Learning methods (MADDPG, MATD3, MASAC, MAD4PG) in TensorFlow 2.x☆173Oct 24, 2023Updated 2 years ago
- ☆13Feb 27, 2023Updated 3 years ago