My reproduction of various reinforcement learning algorithms (DQN variants, A3C, DPPO, RND with PPO) in Tensorflow.
☆37Mar 24, 2023Updated 3 years ago
Alternatives and similar repositories for reinforcement_learning
Users that are interested in reinforcement_learning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Using N-step dueling DDQN with PER for playing Pacman game☆22Oct 27, 2019Updated 6 years ago
- Keras implementation of DQN for the MsPacman-v0 OpenAI Gym environment.☆36Dec 8, 2022Updated 3 years ago
- Playing Mountain-Car without reward engineering, by combining DQN and Random Network Distillation (RND)☆41Jan 28, 2019Updated 7 years ago
- 学习DRL CNN -> DQN -> LSTM☆13Oct 7, 2018Updated 7 years ago
- Dockerfiles for OpenAI's Gym with Tensorflow☆18Jul 25, 2018Updated 7 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Q learning and DQN☆10Mar 14, 2022Updated 4 years ago
- A tool for experimenting with evolutionary optimization methods for machine learning algorithms, by distributing the workload over a larg…☆14Dec 19, 2018Updated 7 years ago
- Simple Example A3C Reinforcement Learning Algorithm in Tensorflow☆13May 23, 2017Updated 9 years ago
- Code for VIREL: A Variational Inference Framework for Reinforcement Learning☆14Dec 1, 2019Updated 6 years ago
- ☆15Dec 14, 2024Updated last year
- ☆14Dec 4, 2018Updated 7 years ago
- Source code for the papers "Deep-reinforcement learning for fair distributed dynamic spectrum access in wireless networks" and "Deep‐rein…☆13Oct 12, 2022Updated 3 years ago
- The workshop is designed to foster an enabling environment for individuals to build competence in the Edge Computing space.☆14Jun 13, 2023Updated 2 years ago
- Deep Reinforcement Learning by using Proximal Policy Optimization and Random Network Distillation in Tensorflow 2 and Pytorch with some e…☆57Nov 10, 2025Updated 6 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Flexible resource allocation for edge cloud computing with reinforcement learning☆37May 29, 2020Updated 6 years ago
- Google AI Research☆10Mar 11, 2020Updated 6 years ago
- Welcome to 6.86x Machine Learning with Python–From Linear Models to Deep Learning. Machine learning methods are commonly used across eng…☆13Nov 16, 2020Updated 5 years ago
- ☆12Jan 3, 2022Updated 4 years ago
- ☆12Apr 25, 2022Updated 4 years ago
- A Mobile edge computing server placement algorithm, written from scratch for 5g server placement depending upon various KPIs across a ar…☆12Sep 14, 2022Updated 3 years ago
- Full Chainer implementation of OpenAI's Reinforcement Learning using Random Network Distillation☆32Apr 15, 2019Updated 7 years ago
- Dynamic channel allocation in cellular networks by reinforcement learning☆18May 25, 2022Updated 4 years ago
- Proactive Content Caching with Deep Learning☆14Oct 17, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Cythonized versions of the OpenAI Gym classic control environments.☆12Apr 7, 2020Updated 6 years ago
- This is a project based on OpenAI's multi-agent-emergence-environments (Emergent Tool Use from Multi-Agent Autocurricula, Baker et al.), …☆13Jan 5, 2021Updated 5 years ago
- Proximal Policy Optimization (Continuous Version) in PyTorch.☆27May 12, 2025Updated last year
- ☆18Oct 4, 2024Updated last year
- Code for our ICRA 2024 paper on learning diverse skills☆27Apr 6, 2024Updated 2 years ago
- TensorFlow implementation of asynchronous advantage actor-critic (A3C)☆38Oct 20, 2021Updated 4 years ago
- 记录自己学习Deep Learning的笔记和代码☆16Sep 23, 2020Updated 5 years ago
- Asilomar 2020 code for Deep Actor-Critic Learning for Distributed Power Control in Wireless Mobile Networks☆41Jul 27, 2020Updated 5 years ago
- Algorithm implementation for our paper: Robotic World Model: A Neural Network Simulator for Robust Policy Optimization in Robotics☆53Jan 26, 2026Updated 4 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆17Mar 13, 2021Updated 5 years ago
- 深度学习500问,以问答形式对常用的概率知识、线性代数、机器学习、深度学习、计算机视觉等热点问题进行阐述,以帮助自己及有需要的读者。 全书分为18个章节,50余万字。由于水平有限,书中不妥之处恳请广大读者批评指正。 未完待续............ 如有意合作,联系sc…☆12Jul 5, 2019Updated 6 years ago
- Deep Successor Representation☆18Mar 6, 2018Updated 8 years ago
- Dueling Double Deep Q Network with Prioritized Experience Replay Memory☆10Aug 19, 2022Updated 3 years ago
- N-Layered FeUdal Networks based on FeUdal Networks adapted to suit PySC2 observations☆19Sep 17, 2019Updated 6 years ago
- Supplementary code for “Versatile Loco-Manipulation through Flexible Interlimb Coordination”☆70Apr 9, 2026Updated last month
- Tensorflow implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning".☆25Apr 20, 2017Updated 9 years ago