Parametrized Deep Q-Networks Learning: Reinforcement Learning with Discrete-Continuous Hybrid Action Space
☆61Mar 11, 2022Updated 3 years ago
Alternatives and similar repositories for P-DQN
Users that are interested in P-DQN are comparing it to the libraries listed below
Sorting:
- my code for paper Parameterized-DQN☆25Mar 5, 2021Updated 5 years ago
- ☆71May 9, 2024Updated last year
- Revisiting Discrete Gradient Estimation in MADDPG☆27Feb 24, 2023Updated 3 years ago
- Development of parametric, deep learning, and reinforcement learning agent-based model of car-following behaviour. The models aim to be d…☆25Nov 18, 2019Updated 6 years ago
- "Adaptive Cruise Control for a Hybrid Vehicle with Deep Policy Gradients". Final project for ECE 517/414 Reinforcement Learning.☆13Dec 8, 2021Updated 4 years ago
- ☆16Jan 30, 2025Updated last year
- Implementation of Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor.☆29May 12, 2025Updated 9 months ago
- ☆13Jan 26, 2023Updated 3 years ago
- Hybrid action space reinforcement learning algorithms.☆13Mar 26, 2021Updated 4 years ago
- The code of paper "Learning Heterogeneous Strategies via Graph-based Multi-agent Reinforcement Learning in Mixed Cooperative-Competitive …☆15Jul 17, 2021Updated 4 years ago
- Collection of OpenAI parametrized action-space environments.☆69Mar 19, 2025Updated 11 months ago
- Eco-driving toolbox for battery electric vehicles using detailed loss models☆19Dec 6, 2022Updated 3 years ago
- This project explores object tracking of human and vehicle targets in the infrared using Matlab. Using the OTCBVS Benchmark Dataset Colle…☆20Mar 2, 2015Updated 11 years ago
- ☆48Apr 24, 2022Updated 3 years ago
- The official code releasement of publications in MARL field of TJU RL lab.☆87Jul 15, 2022Updated 3 years ago
- OpenAI Gym environment for Robot Soccer Goal☆18May 17, 2019Updated 6 years ago
- A study code for HEVs eco-driving control☆21Jan 14, 2023Updated 3 years ago
- A novel Hierarchical Imitation Learning algorithm based on AIRL.☆23May 20, 2023Updated 2 years ago
- Official code of Nash-DQN for paper: Nash-DQN algorithm for two-player zero-sum Markov games, details see our paper: A Deep Reinforcement…☆20Aug 26, 2022Updated 3 years ago
- Revisiting Discrete Soft Actor-Critic Accepted by Transactions on Machine Learning Research (TMLR)☆27Nov 23, 2024Updated last year
- OpenAI Gym environment for Platform☆21May 17, 2019Updated 6 years ago
- Model of a parallel-series hybrid-electric vehicle with system-level and detailed variants of electrical system.☆60Oct 25, 2025Updated 4 months ago
- PyTorch implementation of Constrained Reinforcement Learning for Soft Actor Critic Algorithm☆61Jul 11, 2022Updated 3 years ago
- DeceFL: A Principled Decentralized Federated Learning Framework☆29Jan 25, 2023Updated 3 years ago
- Code for the paper "WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning"☆65Aug 3, 2023Updated 2 years ago
- Neural Koopman Lyapunov Control☆26May 5, 2023Updated 2 years ago
- This project aims to design a LQR controller to control a car following a trajectory as accurate and fast as possible☆30Jul 17, 2019Updated 6 years ago
- [RA-L & ICRA 2021] Adversarial Inverse Reinforcement Learning with Self-attention Dynamics Model☆31Aug 5, 2022Updated 3 years ago
- a DRL-based vehicle scheduling algorithm☆32Sep 30, 2022Updated 3 years ago
- A multi-agent reinforcement learning framework for optimizing coverage and connectivity in Space-Air-Ground integrated networks. This pro…☆53Feb 26, 2026Updated last week
- Spring 2017 Deep Reinforcement Learning Final Project☆30May 13, 2017Updated 8 years ago
- ☆17Feb 1, 2026Updated last month
- The environment code for the paper 'Learning-based Eco-driving Strategy Design for Connected Power-split Hybrid Electric Vehicles at Sign…☆36Aug 28, 2022Updated 3 years ago
- ns3-simulator-for-satellite☆11Jul 29, 2022Updated 3 years ago
- joint computation offloading and resource allocation in Internet of Vehicle☆87Apr 4, 2021Updated 4 years ago
- [ICML 2022] The official implementation of DWBC in "Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations"☆35Jan 5, 2023Updated 3 years ago
- DGIST ARTIV Repos☆16Dec 29, 2020Updated 5 years ago
- ☆11Nov 13, 2025Updated 3 months ago
- 秦志金教授论文☆11Sep 14, 2021Updated 4 years ago