YunjiaXi / Implementation-and-Some-Modification-about-DQN-and-SACLinks
In this project, I explore some typical value-based and policy-based RL algorithms. I do experiments on DQN and its six variants and their combination in Atari environments Pong and Boxing. I also do some experiments on SAC with DDPG as baseline on three MuJoCo environments Hopper-v2, Ant-v2, and HalfCheetah-v2.
☆11Updated 4 years ago
Alternatives and similar repositories for Implementation-and-Some-Modification-about-DQN-and-SAC
Users that are interested in Implementation-and-Some-Modification-about-DQN-and-SAC are comparing it to the libraries listed below
Sorting:
- Implement PPO algorithm on mujoco environment,such as Ant-v2, Humanoid-v2, Hopper-v2, Halfcheeth-v2.☆52Updated 4 years ago
- ILSwiss is an Easy-to-run Imitation Learning (IL, or Learning from Demonstration, LfD) and also Reinforcement Learning (RL) framework (te…☆172Updated last year
- Implement DQN and DDQN algorithm on Atari games,such as BreakoutNoFrameskip-v4, PongNoFrameskip-v4,BoxingNoFrameskip-v4.☆15Updated 4 years ago
- 强化学习面试(未完待续)☆35Updated 5 years ago
- Implementation of the paper Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation - https:/…☆86Updated 7 years ago
- The implementation of AAAI 2022 paper "Multi-Agent Incentive Communication via Decentralized Teammate Modeling".☆56Updated last year
- Implement many Sparse Reward algorithms in Gym Fetch environment☆88Updated 4 years ago
- ☆52Updated 6 years ago
- Codes accompanying the paper "RODE: Learning Roles to Decompose Multi-Agent Tasks (ICLR 2021, https://arxiv.org/abs/2010.01523). RODE is …☆76Updated 6 months ago
- Codes accompanying the paper "Learning Nearly Decomposable Value Functions with Communication Minimization" (ICLR 2020)☆81Updated 2 years ago
- code implementation for 'Bi-level Actor-Critic for Multi-agent Coordination'(AAAI2020)☆59Updated 5 years ago
- Basic reinforcement learning algorithms. Including:DQN,Double DQN, Dueling DQN, SARSA, REINFORCE, baseline-REINFORCE, Actor-Critic,DDPG,D…☆92Updated 4 years ago
- A clean and robust Pytorch implementation of PPO on Discrete action space☆70Updated last year
- Play Atari(Breakout) Game by DRL - DQN, Noisy DQN and A3C☆13Updated 5 years ago
- RL algorithms☆142Updated 4 years ago
- Paper list for constrained policy optimization in reinforcement learning.☆72Updated last year
- demo of multi-agent reinforcement learning algorithms, such as ATT-MADDPG (Modelling the Dynamic Joint Policy of Teammates with Attention…☆59Updated 3 years ago
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)☆109Updated 4 years ago
- ☆44Updated 4 years ago
- ☆124Updated 3 years ago
- A collection of offline reinforcement learning algorithms.☆189Updated 7 months ago
- ☆131Updated 11 months ago
- [NeurIPS 2021] CDS achieves remarkable success in challenging benchmarks SMAC and GRF by balancing sharing and diversity.☆86Updated 2 years ago
- ☆22Updated 6 years ago
- pytorch implementation of "Efficient Communication in Multi-Agent Reinforcement Learning via Variance Based Control"☆53Updated 2 years ago
- Reimplementation of Policy Optimization with Demonstrations (POfD) from ICML 2018.☆15Updated 6 years ago
- ☆165Updated last year
- PPO, DDPG, SAC implementation on mujoco environment☆111Updated 3 years ago
- ☆95Updated 4 years ago
- ☆76Updated 5 years ago