YunjiaXi / Implementation-and-Some-Modification-about-DQN-and-SAC
In this project, I explore some typical value-based and policy-based RL algorithms. I do experiments on DQN and its six variants and their combination in Atari environments Pong and Boxing. I also do some experiments on SAC with DDPG as baseline on three MuJoCo environments Hopper-v2, Ant-v2, and HalfCheetah-v2.
☆12Updated 3 years ago
Related projects ⓘ
Alternatives and complementary repositories for Implementation-and-Some-Modification-about-DQN-and-SAC
- Implement PPO algorithm on mujoco environment,such as Ant-v2, Humanoid-v2, Hopper-v2, Halfcheeth-v2.☆50Updated 4 years ago
- PPO, DDPG, SAC implementation on mujoco environment☆90Updated 2 years ago
- Implement DQN and DDQN algorithm on Atari games,such as BreakoutNoFrameskip-v4, PongNoFrameskip-v4,BoxingNoFrameskip-v4.☆16Updated 4 years ago
- ILSwiss is an Easy-to-run Imitation Learning (IL, or Learning from Demonstration, LfD) and also Reinforcement Learning (RL) framework (te…☆161Updated last year
- A clean and robust Pytorch implementation of PPO on Discrete action space☆59Updated 5 months ago
- RL algorithms☆139Updated 3 years ago
- A clean and robust Pytorch implementation of PPO on continuous action space.☆121Updated 5 months ago
- Deep recurrent Q learning on CartPole-v1 environment☆73Updated 9 months ago
- Implement some algorithms of RL☆43Updated last year
- Basic reinforcement learning algorithms. Including:DQN,Double DQN, Dueling DQN, SARSA, REINFORCE, baseline-REINFORCE, Actor-Critic,DDPG,D…☆90Updated 3 years ago
- 强化学习面试(未完待续)☆32Updated 4 years ago
- ☆71Updated 5 years ago
- Codes accompanying the paper "RODE: Learning Roles to Decompose Multi-Agent Tasks (ICLR 2021, https://arxiv.org/abs/2010.01523). RODE is …☆69Updated 11 months ago
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)☆93Updated 3 years ago
- ☆56Updated 4 years ago
- Paper list for constrained policy optimization in reinforcement learning.☆67Updated last year
- Paper Collection for Imitation Learning in RL.☆133Updated 2 years ago
- Implementation of the paper Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation - https:/…☆81Updated 6 years ago
- The implementation of AAAI'22 paper "Multi-Agent Incentive Communication via Decentralized Teammate Modeling".☆50Updated 11 months ago
- Implement many Sparse Reward algorithms in Gym Fetch environment☆81Updated 4 years ago
- The offcial implementation of "ToM2C: Target-oriented Multi-agent Communication and Cooperation with Theory of Mind" (ICLR 2022) .☆55Updated last week
- 多智能体强化学习(MARL)算法复现,包括QMIX,VDN,QTRAN、MAVEN等等☆179Updated 2 years ago
- Wolpertinger Training with DDPG (Pytorch), Deep Reinforcement Learning in Large Discrete Action Spaces. Multi-GPU/Singer-GPU/CPU compatib…☆63Updated last year
- The code for maddpg using pytorch☆162Updated 4 years ago
- A collection of offline reinforcement learning algorithms.☆158Updated 5 months ago
- DGN Code☆335Updated last year
- Actor Critic model to play Cartpole game☆52Updated 6 years ago
- Multi-Agent Reinforcement Learning (MARL) papers☆212Updated 2 years ago
- DQN with pytorch with on Breakout and SpaceInvaders☆25Updated 5 years ago
- Implementation of benchmark RL algorithms☆460Updated 2 years ago