YunjiaXi / Implementation-and-Some-Modification-about-DQN-and-SAC
In this project, I explore some typical value-based and policy-based RL algorithms. I run experiments on DQN, six of its variants, and their combination in the Atari environments Pong and Boxing. I also run experiments on SAC, with DDPG as the baseline, on three MuJoCo environments: Hopper-v2, Ant-v2, and HalfCheetah-v2.
☆11 Updated 4 years ago
Alternatives and similar repositories for Implementation-and-Some-Modification-about-DQN-and-SAC
Users that are interested in Implementation-and-Some-Modification-about-DQN-and-SAC are comparing it to the libraries listed below
- ☆312 Updated 2 years ago
- Implement PPO algorithm on MuJoCo environments, such as Ant-v2, Humanoid-v2, Hopper-v2, HalfCheetah-v2. ☆53 Updated 5 years ago
- ILSwiss is an Easy-to-run Imitation Learning (IL, or Learning from Demonstration, LfD) and also Reinforcement Learning (RL) framework (te… ☆173 Updated last year
- Implementation of benchmark RL algorithms ☆467 Updated 2 years ago
- Play Atari (Breakout) Game by DRL - DQN, Noisy DQN and A3C ☆13 Updated 5 years ago
- Reinforcement learning interviews (to be continued) ☆35 Updated 5 years ago
- Implement DQN and DDQN algorithms on Atari games, such as BreakoutNoFrameskip-v4, PongNoFrameskip-v4, BoxingNoFrameskip-v4. ☆15 Updated 5 years ago
- Basic algorithms of reinforcement learning ☆212 Updated last year
- ☆10 Updated 4 years ago
- PPO, DDPG, SAC implementation on MuJoCo environment ☆113 Updated 3 years ago
- Codes accompanying the paper "RODE: Learning Roles to Decompose Multi-Agent Tasks" (ICLR 2021, https://arxiv.org/abs/2010.01523). RODE is … ☆76 Updated 7 months ago
- Re-implementation of the offline model-based RL algorithm MOPO in PyTorch ☆25 Updated 3 years ago
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning) ☆110 Updated 4 years ago
- A beamer template for LAMDA lab at NJU ☆14 Updated 4 years ago
- A collection of offline reinforcement learning algorithms. ☆189 Updated 7 months ago
- DQN to play Atari Pong ☆114 Updated 6 years ago
- [NeurIPS 2021] CDS achieves remarkable success in challenging benchmarks SMAC and GRF by balancing sharing and diversity. ☆85 Updated 2 years ago
- ☆166 Updated last year
- The implementation of AAAI 2022 paper "Multi-Agent Incentive Communication via Decentralized Teammate Modeling". ☆56 Updated last year
- ☆124 Updated 3 years ago
- A clean and robust PyTorch implementation of PPO on Discrete action space ☆69 Updated last year
- The code of paper Sample-Efficient Reinforcement Learning via Conservative Model-Based Actor-Critic. Zhihai Wang, Jie Wang*, Qi Zhou, Bin… ☆20 Updated 3 years ago
- Contains high quality implementations of Deep Reinforcement Learning algorithms written in PyTorch ☆253 Updated 4 years ago
- ☆97 Updated 4 years ago
- Codes for the paper "SIDE: State Inference for Partially Observable Cooperative Multi-Agent Reinforcement Learning" ☆10 Updated 3 years ago
- Implement many Sparse Reward algorithms in Gym Fetch environment ☆88 Updated 5 years ago
- Code for conservative Q-learning ☆450 Updated 3 years ago
- Discrete soft Q learning (SQL) and soft Q imitation learning (SQIL) implementation in PyTorch, simple! ☆56 Updated 2 years ago
- Detailed explanation of RLlib hyperparameters (in Chinese) ☆18 Updated 3 years ago
- RL algorithms ☆142 Updated 4 years ago