sunnyswag / RL_notes_and_codes
学习强化学习过程中的笔记和代码
☆9Updated 4 years ago
Alternatives and similar repositories for RL_notes_and_codes:
Users that are interested in RL_notes_and_codes are comparing it to the libraries listed below
- Proximal Policy Optimization with Beta distribution - uses multi agent Unity ML Tennis☆29Updated 6 years ago
- The implement of GAIL with pytorch☆14Updated 5 years ago
- RLlib超参数详解(中文)☆16Updated 3 years ago
- simple code to reinforcement learning☆19Updated 4 years ago
- ☆29Updated 2 years ago
- Distributed RL Implementation using Pytorch and Ray (ApeX(Ape-X), A3C, Distributed-PPO(DPPO), Impala)☆26Updated 2 years ago
- Experiments with transformer based RL algorithms☆22Updated 5 years ago
- PyTorch Implementation of Ape-X (Distributed prioritized experience replay) architecture with DQN learner☆27Updated 4 years ago
- Re-produce DQN, REINFORCE, REINFORCE with baseline, one-step AC, QAC, QAC with shared network, PPO2, DDPG, TD3, SAC, SAC discrete,A2C,A3C☆19Updated 4 years ago
- ☆30Updated 2 years ago
- Hierarchical Attention in Reinforcement Learning for Stock Order Executions☆28Updated 3 years ago
- Attention-based Curiosity-driven Exploration in Deep Reinforcement Learning☆26Updated 5 years ago
- Cooperative Multi-goal Multi-stage Multi-agent Reinforcement Learning☆56Updated 2 years ago
- A novel DDPG method with prioritized experience replay (IEEE SMC 2017)☆50Updated 6 years ago
- Implementation of Deep Reinforcement Learning from Self-Play in Imperfect-Information Games (Heinrich and Silver, 2016)☆46Updated 6 years ago
- An implementation of Deep Q-Learning from Demonstrations (DQfD) for playing Atari 2600 video games☆28Updated 2 years ago
- Trading Robot based on LSTM-PPO☆26Updated 5 years ago
- PyTorch implementation of FQF, IQN and QR-DQN.☆174Updated 8 months ago
- ☆13Updated 4 years ago
- Reinforcement Leanring Algorithms Trained with Unity☆14Updated 5 years ago
- Distributed Deep Reinforcement Learning☆29Updated 4 years ago
- ☆18Updated 5 years ago
- Codes accompanying the paper "Learning Nearly Decomposable Value Functions with Communication Minimization" (ICLR 2020)☆81Updated 2 years ago
- Pytorch implementation of Multi-Agent Generative Adversarial Imitation Learning☆40Updated 3 years ago
- The implement of the policy gradient RL algorithm with pytorch☆38Updated 4 years ago
- Experiments with reinforcement learning and recurrent neural networks☆113Updated last year
- Implement many Sparse Reward algorithms in Gym Fetch environment☆85Updated 4 years ago
- Experimenting with meta-learning approaches to opponent modelling in MARL. Building upon previous public implementations of MADDPG and M3…☆14Updated 2 years ago
- My DRL library with tensorflow1.14 based on openai spinning-up☆61Updated 4 years ago
- Multi Agent RL project (Tennis) using MADDPG for Udacity Deep Reinforcement Learning Nano Degree program☆8Updated 6 years ago