lzhan144 / Solving-CarRacing-with-DDPG
☆11 · Updated 5 years ago
Alternatives and similar repositories for Solving-CarRacing-with-DDPG:
Users who are interested in Solving-CarRacing-with-DDPG are comparing it to the repositories listed below.
- An implementation of deep reinforcement learning TD3 algorithm with prioritized experience replay (PER) buffer ☆23 · Updated 5 years ago
- Level-Based Foraging (LBF): A multi-agent reinforcement learning environment ☆44 · Updated 7 months ago
- ☆67 · Updated 2 years ago
- 2D Gridworld navigation using RL with Hindsight Experience Replay ☆45 · Updated 5 years ago
- Public version of the decentralized, attention-based mTSP code ☆36 · Updated 3 years ago
- OpenAI Gym environment designed for training RL agents to control the flight of a two-dimensional drone. ☆50 · Updated 2 years ago
- The Starcraft Multi-Agent challenge lite ☆42 · Updated 7 months ago
- PyTorch implementation of the Q-Learning Algorithm Normalized Advantage Function for continuous control problems + PER and N-step Method ☆29 · Updated 4 years ago
- Collection of Deep Reinforcement Learning Algorithms implemented in PyTorch. ☆78 · Updated 4 years ago
- An improvement of CarRacing-v0 from OpenAI Gym in order to make the environment complex enough for Hierarchical Reinforcement Learning ☆71 · Updated last year
- Multi-Robot Warehouse (RWARE): A multi-agent reinforcement learning environment ☆65 · Updated 7 months ago
- JAX and PZ RL envs + algorithms for swarms of CrazyFlies ☆73 · Updated 7 months ago
- DecentralizedLearning ☆24 · Updated 2 years ago
- Deep recurrent Q learning on CartPole-v1 environment ☆89 · Updated last year
- Solution for Taxi env using HRL (Hierarchical reinforcement learning) (2018) ☆21 · Updated 5 years ago
- The implementation of LSTM-TD3. ☆79 · Updated 2 years ago
- PyTorch implementation of intrinsic curiosity module with proximal policy optimization ☆53 · Updated 6 years ago
- Multi agent PPO implementation in PyTorch for Unity ML Agents environments. ☆26 · Updated 9 months ago
- Keras Implementation of TD3 (Twin Delayed DDPG) with PER (Prioritized Experience Replay) option on OpenAI gym framework ☆10 · Updated 3 years ago
- Baseline implementation of recurrent PPO using truncated BPTT ☆140 · Updated 11 months ago
- Heterogeneous Multi-Robot Reinforcement Learning ☆47 · Updated 7 months ago
- IMP-MARL: a Suite of Environments for Large-scale Infrastructure Management Planning via MARL ☆41 · Updated 7 months ago
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER) ☆50 · Updated 2 months ago
- PyTorch GAIL, VAIL, AIRL, VAIRL, EAIRL, SQIL Implementation ☆65 · Updated 3 years ago
- A list of safe reinforcement learning papers ☆20 · Updated 5 years ago
- Distributional Soft Actor Critic ☆52 · Updated 4 years ago
- Proximal Policy Optimization (PPO) with Intrinsic Curiosity Module (ICM) on Pyramid env, Unity ML ☆15 · Updated last year
- Implementation of the Nash Q-Learning algorithm to solve simple MARL problems with two agents. ☆22 · Updated 2 years ago
- OpenAI Gym interfaces for multi-robot flocking problems ☆40 · Updated 3 years ago
- Implementation of Off Policy Adversarial Inverse Reinforcement Learning ☆22 · Updated 4 years ago