georgesung / TD3
Benchmarking TD3 and DDPG on PyBullet
☆52Updated 5 years ago
Related projects ⓘ
Alternatives and complementary repositories for TD3
- Convert DeepMind Control Suite to OpenAI gym environments.☆83Updated 4 years ago
- A standalone library to randomize various OpenAI Gym Environments☆60Updated 5 years ago
- ☆110Updated last year
- Efficient Exploration via State Marginal Matching (2019)☆66Updated 5 years ago
- OpenAI Gym wrapper for the DeepMind Control Suite☆205Updated 6 months ago
- OpenAI Gym Wrapper for DeepMind Control Suite☆71Updated 2 years ago
- Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a Latent Variable Model☆149Updated 4 years ago
- The MAGICAL benchmark suite for robust imitation learning (NeurIPS 2020)☆75Updated 11 months ago
- Implementation of Truncated Quantile Critics method for continuous reinforcement learning. https://bayesgroup.github.io/tqc/☆90Updated 3 years ago
- Hindsight policy gradients☆43Updated 4 years ago
- Simplifying Model-based RL: Learning Representations, Latent-space Models and Policies with One Objective☆79Updated last year
- impact-driven-exploration☆128Updated last year
- Simple maze environments using mujoco-py☆52Updated 10 months ago
- Revisiting Rainbow☆73Updated 3 years ago
- rllab's viskit with some added features☆73Updated last year
- Code for "Divide-and-Conquer Reinforcement Learning"☆60Updated 5 years ago
- pytorch-implementation of Dreamer (Model-based Image RL Algorithm)☆163Updated 2 years ago
- Multitask Environments for RL☆274Updated 3 years ago
- DHER: Hindsight Experience Replay for Dynamic Goals (ICLR-2019)☆66Updated 5 years ago
- reinforcement learning from randomized simulations☆64Updated last week
- Datasets for data-driven deep reinforcement learning with PyBullet environments☆143Updated 3 years ago
- Modifiable OpenAI Gym environments for studying generalization in RL☆86Updated 5 years ago
- ☆25Updated 4 years ago
- Implementation of the skill discovery algorithm described in ICLR submission "Option Discovery using Deep Skill Chaining"☆28Updated 5 years ago
- ☆20Updated 3 years ago
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…☆49Updated 2 years ago
- A library of probabilistic model based RL algorithms in pytorch☆107Updated 3 years ago
- Code and project page for D-REX algorithm from the paper "Better-than-Demonstrator Imitation Learning via Automatically-Ranked Demonstrat…☆49Updated last year
- ☆97Updated last year
- implementation of our self-guided and self-regularized actor-critic algorithm☆30Updated last year