hmomin / TD3-Bipedal-Walker
Trains an agent with Twin Delayed Deep Deterministic Policy Gradient (TD3) to solve the Bipedal Walker challenge from OpenAI
☆12Updated last year
Related projects ⓘ
Alternatives and complementary repositories for TD3-Bipedal-Walker
- ☆277Updated 6 months ago
- ☆76Updated last year
- ☆143Updated 6 months ago
- PPO, DDPG, SAC implementation on mujoco environment☆90Updated 2 years ago
- Official implementation of Diffusion Policy Policy Optimization, arxiv 2024☆227Updated this week
- Official implementation of the BRO algorithm☆10Updated 3 weeks ago
- A simple implementation of Generative Adversarial Imitation Learning with PyTorch☆135Updated 2 years ago
- DDPG + HER implementation in PyTorch for FetchSlide Robot☆18Updated 2 years ago
- A PyTorch implementation of Implicit Q-Learning☆66Updated 3 years ago
- ☆43Updated 10 months ago
- ILSwiss is an Easy-to-run Imitation Learning (IL, or Learning from Demonstration, LfD) and also Reinforcement Learning (RL) framework (te…☆161Updated last year
- ☆19Updated 5 months ago
- Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL☆328Updated 2 years ago
- ☆10Updated 3 months ago
- Author's PyTorch implementation of TD7 for online and offline RL☆116Updated last year
- ☆305Updated last year
- Implementation of Dreamer v3 in pytorch.☆428Updated last month
- DDPG with Hindsight Experience Replay (HER) solving Openai gym Fetch robotic environment in Pytorch☆13Updated 3 years ago
- Set of robotic environments based on PyBullet physics engine and gymnasium.☆576Updated 4 months ago
- This is a library that provides dual dexterous hand manipulation tasks through Isaac Gym☆662Updated 5 months ago
- Diversity is All You Need: Learning Skills without a Reward Function in PyTorch.☆60Updated last year
- 🤖 Elegant implementations of offline safe RL algorithms in PyTorch☆176Updated 2 months ago
- Code and website for Behavior Transformers: Cloning k modes with one stone.☆109Updated last year
- what if removed adversiral loss from adversarial motion piror? a pairwise motion piror solution inspired by https://arxiv.org/abs/1706.0…☆11Updated 2 months ago
- This repository contains a collection of resources and papers on Diffusion Models for RL, accompanying the paper "Diffusion Models for Re…☆449Updated 2 months ago
- Related papers for reinforcement learning, including classic papers and latest papers in top conferences☆312Updated this week
- ☆402Updated 2 months ago
- This is the repo of "RL-ViGen: A Reinforcement Learning Benchmark for Visual Generalization"☆92Updated 2 months ago
- PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and co…☆124Updated 6 months ago
- Implementation of A2PR, a simple way to achieve SOTA in offline reinforcement learning with an adaptive advantage-guided policy regulariz…☆23Updated 5 months ago