twni2016 / pomdp-baselinesLinks
Simple (but often Strong) Baselines for POMDPs in PyTorch, ICML 2022
☆319Updated 9 months ago
Alternatives and similar repositories for pomdp-baselines
Users that are interested in pomdp-baselines are comparing it to the libraries listed below
Sorting:
- Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL☆361Updated 3 years ago
- An elegant PyTorch offline reinforcement learning library for researchers.☆337Updated last year
- (NeurIPS '21 Spotlight) IQ-Learn: Inverse Q-Learning for Imitation☆218Updated 2 years ago
- Dream to Control: Learning Behaviors by Latent Imagination, implemented in PyTorch.☆303Updated last year
- PyTorch implementation of SAC-Discrete.☆302Updated 10 months ago
- A pytorch reprelication of the model-based reinforcement learning algorithm MBPO☆168Updated 3 years ago
- ☆344Updated 2 years ago
- A repository of high-performing hierarchical reinforcement learning models and algorithms.☆314Updated 2 years ago
- PyTorch implementation of DreamerV2 model-based RL algorithm☆217Updated 2 years ago
- PyTorch implementation of Soft Actor-Critic (SAC)☆545Updated 3 years ago
- Benchmark for Continuous Multi-Agent Robotic Control, based on OpenAI's Mujoco Gym environments.☆352Updated 2 years ago
- ☆196Updated 2 years ago
- Partially Observable Process Gym☆190Updated 10 months ago
- PyTorch implementation of GAIL and AIRL based on PPO.☆223Updated 4 years ago
- Clean baseline implementation of PPO using an episodic TransformerXL memory☆179Updated 11 months ago
- Author's PyTorch implementation of TD7 for online and offline RL☆143Updated last year
- Code for the paper "When to Trust Your Model: Model-Based Policy Optimization"☆495Updated 2 years ago
- Code for conservative Q-learning☆446Updated 3 years ago
- 🤖 Elegant implementations of offline safe RL algorithms in PyTorch☆205Updated 8 months ago
- Baseline implementation of recurrent PPO using truncated BPTT☆145Updated last year
- ☆270Updated 3 years ago
- Minimal implementation of Decision Transformer: Reinforcement Learning via Sequence Modeling in PyTorch for mujoco control tasks in Open…☆274Updated 2 years ago
- Code for MOPO: Model-based Offline Policy Optimization☆179Updated 3 years ago
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.☆168Updated 6 months ago
- JAX (Flax) implementation of algorithms for Deep Reinforcement Learning with continuous action spaces.☆684Updated 2 years ago
- 🚀 A fast safe reinforcement learning library in PyTorch☆193Updated 8 months ago
- A collection of offline reinforcement learning algorithms.☆185Updated 6 months ago
- PyTorch implementation of Hierarchical Actor Critic (HAC) for OpenAI gym environments☆316Updated 3 years ago
- Multi-objective Gymnasium environments for reinforcement learning☆326Updated 3 months ago
- PyTorch implementation of Soft Actor-Critic + Autoencoder(SAC+AE)☆239Updated 5 years ago