twni2016 / pomdp-baselines
Simple (but often Strong) Baselines for POMDPs in PyTorch, ICML 2022
☆318 Updated 8 months ago
Alternatives and similar repositories for pomdp-baselines
Users that are interested in pomdp-baselines are comparing it to the libraries listed below
- Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL ☆359 Updated 3 years ago
- PyTorch implementation of GAIL and AIRL based on PPO. ☆217 Updated 4 years ago
- PyTorch implementation of SAC-Discrete. ☆303 Updated 9 months ago
- Partially Observable Process Gym ☆189 Updated 10 months ago
- (NeurIPS '21 Spotlight) IQ-Learn: Inverse Q-Learning for Imitation ☆218 Updated 2 years ago
- PyTorch implementation of Soft Actor-Critic (SAC) ☆542 Updated 3 years ago
- A PyTorch replication of the model-based reinforcement learning algorithm MBPO ☆167 Updated 3 years ago
- Benchmark for Continuous Multi-Agent Robotic Control, based on OpenAI's Mujoco Gym environments. ☆350 Updated 2 years ago
- PyTorch implementation of the Option-Critic framework, Harb et al. 2016 ☆127 Updated 9 months ago
- PyTorch implementation of Soft Actor-Critic + Autoencoder (SAC+AE) ☆239 Updated 5 years ago
- Code for the paper "When to Trust Your Model: Model-Based Policy Optimization" ☆489 Updated 2 years ago
- Code for conservative Q-learning ☆437 Updated 3 years ago
- PyTorch implementation of Dreamer (Model-based Image RL Algorithm) ☆166 Updated 3 months ago
- Dream to Control: Learning Behaviors by Latent Imagination, implemented in PyTorch. ☆302 Updated last year
- Baseline implementation of recurrent PPO using truncated BPTT ☆142 Updated last year
- ☆195 Updated 2 years ago
- PyTorch implementation of DreamerV2 model-based RL algorithm ☆217 Updated 2 years ago
- An elegant PyTorch offline reinforcement learning library for researchers. ☆329 Updated last year
- A repository of high-performing hierarchical reinforcement learning models and algorithms. ☆305 Updated 2 years ago
- PyTorch implementation of FQF, IQN and QR-DQN. ☆176 Updated 9 months ago
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm. ☆167 Updated 5 months ago
- Code for MOPO: Model-based Offline Policy Optimization ☆178 Updated 2 years ago
- PyTorch implementation of Hierarchical Actor Critic (HAC) for OpenAI gym environments ☆315 Updated 3 years ago
- Implementation of VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning - Zintgraf et al. (ICLR 2020) ☆188 Updated 2 years ago
- ☆343 Updated 2 years ago
- ☆266 Updated 3 years ago
- 🤖 Elegant implementations of offline safe RL algorithms in PyTorch ☆201 Updated 8 months ago
- Maximum Entropy and Maximum Causal Entropy Inverse Reinforcement Learning Implementation in Python ☆284 Updated last year
- Clean baseline implementation of PPO using an episodic TransformerXL memory ☆176 Updated 10 months ago
- Author's PyTorch implementation of TD7 for online and offline RL ☆142 Updated last year