zhihanyang2022 / off-policy-continuous-control
Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21)
☆90 · Nov 21, 2023 · Updated 2 years ago
Alternatives and similar repositories for off-policy-continuous-control
Users who are interested in off-policy-continuous-control are comparing it to the libraries listed below.
- Single-file SAC-N implementation in JAX with Flax and Equinox. 10x faster than PyTorch ☆56 · May 21, 2023 · Updated 2 years ago
- When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral) ☆67 · Jan 18, 2024 · Updated 2 years ago
- Code for the study "Variational Recurrent Models for Solving Partially Observable Control Tasks", published as a conference paper at ICL… ☆55 · Dec 27, 2020 · Updated 5 years ago
- Simple (but often Strong) Baselines for POMDPs in PyTorch, ICML 2022 ☆342 · Aug 22, 2024 · Updated last year
- ☆23 · Aug 19, 2022 · Updated 3 years ago
- Implementation of the Discrete Soft Actor-Critic algorithm with RNN policy in PyTorch ☆26 · Jan 7, 2023 · Updated 3 years ago
- Extreme Q-Learning: Max Entropy RL without Entropy ☆87 · Feb 14, 2023 · Updated 3 years ago
- ☆10 · Sep 21, 2020 · Updated 5 years ago
- PyTorch Implementation of the RDPG (Recurrent Deterministic Policy Gradient) ☆56 · Dec 8, 2022 · Updated 3 years ago
- Code for NeurIPS 2022 paper Exploiting Reward Shifting in Value-Based Deep RL ☆30 · Oct 29, 2023 · Updated 2 years ago
- Creating fixed-length vectors to describe RL/GA policies ☆20 · Oct 23, 2021 · Updated 4 years ago
- Deep Transformer Q-Networks for Partially Observable Reinforcement Learning ☆171 · Jul 7, 2024 · Updated last year
- Code for "Dream and Search to Control: Latent Space Planning for Continuous Control" ☆12 · Jul 12, 2021 · Updated 4 years ago
- Rainbow DQN implementation accompanying the paper "Fast and Data-Efficient Training of Rainbow" which reaches 205.7 median HNS after 10M … ☆44 · Dec 11, 2021 · Updated 4 years ago
- ☆24 · Apr 16, 2024 · Updated last year
- Recurrent continuous reinforcement learning algorithms implemented in PyTorch. ☆51 · May 26, 2021 · Updated 4 years ago
- Soft Actor-Critic with advanced features ☆51 · Jan 4, 2026 · Updated last month
- Implementation of SAC and TD3 based on various RNN and Transformer. ☆28 · Sep 28, 2024 · Updated last year
- Learning Off-Policy with Online Planning [CoRL 2021 Best Paper Finalist] ☆41 · Aug 27, 2022 · Updated 3 years ago
- On the model-based stochastic value gradient for continuous reinforcement learning ☆57 · Jan 7, 2026 · Updated last month
- Partially Observable Process Gym ☆212 · Jun 12, 2025 · Updated 8 months ago
- Code for the paper "AI-based Radio Resource and Transmission Opportunity Allocation for 5G-V2X HetNets: NR and NR-U networks… ☆15 · Sep 8, 2023 · Updated 2 years ago
- Revisiting Peng's Q(lambda) for Modern Reinforcement Learning ☆15 · Jul 23, 2021 · Updated 4 years ago
- This is a miniature race car gym-env for RL from states (and images) ☆28 · Nov 3, 2021 · Updated 4 years ago
- Novelty MiniGrid--NovGrid--is an extension of MiniGrid environment that allows for the world properties and dynamics to change according … ☆35 · May 21, 2024 · Updated last year
- Variational Reinforcement Learning ☆17 · Jul 25, 2024 · Updated last year
- [deprecated] Engine Agnostic Gym Environment for Robotics ☆17 · Feb 10, 2022 · Updated 4 years ago
- Few-shot Bayesian Imitation Learning with Policies as Logic over Programs ☆20 · Oct 19, 2025 · Updated 3 months ago
- Toolkit of Causal Model-based Reinforcement Learning. ☆33 · Jun 5, 2023 · Updated 2 years ago
- Evaluating long-term memory of reinforcement learning algorithms ☆163 · Jun 23, 2023 · Updated 2 years ago
- Hierarchical Reinforcement Learning (batteries included) ☆48 · Oct 12, 2019 · Updated 6 years ago
- A simple wrapper to analyse and visualise reinforcement learning agents' behaviour in the environment. ☆14 · Jan 8, 2022 · Updated 4 years ago
- Minimal Decision Transformer Implementation written in Jax (Flax). ☆17 · Aug 8, 2022 · Updated 3 years ago
- A repository of high-performing hierarchical reinforcement learning models and algorithms. ☆334 · Mar 24, 2023 · Updated 2 years ago
- Source code for "Multi-objective Model-based Policy Search for Data-efficient Learning with Sparse Rewards" (CoRL 2018) ☆13 · Oct 8, 2018 · Updated 7 years ago
- AGaLiTe: Approximate Gated Linear Transformers for Online Reinforcement Learning (Published in TMLR) ☆23 · Oct 15, 2024 · Updated last year
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm. ☆176 · Nov 14, 2024 · Updated last year
- Advantage weighted Actor Critic for Offline RL ☆52 · Aug 27, 2022 · Updated 3 years ago
- Code of the Paper "Time-Efficient Reinforcement Learning with Stochastic Stateful Policies" ☆25 · May 5, 2024 · Updated last year