Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21)
☆90Nov 21, 2023Updated 2 years ago
Alternatives and similar repositories for off-policy-continuous-control
Users that are interested in off-policy-continuous-control are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Codes for the study "Variational Recurrent Models for Solving Partially Observable Control Tasks", published as a conference paper at ICL…☆55Dec 27, 2020Updated 5 years ago
- Simple (but often Strong) Baselines for POMDPs in PyTorch, ICML 2022☆343Aug 22, 2024Updated last year
- ☆23Aug 19, 2022Updated 3 years ago
- When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)☆72Jan 18, 2024Updated 2 years ago
- Implementation of the Discrete Soft Actor-Critic algorithm with RNN policy in PyTorch☆26Jan 7, 2023Updated 3 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Single-file SAC-N implementation on jax with flax and equinox. 10x faster than pytorch☆57May 21, 2023Updated 2 years ago
- PyTorch Implementation of the RDPG (Recurrent Deterministic Policy Gradient)☆56Dec 8, 2022Updated 3 years ago
- Implementation of SAC and TD3 based on various RNN and Transformer.☆30Sep 28, 2024Updated last year
- ☆25Apr 16, 2024Updated last year
- ☆10Sep 21, 2020Updated 5 years ago
- Extreme Q-Learning: Max Entropy RL without Entropy☆87Feb 14, 2023Updated 3 years ago
- Deep Transformer Q-Networks for Partially Observable Reinforcement Learning☆172Jul 7, 2024Updated last year
- My own implementation of Reinforcement Learning algorithms using Tensorflow 2.0☆30Jan 22, 2022Updated 4 years ago
- Recurrent continuous reinforcement learning algorithms implemented in Pytorch.☆51May 26, 2021Updated 4 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Code for NeurIPS 2022 paper Exploiting Reward Shifting in Value-Based Deep RL☆29Oct 29, 2023Updated 2 years ago
- AGaLiTe: Approximate Gated Linear Transformers for Online Reinforcement Learning (Published in TMLR)☆23Oct 15, 2024Updated last year
- Code for "Dream and Search to Control: Latent Space Planning for Continuous Control"☆12Jul 12, 2021Updated 4 years ago
- Revisiting Peng's Q(lambda) for Modern Reinforcement Learning☆15Jul 23, 2021Updated 4 years ago
- Partially Observable Process Gym☆213Jun 12, 2025Updated 9 months ago
- This is code of paper entitled "AI-based Radio Resource and Transmission Opportunity Allocation for 5G-V2X HetNets: NR and NR-U networks…☆15Sep 8, 2023Updated 2 years ago
- Evaluating long-term memory of reinforcement learning algorithms☆167Jun 23, 2023Updated 2 years ago
- Soft Actor-Critic with advanced features☆51Mar 2, 2026Updated 3 weeks ago
- Creating fixed-length vectors to describe RL/GA policies☆20Oct 23, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Solving Complex Dexterous Manipulation Tasks with Trajectory Optimisation and Reinforcement Learning☆23May 16, 2021Updated 4 years ago
- Code of the Paper "Time-Efficient Reinforcement Learning with Stochastic Stateful Policies"☆25May 5, 2024Updated last year
- Learning Off-Policy with Online Planning [CoRL 2021 Best Paper Finalist]☆42Aug 27, 2022Updated 3 years ago
- [deprecated] Engine Agnostic Gym Environment for Robotics☆17Feb 10, 2022Updated 4 years ago
- Rainbow DQN implementation accompanying the paper "Fast and Data-Efficient Training of Rainbow" which reaches 205.7 median HNS after 10M …☆44Dec 11, 2021Updated 4 years ago
- Minimal Decision Transformer Implementation written in Jax (Flax).☆17Aug 8, 2022Updated 3 years ago
- Baseline implementation of recurrent PPO using truncated BPTT☆160Apr 28, 2024Updated last year
- Novelty MiniGrid--NovGrid--is an extension of MiniGrid environment that allows for the world properties and dynamics to change according …☆35May 21, 2024Updated last year
- A repository of high-performing hierarchical reinforcement learning models and algorithms.☆336Mar 24, 2023Updated 3 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆18Sep 7, 2023Updated 2 years ago
- On the model-based stochastic value gradient for continuous reinforcement learning☆57Mar 6, 2026Updated 3 weeks ago
- ☆15Jun 1, 2023Updated 2 years ago
- ☆11Oct 19, 2020Updated 5 years ago
- OpenRAN Gym website☆12Dec 11, 2025Updated 3 months ago
- An Open-Ended Agentic Simulator☆60Aug 11, 2024Updated last year
- Clean baseline implementation of PPO using an episodic TransformerXL memory☆207Jun 18, 2024Updated last year