Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21)
☆91Nov 21, 2023Updated 2 years ago
Alternatives and similar repositories for off-policy-continuous-control
Users that are interested in off-policy-continuous-control are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Codes for the study "Variational Recurrent Models for Solving Partially Observable Control Tasks", published as a conference paper at ICL…☆55Dec 27, 2020Updated 5 years ago
- Simple (but often Strong) Baselines for POMDPs in PyTorch, ICML 2022☆345Aug 22, 2024Updated last year
- ☆23Aug 19, 2022Updated 3 years ago
- When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)☆72Jan 18, 2024Updated 2 years ago
- Implementation of the Discrete Soft Actor-Critic algorithm with RNN policy in PyTorch☆26Jan 7, 2023Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Single-file SAC-N implementation on jax with flax and equinox. 10x faster than pytorch☆57May 21, 2023Updated 2 years ago
- PyTorch Implementation of the RDPG (Recurrent Deterministic Policy Gradient)☆56Dec 8, 2022Updated 3 years ago
- Implementation of SAC and TD3 based on various RNN and Transformer.☆31Sep 28, 2024Updated last year
- ☆25Apr 16, 2024Updated 2 years ago
- ☆10Sep 21, 2020Updated 5 years ago
- Extreme Q-Learning: Max Entropy RL without Entropy☆87Feb 14, 2023Updated 3 years ago
- Deep Transformer Q-Networks for Partially Observable Reinforcement Learning☆172Jul 7, 2024Updated last year
- Recurrent continuous reinforcement learning algorithms implemented in Pytorch.☆51May 26, 2021Updated 4 years ago
- Code for NeurIPS 2022 paper Exploiting Reward Shifting in Value-Based Deep RL☆29Oct 29, 2023Updated 2 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- AGaLiTe: Approximate Gated Linear Transformers for Online Reinforcement Learning (Published in TMLR)☆23Oct 15, 2024Updated last year
- Code for "Dream and Search to Control: Latent Space Planning for Continuous Control"☆12Jul 12, 2021Updated 4 years ago
- Revisiting Peng's Q(lambda) for Modern Reinforcement Learning☆15Jul 23, 2021Updated 4 years ago
- Partially Observable Process Gym☆214Jun 12, 2025Updated 10 months ago
- This is code of paper entitled "AI-based Radio Resource and Transmission Opportunity Allocation for 5G-V2X HetNets: NR and NR-U networks…☆15Sep 8, 2023Updated 2 years ago
- Evaluating long-term memory of reinforcement learning algorithms☆171Jun 23, 2023Updated 2 years ago
- Soft Actor-Critic with advanced features☆51Apr 12, 2026Updated last week
- Creating fixed-length vectors to describe RL/GA policies☆20Oct 23, 2021Updated 4 years ago
- Solving Complex Dexterous Manipulation Tasks with Trajectory Optimisation and Reinforcement Learning☆23May 16, 2021Updated 4 years ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Code of the Paper "Time-Efficient Reinforcement Learning with Stochastic Stateful Policies"☆25May 5, 2024Updated last year
- Learning Off-Policy with Online Planning [CoRL 2021 Best Paper Finalist]☆42Aug 27, 2022Updated 3 years ago
- [deprecated] Engine Agnostic Gym Environment for Robotics☆17Feb 10, 2022Updated 4 years ago
- Rainbow DQN implementation accompanying the paper "Fast and Data-Efficient Training of Rainbow" which reaches 205.7 median HNS after 10M …☆44Dec 11, 2021Updated 4 years ago
- Minimal Decision Transformer Implementation written in Jax (Flax).☆17Aug 8, 2022Updated 3 years ago
- Baseline implementation of recurrent PPO using truncated BPTT☆160Apr 28, 2024Updated last year
- Novelty MiniGrid--NovGrid--is an extension of MiniGrid environment that allows for the world properties and dynamics to change according …☆35May 21, 2024Updated last year
- A repository of high-performing hierarchical reinforcement learning models and algorithms.☆336Mar 24, 2023Updated 3 years ago
- ☆18Sep 7, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- On the model-based stochastic value gradient for continuous reinforcement learning☆57Mar 6, 2026Updated last month
- ☆15Jun 1, 2023Updated 2 years ago
- ☆11Oct 19, 2020Updated 5 years ago
- An Open-Ended Agentic Simulator☆60Aug 11, 2024Updated last year
- Clean baseline implementation of PPO using an episodic TransformerXL memory☆209Jun 18, 2024Updated last year
- Real time surface reconstruction☆17Jun 18, 2018Updated 7 years ago
- Deep RL agents with PyTorch☆36Sep 25, 2021Updated 4 years ago