Implementation of SAC and TD3 based on various RNNs and Transformers.
☆30 Sep 28, 2024 Updated last year
Alternatives and similar repositories for Recurrent-Offpolicy-RL
Users that are interested in Recurrent-Offpolicy-RL are comparing it to the libraries listed below
- A beginner-friendly repository on Deep Reinforcement Learning (RL), written in PyTorch. ☆27 Jan 27, 2026 Updated last month
- PyTorch implementations for Offline Preference-Based RL (PbRL) algorithms ☆21 Mar 24, 2025 Updated 11 months ago
- Data Center Environment and Reinforcement Learning (RL) Control ☆22 Oct 29, 2023 Updated 2 years ago
- Minimal RLHF implementation built on top of minGPT. ☆31 Jul 4, 2024 Updated last year
- Code base for NeurIPS 2022 paper Curriculum Reinforcement Learning using Optimal Transport via Gradual Domain Adaptation. ☆11 Aug 21, 2023 Updated 2 years ago
- Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21) ☆90 Nov 21, 2023 Updated 2 years ago
- Benchmarked implementations of Offline RL Algorithms. ☆77 Mar 4, 2025 Updated last year
- Theory of Reinforcement Learning ☆18 Apr 20, 2021 Updated 4 years ago
- [NeurIPS 2024] Official code for "Variational Distillation of Diffusion Policies into Mixture of Experts" ☆17 Dec 7, 2024 Updated last year
- ☆27 Apr 22, 2024 Updated last year
- When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral) ☆72 Jan 18, 2024 Updated 2 years ago
- The Official Code for Offline Model-based Adaptable Policy Learning (NeurIPS'21 & TPAMI) ☆25 Jan 16, 2024 Updated 2 years ago
- A small reinforcement learning library for my masters dissertation project ☆15 Aug 31, 2021 Updated 4 years ago
- ☆19 Oct 27, 2025 Updated 4 months ago
- RLA is a tool for managing your RL experiments automatically ☆72 Feb 7, 2023 Updated 3 years ago
- Implementations of Temporal Difference InfoNCE (TD InfoNCE) ☆34 Nov 13, 2023 Updated 2 years ago
- Codebase for the paper "How Crucial is Transformer in Decision Transformer?". Containing experiments on different pendulum tasks and code… ☆28 Mar 24, 2023 Updated 2 years ago
- MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research ☆40 Jul 14, 2025 Updated 8 months ago
- Public source code for Transformable Gaussian Reward Function for Robot Navigation with Deep Reinforcement Learning ☆21 Aug 7, 2024 Updated last year
- Implementation of ICLR 2025 paper "Q-Adapter: Customizing Pre-trained LLMs to New Preferences with Forgetting Mitigation" ☆18 Oct 5, 2024 Updated last year
- [IROS 2023] Robust Unmanned Surface Vehicle Navigation with Distributional Reinforcement Learning ☆81 Oct 13, 2025 Updated 5 months ago
- Official implementation of the ICLR 2021 paper "Differentiable Trust Region Layers for Deep Reinforcement Learning" ☆11 Aug 23, 2023 Updated 2 years ago
- Some notes and solutions to "Machine Learning" authored by Zhi-Hua Zhou ☆11 Jul 20, 2021 Updated 4 years ago
- Generate Micro-Doppler signature of human motion by radar ☆12 Jul 2, 2023 Updated 2 years ago
- Author's implementation of ReBRAC, a minimalist improvement upon TD3+BC ☆62 Aug 3, 2023 Updated 2 years ago
- Faster RCNN using TensorFlow ☆10 Jul 31, 2022 Updated 3 years ago
- ☆15 Oct 20, 2020 Updated 5 years ago
- ☆10 Mar 11, 2024 Updated 2 years ago
- Author's PyTorch implementation of TD7 for online and offline RL ☆162 Sep 12, 2023 Updated 2 years ago
- Code for MOBILE: Model-Bellman Inconsistency Penalized Offline Policy Optimization ☆23 Apr 17, 2024 Updated last year
- ☆11 Apr 8, 2024 Updated last year
- Author's PyTorch implementation of ICML'23 paper "Policy Regularization with Dataset Constraint for Offline Reinforcement Learning" for D… ☆18 Nov 8, 2024 Updated last year
- ☆15 May 4, 2025 Updated 10 months ago
- Jax-Baseline is a Reinforcement Learning implementation using JAX and Flax/Haiku libraries, mirroring the functionality of Stable-Baselin… ☆65 Jan 2, 2026 Updated 2 months ago
- ☆17 Oct 25, 2023 Updated 2 years ago
- ☆10 Sep 19, 2023 Updated 2 years ago
- ☆14 Apr 3, 2023 Updated 2 years ago
- (NeurIPS 2023) Residual Q-Learning: Offline and Online Policy Customization without Value ☆35 Mar 29, 2024 Updated last year
- ☆19 Nov 21, 2023 Updated 2 years ago