Implementation of SAC and TD3 based on various RNN and Transformer.
☆32Sep 28, 2024Updated last year
Alternatives and similar repositories for Recurrent-Offpolicy-RL
Users that are interested in Recurrent-Offpolicy-RL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A beginner-friendly repository on Deep Reinforcement Learning (RL), written in PyTorch.☆27Mar 19, 2026Updated last month
- PyTorch implementations for Offline Preference-Based RL (PbRL) algorithms☆21Mar 24, 2025Updated last year
- ☆15Dec 5, 2024Updated last year
- Data Center Environment and Reinforcement Learning (RL) Control☆24Oct 29, 2023Updated 2 years ago
- Minimal RLHF implementation built on top of minGPT.☆32Jul 4, 2024Updated last year
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- RLA is a tool for managing your RL experiments automatically☆31Jan 11, 2025Updated last year
- ☆10Mar 22, 2021Updated 5 years ago
- Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21)☆91Nov 21, 2023Updated 2 years ago
- Benchmarked implementations of Offline RL Algorithms.☆77Mar 4, 2025Updated last year
- Theory of Reinforcement Learning☆18Apr 20, 2021Updated 5 years ago
- [NeurIPS 2024] Official code for "Variational Distillation of Diffusion Policies into Mixture of Experts"☆17Dec 7, 2024Updated last year
- ☆27Apr 22, 2024Updated 2 years ago
- When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)☆73Updated this week
- The Official Code for Offline Model-based Adaptable Policy Learning (NeurIPS'21 & TPAMI)☆25Jan 16, 2024Updated 2 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- A small reinforcement learning library for my masters dissertation project☆15Aug 31, 2021Updated 4 years ago
- ☆19Oct 27, 2025Updated 6 months ago
- RLA is a tool for managing your RL experiments automatically☆71Feb 7, 2023Updated 3 years ago
- Implementations of Temporal Difference InfoNCE (TD InfoNCE)☆34Nov 13, 2023Updated 2 years ago
- Codebase for the paper "How Crucial is Transformer in Decision Transformer?". Containing experiments on different pendulum tasks and code…☆28Mar 24, 2023Updated 3 years ago
- ☆15Mar 12, 2022Updated 4 years ago
- # Analyzing-Visualizing-Data-PowerBI ☆12Jun 3, 2024Updated last year
- Official implementation of the ICLR 2021 paper "Differentiable Trust Region Layers for Deep Reinforcement Learning"☆11Aug 23, 2023Updated 2 years ago
- Some notes and solutions to "Machine Learning" authored by Zhi-Hua Zhou☆11Jul 20, 2021Updated 4 years ago
- [IROS 2023] Robust Unmanned Surface Vehicle Navigation with Distributional Reinforcement Learning☆87Oct 13, 2025Updated 6 months ago
- Generate Micro-Doppler signature of human motion by radar☆12Jul 2, 2023Updated 2 years ago
- Author's implementation of ReBRAC, a minimalist improvement upon TD3+BC☆62Aug 3, 2023Updated 2 years ago
- ☆15Oct 20, 2020Updated 5 years ago
- Faster RCNN using TensorFlow☆10Jul 31, 2022Updated 3 years ago
- Author's PyTorch implementation of TD7 for online and offline RL☆163Sep 12, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆10Mar 11, 2024Updated 2 years ago
- Code for MOBILE: Model-Bellman Inconsistency Penalized Offline Policy Optimization☆23Apr 17, 2024Updated 2 years ago
- ☆15May 4, 2025Updated 11 months ago
- Author's PyTorch implementation of ICML'23 paper "Policy Regularization with Dataset Constraint for Offline Reinforcement Learning" for D…☆18Nov 8, 2024Updated last year
- Jax-Baseline is a Reinforcement Learning implementation using JAX and Flax/Haiku libraries, mirroring the functionality of Stable-Baselin…☆65Jan 2, 2026Updated 4 months ago
- ☆10Sep 19, 2023Updated 2 years ago
- (NeurIPS 2023) Residual Q-Learning: Offline and Online Policy Customization without Value☆35Mar 29, 2024Updated 2 years ago