Implementations of SAC and TD3 based on various RNN and Transformer architectures.
☆31 · Sep 28, 2024 · Updated last year
Alternatives and similar repositories for Recurrent-Offpolicy-RL
Users who are interested in Recurrent-Offpolicy-RL are comparing it to the libraries listed below.
- A beginner-friendly repository on Deep Reinforcement Learning (RL), written in PyTorch. ☆27 · Mar 19, 2026 · Updated 3 weeks ago
- PyTorch implementations for Offline Preference-Based RL (PbRL) algorithms ☆21 · Mar 24, 2025 · Updated last year
- Data Center Environment and Reinforcement Learning (RL) Control ☆23 · Oct 29, 2023 · Updated 2 years ago
- Minimal RLHF implementation built on top of minGPT. ☆32 · Jul 4, 2024 · Updated last year
- RLA is a tool for managing your RL experiments automatically ☆31 · Jan 11, 2025 · Updated last year
- ☆10 · Mar 22, 2021 · Updated 5 years ago
- Code base for NeurIPS 2022 paper Curriculum Reinforcement Learning using Optimal Transport via Gradual Domain Adaptation. ☆11 · Aug 21, 2023 · Updated 2 years ago
- Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21) ☆91 · Nov 21, 2023 · Updated 2 years ago
- Benchmarked implementations of Offline RL Algorithms. ☆77 · Mar 4, 2025 · Updated last year
- Theory of Reinforcement Learning ☆18 · Apr 20, 2021 · Updated 4 years ago
- [NeurIPS 2024] Official code for "Variational Distillation of Diffusion Policies into Mixture of Experts" ☆17 · Dec 7, 2024 · Updated last year
- ☆27 · Apr 22, 2024 · Updated last year
- When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral) ☆72 · Jan 18, 2024 · Updated 2 years ago
- The Official Code for Offline Model-based Adaptable Policy Learning (NeurIPS'21 & TPAMI) ☆25 · Jan 16, 2024 · Updated 2 years ago
- A small reinforcement learning library for my masters dissertation project ☆15 · Aug 31, 2021 · Updated 4 years ago
- ☆19 · Oct 27, 2025 · Updated 5 months ago
- RLA is a tool for managing your RL experiments automatically ☆72 · Feb 7, 2023 · Updated 3 years ago
- Implementations of Temporal Difference InfoNCE (TD InfoNCE) ☆34 · Nov 13, 2023 · Updated 2 years ago
- Codebase for the paper "How Crucial is Transformer in Decision Transformer?". Containing experiments on different pendulum tasks and code… ☆28 · Mar 24, 2023 · Updated 3 years ago
- MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research ☆41 · Jul 14, 2025 · Updated 8 months ago
- Public source code for Transformable Gaussian Reward Function for Robot Navigation with Deep Reinforcement Learning ☆21 · Aug 7, 2024 · Updated last year
- The official code base of Cautiously-Optimistic kNowledge Sharing (AAAI 2024) ☆12 · Jun 3, 2024 · Updated last year
- Implementation of ICLR 2025 paper "Q-Adapter: Customizing Pre-trained LLMs to New Preferences with Forgetting Mitigation" ☆18 · Oct 5, 2024 · Updated last year
- Official implementation of the ICLR 2021 paper "Differentiable Trust Region Layers for Deep Reinforcement Learning" ☆11 · Aug 23, 2023 · Updated 2 years ago
- [IROS 2023] Robust Unmanned Surface Vehicle Navigation with Distributional Reinforcement Learning ☆84 · Oct 13, 2025 · Updated 5 months ago
- Some notes and solutions to "Machine Learning" authored by Zhi-Hua Zhou ☆11 · Jul 20, 2021 · Updated 4 years ago
- Generate Micro-Doppler signature of human motion by radar ☆12 · Jul 2, 2023 · Updated 2 years ago
- Author's implementation of ReBRAC, a minimalist improvement upon TD3+BC ☆62 · Aug 3, 2023 · Updated 2 years ago
- ☆15 · Oct 20, 2020 · Updated 5 years ago
- Faster RCNN using TensorFlow ☆10 · Jul 31, 2022 · Updated 3 years ago
- Author's PyTorch implementation of TD7 for online and offline RL ☆163 · Sep 12, 2023 · Updated 2 years ago
- ☆10 · Mar 11, 2024 · Updated 2 years ago
- Code for MOBILE: Model-Bellman Inconsistency Penalized Offline Policy Optimization ☆23 · Apr 17, 2024 · Updated last year
- ☆11 · Apr 8, 2024 · Updated 2 years ago
- ☆44 · Apr 8, 2025 · Updated last year
- ☆15 · May 4, 2025 · Updated 11 months ago
- ☆27 · May 1, 2025 · Updated 11 months ago
- This is the official code for ASFL. ☆22 · Mar 3, 2025 · Updated last year
- Jax-Baseline is a Reinforcement Learning implementation using JAX and Flax/Haiku libraries, mirroring the functionality of Stable-Baselin… ☆65 · Jan 2, 2026 · Updated 3 months ago