FanmingL/Recurrent-Offpolicy-RL

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/FanmingL/Recurrent-Offpolicy-RL)

FanmingL / Recurrent-Offpolicy-RL

Implementation of SAC and TD3 based on various RNN and Transformer.

☆28

Alternatives and similar repositories for Recurrent-Offpolicy-RL

Users that are interested in Recurrent-Offpolicy-RL are comparing it to the libraries listed below

Sorting:

liyc-ai / RL-pytorch
View on GitHub
A beginner-friendly repository on Deep Reinforcement Learning (RL), written in PyTorch.
☆26Jan 27, 2026Updated last month
typoverflow / WiseRL
View on GitHub
PyTorch implementations for Offline Preference-Based RL (PbRL) algorithms
☆21Mar 24, 2025Updated 11 months ago
polixir / RLAssistant
View on GitHub
RLA is a tool for managing your RL experiments automatically
☆32Jan 11, 2025Updated last year
koshachya-myata / Data_Center_Simulation
View on GitHub
Data Center Environment and Reinforcement Learning (RL) Control
☆22Oct 29, 2023Updated 2 years ago
mansicer / Q-Adapter
View on GitHub
Implementation of ICLR 2025 paper "Q-Adapter: Customizing Pre-trained LLMs to New Preferences with Forgetting Mitigation"
☆18Oct 5, 2024Updated last year
polixir / NeoRL2
View on GitHub
☆19Oct 27, 2025Updated 4 months ago
ambujtewari / stats701-winter2021
View on GitHub
Theory of Reinforcement Learning
☆18Apr 20, 2021Updated 4 years ago
LAMDA-RL / OfflineRL-Lib
View on GitHub
Benchmarked implementations of Offline RL Algorithms.
☆77Mar 4, 2025Updated 11 months ago
LAMDA-RL / PRDC
View on GitHub
Author's PyTorch implementation of ICML'23 paper "Policy Regularization with Dataset Constraint for Offline Reinforcement Learning" for D…
☆18Nov 8, 2024Updated last year
NPCLEI / KungFuAthleteBot
View on GitHub
a kungfu Dataset for humanoid robot
☆45Feb 21, 2026Updated last week
facebookresearch / ExPLORe
View on GitHub
This is code to accompany the paper "Accelerating Exploration with Unlabeled Prior Data".
☆25Dec 5, 2023Updated 2 years ago
zhihanyang2022 / off-policy-continuous-control
View on GitHub
Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21)
☆90Nov 21, 2023Updated 2 years ago
yihaosun1124 / mobile
View on GitHub
Code for MOBILE: Model-Bellman Inconsistency Penalized Offline Policy Optimization
☆23Apr 17, 2024Updated last year
tinkoff-ai / ReBRAC
View on GitHub
Author's implementation of ReBRAC, a minimalist improvement upon TD3+BC
☆62Aug 3, 2023Updated 2 years ago
tinker495 / jax-baseline
View on GitHub
Jax-Baseline is a Reinforcement Learning implementation using JAX and Flax/Haiku libraries, mirroring the functionality of Stable-Baselin…
☆63Jan 2, 2026Updated 2 months ago
Zzl35 / flow-to-better
View on GitHub
☆27Apr 22, 2024Updated last year
zzmtsvv / ORL
View on GitHub
☆58Feb 8, 2025Updated last year
xionghuichen / MAPLE
View on GitHub
The Official Code for Offline Model-based Adaptable Policy Learning (NeurIPS'21 & TPAMI)
☆25Jan 16, 2024Updated 2 years ago
twni2016 / Memory-RL
View on GitHub
When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)
☆70Jan 18, 2024Updated 2 years ago
samvelyan / minihack
View on GitHub
MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research
☆39Jul 14, 2025Updated 7 months ago
RobustFieldAutonomyLab / Distributional_RL_Navigation
View on GitHub
[IROS 2023] Robust Unmanned Surface Vehicle Navigation with Distributional Reinforcement Learning
☆80Oct 13, 2025Updated 4 months ago
max7born / decision-lstm
View on GitHub
Codebase for the paper "How Crucial is Transformer in Decision Transformer?". Containing experiments on different pendulum tasks and code…
☆28Mar 24, 2023Updated 2 years ago
chongyi-zheng / td_infonce
View on GitHub
Implementations of Temporal Difference InfoNCE (TD InfoNCE)
☆33Nov 13, 2023Updated 2 years ago
chenran-li / RQL-release
View on GitHub
(NeurIPS 2023) Residual Q-Learning: Offline and Online Policy Customization without Value
☆35Mar 29, 2024Updated last year
rll-research / sim2seg
View on GitHub
Implementation of Sim2Seg (John So*, Amber Xie*, Sunggoo Jung, Jeffrey Edlund, Rohan Thakker, Ali-akbar Agha-mohammad, Pieter Abbeel, Ste…
☆36Aug 26, 2023Updated 2 years ago
xionghuichen / RLAssistant
View on GitHub
RLA is a tool for managing your RL experiments automatically
☆72Feb 7, 2023Updated 3 years ago
Prasham-Patel / CARLA_Motion_Planning_Project
View on GitHub
☆12May 29, 2022Updated 3 years ago
PaperBoardOfficial / cursor-linux-packages
View on GitHub
☆20Oct 18, 2025Updated 4 months ago
HxLyn3 / Machine-Learning
View on GitHub
Some notes and solutions to "Machine Learning" authored by Zhi-Hua Zhou
☆11Jul 20, 2021Updated 4 years ago
HxLyn3 / Faster-RCNN
View on GitHub
Faster RCNN using TensorFlow
☆10Jul 31, 2022Updated 3 years ago
x35f / unstable_baselines
View on GitHub
Re-implementations of SOTA RL algorithms.
☆136Sep 7, 2023Updated 2 years ago
ucla-rlcourse / competitive-rl
View on GitHub
A set of competitive environments for Reinforcement Learning research.
☆30Dec 1, 2022Updated 3 years ago
sfujim / TD7
View on GitHub
Author's PyTorch implementation of TD7 for online and offline RL
☆161Sep 12, 2023Updated 2 years ago
PeixiLiu / humanMotionRadar
View on GitHub
Generate Micro-Doppler signature of human motion by radar
☆12Jul 2, 2023Updated 2 years ago
ExploreIntelligence / RL-MultiAgentSystem
View on GitHub
Reference code for the paper ""Centroid-Guided Target-Driven Topology Control Method for UAV Ad-Hoc Networks Based on Tiny Deep Reinforce…
☆10Oct 21, 2024Updated last year
facebookresearch / dmae_st
View on GitHub
Directed masked autoencoders
☆14Feb 20, 2026Updated last week
Jaeik-Jeong / DeepBid
View on GitHub
Deep Reinforcement Learning based Real-time Renewable Energy Bidding with Battery Control
☆16Jul 13, 2025Updated 7 months ago
cmubig / socialAttention
View on GitHub
☆11Apr 8, 2024Updated last year
AssistiveRoboticsUNH / bc_tutorial
View on GitHub
Getting Started in Imitation Learning
☆13Mar 3, 2025Updated 11 months ago