Haichao-Zhang/PEX

Policy Expansion for Bridging Offline-to-Online Reinforcement Learning (ICLR23)

☆64

Alternatives and similar repositories for PEX

Users who are interested in PEX are comparing it to the libraries listed below.

shlee94 / Off2OnRL
☆61 · Feb 3, 2023 · Updated 3 years ago
guosyjlu / OEMA
Official PyTorch code for "Sample Efficient Offline-to-Online Reinforcement Learning" in TKDE'23.
☆16 · Aug 14, 2023 · Updated 2 years ago
Facebear-ljx / PROTO
☆17 · May 25, 2023 · Updated 3 years ago
nakamotoo / Cal-QL
official implementation for our paper Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning (NeurIPS 2023)
☆124 · Jul 31, 2024 · Updated last year
ikostrikov / rlpd
☆411 · Feb 13, 2023 · Updated 3 years ago
ZishunYu / Actor-Critic-Alignment
Implementation of "Actor-Critic Alignment for Offline-to-Online Reinforcement Learning"
☆13 · Oct 12, 2023 · Updated 2 years ago
Improbable-AI / harness-offline-rl
Official implementation of Harnessing Mixed Offline Reinforcement Learning Datasets via Trajectory Reweighting
☆16 · Feb 14, 2024 · Updated 2 years ago
t6-thu / H2O
[NeurIPS'22 Spotlight] When to Trust Your Simulator: Dynamics-Aware Hybrid Offline-and-Online Reinforcement Learning
☆59 · Sep 24, 2023 · Updated 2 years ago
Cloud0723 / Offline-MLIRL
☆22 · Dec 18, 2023 · Updated 2 years ago
thuml / SPOT
Code release for "Supported Policy Optimization for Offline Reinforcement Learning" (NeurIPS 2022), https://arxiv.org/abs/2202.06239
☆22 · Jun 24, 2023 · Updated 3 years ago
pcchenxi / LAPO-offlienRL
☆16 · Apr 14, 2026 · Updated 3 months ago
ryanxhr / IVR
[ICLR 2023 Oral] The official implementation of SQL and EQL in "Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Reg…
☆46 · Jul 27, 2023 · Updated 3 years ago
LeapLabTHU / FamO2O
Repository of "Train Once, Get a Family: State-Adaptive Balances for Offline-to-Online Reinforcement Learning" (NeurIPS 2023 Spotlight)
☆41 · Oct 30, 2023 · Updated 2 years ago
zzmtsvv / ORL
☆58 · Feb 8, 2025 · Updated last year
vivekmyers / horizon_generalization
☆15 · Feb 5, 2025 · Updated last year
ReedZyd / GenerativeReturnDecomposition
Source code for Interpretable Reward Redistribution in Reinforcement Learning: A Causal Approach (NeurIPS 2023)
☆10 · Dec 12, 2023 · Updated 2 years ago
dmksjfl / MCQ
Code for Mildly Conservative Q-learning for Offline Reinforcement Learning (NeurIPS 2022)
☆64 · Apr 29, 2024 · Updated 2 years ago
JasonMa2016 / SMODICE
Official repository for paper "Versatile Offline Imitation from Observations and Examples via Regularized State-Occupancy Matching" (ICML…
☆30 · Jan 12, 2023 · Updated 3 years ago
ryanxhr / DWBC
[ICML 2022] The official implementation of DWBC in "Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations"
☆37 · Jan 5, 2023 · Updated 3 years ago
t6-thu / H2Oplus
[ICRA'25] H2O+: An Improved Framework for Hybrid Offline-and-Online RL with Dynamics Gaps
☆13 · Apr 10, 2025 · Updated last year
zhaoyi11 / adaptive_bc
☆15 · Jul 4, 2022 · Updated 4 years ago
sfujim / TD7
Author's PyTorch implementation of TD7 for online and offline RL
☆169 · Sep 12, 2023 · Updated 2 years ago
frt03 / jax_dt
Minimal Decision Transformer Implementation written in Jax (Flax).
☆18 · Aug 8, 2022 · Updated 3 years ago
watchernyu / REDQ
Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.
☆185 · Nov 14, 2024 · Updated last year
dmksjfl / SEABO
Official code for ICLR 2024 paper, SEABO: A Simple Search-Based Method for Offline Imitation Learning
☆12 · Jan 19, 2024 · Updated 2 years ago
gwthomas / IQL-PyTorch
A PyTorch implementation of Implicit Q-Learning
☆99 · Oct 23, 2021 · Updated 4 years ago
marcbrittain / Prioritized-Sequence-Experience-Replay
Prioritized Sequence Experience Replay
☆10 · Aug 16, 2021 · Updated 4 years ago
tinkoff-ai / CORL
High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC…
☆1,370 · Aug 3, 2023 · Updated 2 years ago
young-geng / CQL
Conservative Q Learning on top of SAC
☆142 · Oct 15, 2022 · Updated 3 years ago
AIDefender / MyDiscor
Unofficial Code for NeurIPS 2021 paper "Regret Minimization Experience Replay in Off-policy Reinforcement Learning"
☆14 · May 24, 2021 · Updated 5 years ago
sa-and / MCD
☆12 · Mar 21, 2024 · Updated 2 years ago
YangRui2015 / RORL
Code for NeurIPS 2022 paper "Robust offline Reinforcement Learning via Conservative Smoothing"
☆24 · Feb 15, 2023 · Updated 3 years ago
holarissun / RewardShifting
Code for NeurIPS 2022 paper Exploiting Reward Shifting in Value-Based Deep RL
☆29 · Oct 29, 2023 · Updated 2 years ago
microsoft / ATAC
Code accompanying the paper Adversarially Trained Actor Critic for Offline Reinforcement Learning by Ching-An Cheng*, Tengyang Xie*, Nan …
☆74 · Feb 2, 2023 · Updated 3 years ago
TToTMooN / paco-mtrl
☆30 · Jul 12, 2023 · Updated 3 years ago
multidqn / deep-q-trading
☆31 · Jul 16, 2020 · Updated 6 years ago
OscarHuangWind / Preference-Guided-DQN-Atari
[TNNLS] PGDQN: A generalized and efficient preference-guided epsilon-greedy policy equipped DQN for Atari and Autonomous Driving
☆11 · Oct 9, 2023 · Updated 2 years ago
YangRui2015 / AWGCSL
Code for ICLR 2022 paper Rethinking Goal-Conditioned Supervised Learning and Its Connection to Offline RL.
☆27 · Feb 21, 2022 · Updated 4 years ago
haje01 / distper
Distributed Prioritized Experience Replay
☆10 · Aug 8, 2018 · Updated 7 years ago