ltlhuuu/A2PR

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ltlhuuu/A2PR)

ltlhuuu / A2PR

[ICML 2024] The offical implementation of A2PR, a simple way to achieve SOTA in offline reinforcement learning with an adaptive advantage-guided policy regularization method, in Pytorch

☆34

Alternatives and similar repositories for A2PR

Users that are interested in A2PR are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

LAMDA-RL / PRDC
View on GitHub
Author's PyTorch implementation of ICML'23 paper "Policy Regularization with Dataset Constraint for Offline Reinforcement Learning" for D…
☆18Nov 8, 2024Updated last year
ltlhuuu / PSEC
View on GitHub
[ICLR 2025] The offical implementation of "PSEC: Skill Expansion and Composition in Parameter Space", a new framework designed to facilit…
☆65Feb 12, 2025Updated last year
Improbable-AI / harness-offline-rl
View on GitHub
Official implementation of Harnessing Mixed Offline Reinforcement Learning Datasets via Trajectory Reweighting
☆16Feb 14, 2024Updated 2 years ago
t6-thu / xTED
View on GitHub
[AAMAS'26] xTED: Cross-Domain Adaptation via Diffusion-Based Trajectory Editing
☆26Jan 8, 2026Updated 6 months ago
t6-thu / H2Oplus
View on GitHub
[ICRA'25] H2O+: An Improved Framework for Hybrid Offline-and-Online RL with Dynamics Gaps
☆13Apr 10, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
imoneoi / RSP_JAX
View on GitHub
[AAAI'25 Oral] Are Expressive Models Truly Necessary for Offline RL?
☆15Dec 10, 2024Updated last year
Facebear-ljx / SBAC
View on GitHub
Facebear's minimal implementation of SBAC (Soft behavior regularized actor critic, NIPS22 offline RL workshop)
☆11Jul 4, 2022Updated 4 years ago
Facebear-ljx / PROTO
View on GitHub
☆17May 25, 2023Updated 3 years ago
THU-AIR-DREAM / D2C
View on GitHub
D2C(Data-driven Control Library) is a library for data-driven control based on reinforcement learning.
☆32Oct 18, 2023Updated 2 years ago
pcheng2 / TSRL
View on GitHub
☆23Nov 3, 2023Updated 2 years ago
sail-sg / OPER
View on GitHub
code for the paper Offline Prioritized Experience Replay
☆12Jun 13, 2023Updated 3 years ago
ryanxhr / IVR
View on GitHub
[ICLR 2023 Oral] The official implementation of SQL and EQL in "Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Reg…
☆46Jul 27, 2023Updated 2 years ago
Facebear-ljx / RGM
View on GitHub
The official implementation of "Mind the Gap: Offline Policy Optimization for Imperfect Rewards" (ICLR2023)
☆16Mar 3, 2023Updated 3 years ago
dmksjfl / MCQ
View on GitHub
Code for Mildly Conservative Q-learning for Offline Reinforcement Learning (NeurIPS 2022)
☆64Apr 29, 2024Updated 2 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
thuml / SPOT
View on GitHub
Code release for "Supported Policy Optimization for Offline Reinforcement Learning" (NeurIPS 2022), https://arxiv.org/abs/2202.06239
☆22Jun 24, 2023Updated 3 years ago
ryanxhr / POR
View on GitHub
[NeurIPS 2022 Oral] The official implementation of POR in "A Policy-Guided Imitation Approach for Offline Reinforcement Learning"
☆58Apr 6, 2023Updated 3 years ago
charleshsc / QT
View on GitHub
ICML'2024: Q-value Regularized Transformer for Offline Reinforcement Learning
☆38Dec 30, 2024Updated last year
shidilrzf / Anti-exploration-RL
View on GitHub
Anti exploration in offline reinforcement learning
☆11May 17, 2021Updated 5 years ago
ruoqizzz / entropy-offlineRL
View on GitHub
code for paper "Entropy-regularized Diffusion Policy with Q-Ensembles for Offline Reinforcement Learning"
☆21Feb 24, 2024Updated 2 years ago
Improbable-AI / dw-offline-rl
View on GitHub
Official implementation of NeurIPS'23 paper, Beyond Uniform Sampling: Offline Reinforcement Learning with Imbalanced Datasets
☆25Jan 29, 2024Updated 2 years ago
ZhengYinan-AIR / FISOR
View on GitHub
[ICLR 2024] The official implementation of "Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion Model"
☆129Feb 11, 2025Updated last year
RoboDD / drone_policy
View on GitHub
Pseudocode of "Champion-Level Drone Racing Using Deep Reinforcement Learning" work
☆18Dec 30, 2023Updated 2 years ago
tung-nd / cwbc
View on GitHub
☆11Oct 3, 2022Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Dstate / LBP
View on GitHub
[ICML 2025] The Official Implementation of "Efficient Robotic Policy Learning via Latent Space Backward Planning"
☆30Dec 15, 2025Updated 7 months ago
t6-thu / awesome-cross-domain-policy-transfer-for-embodied-agents
View on GitHub
[IJCAI'24] An index of algorithms, approaches, and systems on cross-domain policy transfer for embodied agents
☆65Feb 14, 2025Updated last year
YangRui2015 / RORL
View on GitHub
Code for NeurIPS 2022 paper "Robust offline Reinforcement Learning via Conservative Smoothing"
☆24Feb 15, 2023Updated 3 years ago
Baichenjia / PBRL
View on GitHub
Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning
☆29Feb 21, 2022Updated 4 years ago
gwthomas / IQL-PyTorch
View on GitHub
A PyTorch implementation of Implicit Q-Learning
☆99Oct 23, 2021Updated 4 years ago
2toinf / IVM
View on GitHub
[NeurIPS-2024] The offical Implementation of "Instruction-Guided Visual Masking"
☆42Nov 15, 2024Updated last year
thu-ml / CEP-energy-guided-diffusion
View on GitHub
Official codebase for Exact Energy-Guided Diffusion Sampling via Contrastive Energy Prediction
☆35Nov 3, 2023Updated 2 years ago
ZhengYinan-AIR / OMIGA
View on GitHub
[NeurIPS 2023] The official implementation of "Offline Multi-Agent Reinforcement Learning with Implicit Global-to-Local Value Regularizat…
☆44Mar 3, 2024Updated 2 years ago
YiqinYang / VEM
View on GitHub
Codes accompanying the paper "Offline Reinforcement Learning with Value-Based Episodic Memory" (ICLR 2022 https://arxiv.org/abs/2110.0979…
☆15Mar 9, 2022Updated 4 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
pcchenxi / LAPO-offlienRL
View on GitHub
☆16Apr 14, 2026Updated 3 months ago
LAMDA-RL / ImagineBench
View on GitHub
A benchmark for evaluating reinforcement learning algorithms that train the policies using imaginary rollouts from LLMs.
☆15Nov 4, 2025Updated 8 months ago
secury / optidice
View on GitHub
OptiDICE: Offline Policy Optimization via Stationary Distribution Correction Estimation
☆16Aug 3, 2023Updated 2 years ago
imoneoi / onerl
View on GitHub
One RL Platform is all you need -- Event-driven fully distributed reinforcement learning framework
☆21Oct 25, 2023Updated 2 years ago
Facebear-ljx / DOGE
View on GitHub
The official implementation of "When Data Geometry Meets Deep Function: Generalizing Offline Reinforcement Learning" (ICLR2023)
☆44Mar 6, 2023Updated 3 years ago
Div-Infinity / XQL
View on GitHub
Extreme Q-Learning: Max Entropy RL without Entropy
☆88Feb 14, 2023Updated 3 years ago
hwang-ua / inac_pytorch
View on GitHub
☆20Jun 25, 2023Updated 3 years ago