Dragon-Zhuang/BPPO

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Dragon-Zhuang/BPPO)

Dragon-Zhuang / BPPO

Author's Pytorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO).

☆95

Alternatives and similar repositories for BPPO

Users that are interested in BPPO are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Improbable-AI / harness-offline-rl
View on GitHub
Official implementation of Harnessing Mixed Offline Reinforcement Learning Datasets via Trajectory Reweighting
☆16Feb 14, 2024Updated 2 years ago
zhaoyi11 / adaptive_bc
View on GitHub
☆15Jul 4, 2022Updated 4 years ago
ryanxhr / IVR
View on GitHub
[ICLR 2023 Oral] The official implementation of SQL and EQL in "Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Reg…
☆46Jul 27, 2023Updated 3 years ago
dmksjfl / MCQ
View on GitHub
Code for Mildly Conservative Q-learning for Offline Reinforcement Learning (NeurIPS 2022)
☆64Apr 29, 2024Updated 2 years ago
yihaosun1124 / OfflineRL-Kit
View on GitHub
An elegant PyTorch offline reinforcement learning library for researchers.
☆393May 2, 2026Updated 2 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Facebear-ljx / PROTO
View on GitHub
☆17May 25, 2023Updated 3 years ago
YangRui2015 / RORL
View on GitHub
Code for NeurIPS 2022 paper "Robust offline Reinforcement Learning via Conservative Smoothing"
☆24Feb 15, 2023Updated 3 years ago
liuzuxin / OSRL
View on GitHub
🤖 Elegant implementations of offline safe RL algorithms in PyTorch
☆246Sep 13, 2024Updated last year
Lei-Kun / Uni-O4
View on GitHub
Author's Pytorch implementation of our ICLR 2024 paper "Uni-O4"
☆82Jan 15, 2025Updated last year
LAMDA-RL / PRDC
View on GitHub
Author's PyTorch implementation of ICML'23 paper "Policy Regularization with Dataset Constraint for Offline Reinforcement Learning" for D…
☆18Nov 8, 2024Updated last year
Lifelong-ML / offline-compositional-rl-datasets
View on GitHub
☆21Mar 19, 2024Updated 2 years ago
Farama-Foundation / D4RL
View on GitHub
A collection of reference environments for offline reinforcement learning
☆1,694Nov 18, 2024Updated last year
polixir / morec
View on GitHub
☆10Mar 11, 2024Updated 2 years ago
davidbrandfonbrener / onestep-rl
View on GitHub
☆44Sep 19, 2021Updated 4 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
imoneoi / RSP_JAX
View on GitHub
[AAAI'25 Oral] Are Expressive Models Truly Necessary for Offline RL?
☆15Dec 10, 2024Updated last year
microsoft / lightATAC
View on GitHub
A lightweight reimplementation of Adversarially Trained Actor Critic
☆19Mar 19, 2026Updated 4 months ago
hwang-ua / inac_pytorch
View on GitHub
☆20Jun 25, 2023Updated 3 years ago
tinkoff-ai / CORL
View on GitHub
High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC…
☆1,370Aug 3, 2023Updated 2 years ago
sail-sg / OPER
View on GitHub
code for the paper Offline Prioritized Experience Replay
☆12Jun 13, 2023Updated 3 years ago
Dragon-Zhuang / Reinformer
View on GitHub
Official code for ICML 2024 paper Reinformer: Max-Return Sequence Modeling for offline RL
☆49Oct 16, 2024Updated last year
sfujim / TD7
View on GitHub
Author's PyTorch implementation of TD7 for online and offline RL
☆169Sep 12, 2023Updated 2 years ago
thu-ml / CEP-energy-guided-diffusion
View on GitHub
Official codebase for Exact Energy-Guided Diffusion Sampling via Contrastive Energy Prediction
☆35Nov 3, 2023Updated 2 years ago
hari-sikchi / DVL
View on GitHub
A Dual-RL method DVL: Dual-V Learning for offline and online reinforcement learning
☆16Oct 22, 2023Updated 2 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
ChenDRAG / SfBC
View on GitHub
Codes accompanying the paper "Offline Reinforcement Learning via High-Fidelity Generative Behavior Modeling" (ICLR 2023) https://arxiv.or…
☆43Oct 11, 2023Updated 2 years ago
tinkoff-ai / ReBRAC
View on GitHub
Author's implementation of ReBRAC, a minimalist improvement upon TD3+BC
☆63Aug 3, 2023Updated 2 years ago
Howuhh / sac-n-jax
View on GitHub
Single-file SAC-N implementation on jax with flax and equinox. 10x faster than pytorch
☆57May 21, 2023Updated 3 years ago
ryanxhr / POR
View on GitHub
[NeurIPS 2022 Oral] The official implementation of POR in "A Policy-Guided Imitation Approach for Offline Reinforcement Learning"
☆58Apr 6, 2023Updated 3 years ago
aielawady / relic
View on GitHub
☆12Sep 7, 2024Updated last year
pcchenxi / LAPO-offlienRL
View on GitHub
☆16Apr 14, 2026Updated 3 months ago
sfujim / TD3_BC
View on GitHub
Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL
☆410Dec 18, 2021Updated 4 years ago
ZhengYinan-AIR / OMIGA
View on GitHub
[NeurIPS 2023] The official implementation of "Offline Multi-Agent Reinforcement Learning with Implicit Global-to-Local Value Regularizat…
☆44Mar 3, 2024Updated 2 years ago
akiani / rlsepsis234
View on GitHub
CS234 Sepsis Simulator For RL
☆18Dec 8, 2022Updated 3 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
Baichenjia / PBRL
View on GitHub
Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning
☆29Feb 21, 2022Updated 4 years ago
ai4co / devformer
View on GitHub
[ICML 2023] Official code for "DevFormer: A Symmetric Transformer for Context-Aware Device Placement"
☆23Dec 7, 2024Updated last year
sail-sg / offbench
View on GitHub
☆16Jun 1, 2023Updated 3 years ago
conglu1997 / v-d4rl
View on GitHub
Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations
☆115Apr 16, 2026Updated 3 months ago
tinkoff-ai / sac-rnd
View on GitHub
Official implementation for "Anti-Exploration by Random Network Distillation", ICML 2023
☆58Feb 3, 2023Updated 3 years ago
seohongpark / fql
View on GitHub
The official implementation of flow Q-learning (FQL)
☆321Jul 21, 2025Updated last year
alexanderbaumann99 / PPO-Algorithms
View on GitHub
Experiments of the three PPO-Algorithms (PPO, clipped PPO, PPO with KL-penalty) proposed by John Schulman et al. on the 'Cartpole-v1' env…
☆13Nov 14, 2021Updated 4 years ago