aviralkumar2907/BEAR

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/aviralkumar2907/BEAR)

aviralkumar2907 / BEAR

Code for Stabilizing Off-Policy RL via Bootstrapping Error Reduction

☆164

Alternatives and similar repositories for BEAR

Users that are interested in BEAR are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

sfujim / BCQ
View on GitHub
Author's PyTorch implementation of BCQ for continuous and discrete actions
☆667Apr 6, 2021Updated 5 years ago
aviralkumar2907 / CQL
View on GitHub
Code for conservative Q-learning
☆486Dec 7, 2021Updated 4 years ago
Farama-Foundation / D4RL-Evaluations
View on GitHub
☆203Mar 25, 2023Updated 3 years ago
RuohanW / RED
View on GitHub
Implementation of Random Expert Distillation
☆29May 11, 2019Updated 7 years ago
xbpeng / awr
View on GitHub
Implementation of advantage-weighted regression.
☆211May 30, 2020Updated 6 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
tianheyu927 / mopo
View on GitHub
Code for MOPO: Model-based Offline Policy Optimization
☆191May 17, 2022Updated 4 years ago
google-research / batch_rl
View on GitHub
Offline Reinforcement Learning (aka Batch Reinforcement Learning) on Atari 2600 games
☆560Jun 26, 2023Updated 3 years ago
Farama-Foundation / D4RL
View on GitHub
A collection of reference environments for offline reinforcement learning
☆1,694Nov 18, 2024Updated last year
ryanxhr / DWBC
View on GitHub
[ICML 2022] The official implementation of DWBC in "Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations"
☆36Jan 5, 2023Updated 3 years ago
sfujim / TD3_BC
View on GitHub
Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL
☆410Dec 18, 2021Updated 4 years ago
rail-berkeley / rlkit
View on GitHub
Collection of reinforcement learning algorithms
☆2,922Jun 17, 2024Updated 2 years ago
jannerm / mbpo
View on GitHub
Code for the paper "When to Trust Your Model: Model-Based Policy Optimization"
☆558Nov 22, 2022Updated 3 years ago
snu-mllab / EDAC
View on GitHub
Official PyTorch implementation of "Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble" (NeurIPS'21)
☆80Aug 14, 2022Updated 3 years ago
roosephu / slbo
View on GitHub
Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees
☆55Jul 26, 2019Updated 6 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
illidanlab / rpg
View on GitHub
Ranking Policy Gradient
☆23Nov 27, 2019Updated 6 years ago
secury / optidice
View on GitHub
OptiDICE: Offline Policy Optimization via Stationary Distribution Correction Estimation
☆16Aug 3, 2023Updated 2 years ago
boschresearch / DD_OPG
View on GitHub
Implementation prototype of the Deep Deterministic Off-Policy Gradient (DD-OPG) method.
☆11Jun 12, 2019Updated 7 years ago
ryanxhr / BEAR
View on GitHub
Pytorch implementation of BEAR in "Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction"
☆11Oct 29, 2019Updated 6 years ago
hanjuku-kaso / awesome-offline-rl
View on GitHub
An index of algorithms for offline reinforcement learning (offline-rl)
☆1,073May 23, 2024Updated 2 years ago
haarnoja / softqlearning
View on GitHub
Reinforcement Learning with Deep Energy-Based Policies
☆438Nov 28, 2023Updated 2 years ago
pokaxpoka / sunrise
View on GitHub
SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep Reinforcement Learning
☆131Mar 21, 2021Updated 5 years ago
suyoung-lee / Episodic-Backward-Update
View on GitHub
Implementation of "Sample-Efficient Deep Reinforcement Learning via Episodic Backward Update", NeurIPS 2019.
☆16Sep 24, 2019Updated 6 years ago
MishaLaskin / curl
View on GitHub
CURL: Contrastive Unsupervised Representation Learning for Sample-Efficient Reinforcement Learning
☆605Oct 28, 2020Updated 5 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
DesikRengarajan / LOGO
View on GitHub
[ICLR 2022 Spotlight] Code for Reinforcement Learning with Sparse Rewards using Guidance from Offline Demonstration
☆28Feb 10, 2022Updated 4 years ago
sfujim / TD3
View on GitHub
Author's PyTorch implementation of TD3 for OpenAI gym tasks
☆2,096Jul 14, 2023Updated 3 years ago
uber-research / D3G
View on GitHub
Estimating Q(s,s') with Deep Deterministic Dynamics Gradients
☆32Feb 21, 2020Updated 6 years ago
takuseno / d3rlpy
View on GitHub
An offline deep reinforcement learning library
☆1,675Sep 10, 2025Updated 10 months ago
davidbrandfonbrener / onestep-rl
View on GitHub
☆44Sep 19, 2021Updated 4 years ago
young-geng / CQL
View on GitHub
Conservative Q Learning on top of SAC
☆141Oct 15, 2022Updated 3 years ago
yhyu13 / C51-DDPG
View on GitHub
This is a TensorFlow implementation of DeepMind's A Distributional Perspective on Reinforcement Learning.(C51-DDPG)
☆11Sep 14, 2017Updated 8 years ago
ermongroup / MetaIRL
View on GitHub
Meta-Inverse Reinforcement Learning with Probabilistic Context Variables
☆77Mar 16, 2023Updated 3 years ago
ryanxhr / POR
View on GitHub
[NeurIPS 2022 Oral] The official implementation of POR in "A Policy-Guided Imitation Approach for Offline Reinforcement Learning"
☆58Apr 6, 2023Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
rail-berkeley / softlearning
View on GitHub
Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official imp…
☆1,434Nov 29, 2023Updated 2 years ago
google-research / realworldrl_suite
View on GitHub
Real-World RL Benchmark Suite
☆365Aug 11, 2020Updated 5 years ago
Ji4chenLi / Multi-Task-Batch-RL
View on GitHub
☆26Mar 16, 2023Updated 3 years ago
RLAgent / state-marginal-matching
View on GitHub
Efficient Exploration via State Marginal Matching (2019)
☆70Jun 30, 2019Updated 7 years ago
toshikwa / discor.pytorch
View on GitHub
PyTorch implementation of Distribution Correction(DisCor) based on Soft Actor-Critic.
☆37Jun 22, 2022Updated 4 years ago
junhyukoh / self-imitation-learning
View on GitHub
ICML 2018 Self-Imitation Learning
☆277Apr 18, 2020Updated 6 years ago
WilsonWangTHU / mbbl
View on GitHub
☆399Jul 18, 2019Updated 7 years ago