ryanxhr/BEAR

PyTorch implementation of BEAR in "Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction"

☆11

Alternatives and similar repositories for BEAR

Users that are interested in BEAR are comparing it to the libraries listed below.

qingyun-wu / NonstationaryBanditLib
☆15 Jan 20, 2020 Updated 6 years ago
lyingCS / Controllable-Multi-Objective-Reranking
Controllable Multi-Objective Re-ranking with Policy Hypernetworks (KDD 2023)
☆40 Oct 6, 2024 Updated last year
ryoungj / ZO-L2L
[ICLR'20] Learning to Learn by Zeroth-Order Oracle
☆14 Feb 7, 2020 Updated 6 years ago
quanvuong / Supervised_Policy_Update
Code to reproduce Supervised Policy Update (ICLR 2019)
☆17 Dec 8, 2022 Updated 3 years ago
illidanlab / rpg
Ranking Policy Gradient
☆23 Nov 27, 2019 Updated 6 years ago
ying-wen / rlchina_pbl
☆10 Aug 18, 2022 Updated 3 years ago
microsoft / smart
Codebase for ICLR 2023 paper, "SMART: Self-supervised Multi-task pretrAining with contRol Transformers"
☆54 Jan 26, 2024 Updated 2 years ago
brown-palm / GCPC
Code for "Goal-Conditioned Predictive Coding for Offline Reinforcement Learning" (NeurIPS 2023)
☆14 Dec 8, 2023 Updated 2 years ago
Snnzhao / DAHCR
This is the official implementation for IJCAI 2023 Paper: Towards Hierarchical Policy Learning for Conversational Recommendation with Hyp…
☆12 Sep 19, 2023 Updated 2 years ago
Stilwell-Git / Hindsight-Goal-Generation
TensorFlow implementation for our paper "Exploration via Hindsight Goal Generation"
☆23 Mar 11, 2022 Updated 4 years ago
E-qin / GEAR
Open-source code for GEAR
☆16 Dec 3, 2025 Updated 7 months ago
jvmncs / ParamNoise
A comparison of parameter space noise methods for exploration in deep reinforcement learning
☆30 Mar 14, 2019 Updated 7 years ago
LucasAlegre / mbcd
Code for the paper "Minimum-Delay Adaptation in Non-Stationary Reinforcement Learning via Online High-Confidence Change-Point Detection"
☆11 Aug 7, 2023 Updated 2 years ago
JacksonWuxs / 19nCoV-SEIR-Estimation
An adjustive SEIR model to estimate parameters of 2019-nCoV
☆19 Jun 22, 2022 Updated 4 years ago
roosephu / slbo
Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees
☆55 Jul 26, 2019 Updated 6 years ago
YyzHarry / SV-RL
[ICLR 2020, Oral] Harnessing Structures for Value-Based Planning and Reinforcement Learning
☆34 Feb 1, 2020 Updated 6 years ago
WangXFng / NFARec
[SIGIR 2024] NFARec: A Negative Feedback-Aware Recommender Model.
☆13 Jan 9, 2025 Updated last year
tsinghua-fib-lab / WTG-DVR
The official implementation of "DVR: Micro-Video Recommendation Optimizing Watch-Time-Gain under Duration Bias" (MM '22)
☆18 Oct 15, 2022 Updated 3 years ago
KMarino / hrl-ep3
Code for our paper: Hierarchical RL Using an Ensemble of Proprioceptive Periodic Policies
☆15 Feb 21, 2019 Updated 7 years ago
rems75 / SPIBB-DQN
Code for SPIBB-DQN and Soft-SPIBB-DQN
☆11 May 5, 2020 Updated 6 years ago
nigelyaoj / Quality-Similar-Diversity
Official Implementation for Quality-Similar Diversity via Population Based Reinforcement Learning
☆19 Dec 26, 2025 Updated 6 months ago
Hyeokreal / ali_bigan_mnist_pytorch
☆10 Aug 8, 2017 Updated 8 years ago
asonabend / ESRL
Code for Expert Supervised Reinforcement Learning
☆10 Apr 7, 2021 Updated 5 years ago
chenhaokun / TPGR
Python implementation of the TPGR
☆40 Mar 27, 2019 Updated 7 years ago
ChenDRAG / SfBC
Codes accompanying the paper "Offline Reinforcement Learning via High-Fidelity Generative Behavior Modeling" (ICLR 2023) https://arxiv.or…
☆42 Oct 11, 2023 Updated 2 years ago
mjamroz / PlantRecognition
Example of an Android app written in Qt/QML which uses MXNet for plant image recognition.
☆10 Nov 4, 2017 Updated 8 years ago
AurelianTactics / bcq_tensorflow
☆15 May 24, 2021 Updated 5 years ago
maxreciprocate / offline
Offline RL experiments
☆15 Oct 1, 2022 Updated 3 years ago
aviralkumar2907 / BEAR
Code for Stabilizing Off-Policy RL via Bootstrapping Error Reduction
☆164 Jul 17, 2020 Updated 6 years ago
gokererdogan / Notebooks
IPython Notebooks on various things
☆14 Dec 4, 2017 Updated 8 years ago
shidilrzf / Anti-exploration-RL
Anti exploration in offline reinforcement learning
☆11 May 17, 2021 Updated 5 years ago
czxttkl / DraftArtist
☆11 Oct 19, 2018 Updated 7 years ago
hautahi / IM_RIS
Source code for blog post at https://hautahi.com/im_ris
☆12 Jan 8, 2019 Updated 7 years ago
Ktakuya332C / deepcube
An implementation of the paper "Solving the Rubik's Cube without Human Knowledge"
☆14 Dec 9, 2018 Updated 7 years ago
bhairavmehta95 / ant-env
Ant Gather and Ant Maze envs, separated from RLLab
☆11 Aug 2, 2018 Updated 7 years ago
HarleyCoops / smolThinker-.5B
A Qwen .5B reasoning model trained on OpenR1-Math-220k
☆14 Updated this week
martius-lab / learningwithmuscles
Repo for the paper: Learning with Muscles: Benefits for Data-Efficiency and Robustness in Anthropomorphic Tasks. https://al.is.mpg.de/pub…
☆16 Dec 1, 2022 Updated 3 years ago
Xiaoyinggit / ConUCB
☆11 Aug 10, 2020 Updated 5 years ago
ramp-kits / rl_simulator
Model-based reinforcement learning (generative simulator models and planning agents)
☆16 Mar 13, 2026 Updated 4 months ago