xbpeng/awr

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/xbpeng/awr)

xbpeng / awr

Implementation of advantage-weighted regression.

☆211

Alternatives and similar repositories for awr

Users that are interested in awr are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

aviralkumar2907 / BEAR
View on GitHub
Code for Stabilizing Off-Policy RL via Bootstrapping Error Reduction
☆164Jul 17, 2020Updated 6 years ago
UBCMOCCA / mocca_envs
View on GitHub
☆18Dec 3, 2020Updated 5 years ago
aviralkumar2907 / CQL
View on GitHub
Code for conservative Q-learning
☆486Dec 7, 2021Updated 4 years ago
sfujim / BCQ
View on GitHub
Author's PyTorch implementation of BCQ for continuous and discrete actions
☆667Apr 6, 2021Updated 5 years ago
Farama-Foundation / D4RL-Evaluations
View on GitHub
☆203Mar 25, 2023Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
tianheyu927 / mopo
View on GitHub
Code for MOPO: Model-based Offline Policy Optimization
☆191May 17, 2022Updated 4 years ago
google-research / batch_rl
View on GitHub
Offline Reinforcement Learning (aka Batch Reinforcement Learning) on Atari 2600 games
☆560Jun 26, 2023Updated 3 years ago
ikostrikov / implicit_q_learning
View on GitHub
☆330Jan 23, 2022Updated 4 years ago
pokaxpoka / sunrise
View on GitHub
SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep Reinforcement Learning
☆131Mar 21, 2021Updated 5 years ago
illidanlab / rpg
View on GitHub
Ranking Policy Gradient
☆23Nov 27, 2019Updated 6 years ago
sfujim / TD3_BC
View on GitHub
Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL
☆410Dec 18, 2021Updated 4 years ago
vub-ai-lab / bdpi
View on GitHub
Sample-Efficient Reinforcement Learning with Bootstrapped Dual Policy Iteration
☆25Sep 9, 2019Updated 6 years ago
justinjfu / diagnosing_qlearning
View on GitHub
Code for Diagnosing Bottlenecks in Deep Q-learning. Contains implementations of tabular environments plus solvers.
☆17May 14, 2019Updated 7 years ago
jannerm / mbpo
View on GitHub
Code for the paper "When to Trust Your Model: Model-Based Policy Optimization"
☆558Nov 22, 2022Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
Farama-Foundation / D4RL
View on GitHub
A collection of reference environments for offline reinforcement learning
☆1,694Nov 18, 2024Updated last year
WilsonWangTHU / mbbl
View on GitHub
☆399Jul 18, 2019Updated 7 years ago
vikashplus / unitree_sim
View on GitHub
MuJoCo models for Unitree Robots
☆12Nov 24, 2021Updated 4 years ago
davidbrandfonbrener / onestep-rl
View on GitHub
☆44Sep 19, 2021Updated 4 years ago
dnishio / DSAC
View on GitHub
The implementation of Discriminator Soft Actor Critic
☆15Jan 25, 2020Updated 6 years ago
Wenxuan-Zhou / PLAS
View on GitHub
Code for Latent Action Space for Offline Reinforcement Learning [CoRL 2020]
☆54Oct 18, 2021Updated 4 years ago
google-research / dice_rl
View on GitHub
☆114Jul 3, 2026Updated 3 weeks ago
KamyarGh / rl_swiss
View on GitHub
☆66May 25, 2020Updated 6 years ago
NVlabs / sim-parameter-estimation
View on GitHub
The code accompaniment for the CoRL 2020 paper: A User's Guide to Calibrating Robotics Simulators (https://arxiv.org/abs/2011.08985), fro…
☆30Nov 20, 2020Updated 5 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
MishaLaskin / curl
View on GitHub
CURL: Contrastive Unsupervised Representation Learning for Sample-Efficient Reinforcement Learning
☆605Oct 28, 2020Updated 5 years ago
johanobandoc / revisiting_rainbow
View on GitHub
Revisiting Rainbow
☆76Jun 9, 2021Updated 5 years ago
nnaisense / MAX
View on GitHub
Code for reproducing experiments in Model-Based Active Exploration, ICML 2019
☆81Jul 23, 2019Updated 7 years ago
jerrylin1121 / cross_entropy_method
View on GitHub
Implementation of Cross Entropy Method
☆16Oct 1, 2018Updated 7 years ago
kchua / handful-of-trials
View on GitHub
Experiment code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"
☆475Jul 6, 2023Updated 3 years ago
mila-iqia / spr
View on GitHub
Code for "Data-Efficient Reinforcement Learning with Self-Predictive Representations"
☆167Dec 21, 2021Updated 4 years ago
taochenshh / hcp
View on GitHub
(NeurIPS 2018) Hardware Conditioned Policies for Multi-Robot Transfer Learning
☆20Apr 8, 2019Updated 7 years ago
erwincoumans / motion_imitation
View on GitHub
Code accompanying the paper "Learning Agile Robotic Locomotion Skills by Imitating Animals"
☆1,451Mar 24, 2023Updated 3 years ago
uber-research / Evolvability-ES
View on GitHub
☆14Jun 26, 2019Updated 7 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
iexarchos / PolicyTransferKinDRA
View on GitHub
Code for ICRA2021 "Policy Transfer via Kinematic Domain Randomization and Adaptation"
☆12Apr 28, 2021Updated 5 years ago
RuohanW / RED
View on GitHub
Implementation of Random Expert Distillation
☆29May 11, 2019Updated 7 years ago
rail-berkeley / rlkit
View on GitHub
Collection of reinforcement learning algorithms
☆2,922Jun 17, 2024Updated 2 years ago
rail-berkeley / softlearning
View on GitHub
Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official imp…
☆1,434Nov 29, 2023Updated 2 years ago
matsuolab / BREMEN
View on GitHub
Codebase of Deployment-Efficient Reinforcement Learning via Model-Based Offline Optimization (ICLR2021)
☆54Jul 7, 2021Updated 5 years ago
RLAgent / state-marginal-matching
View on GitHub
Efficient Exploration via State Marginal Matching (2019)
☆70Jun 30, 2019Updated 7 years ago
astooke / rlpyt
View on GitHub
Reinforcement Learning in PyTorch
☆2,278Jan 4, 2021Updated 5 years ago