sfujim / BCQ

Author's PyTorch implementation of BCQ for continuous and discrete actions

☆636

Alternatives and similar repositories for BCQ

Users who are interested in BCQ compare it to the repositories listed below.

aviralkumar2907 / CQL
Code for conservative Q-learning
☆450 · Updated 3 years ago
jannerm / mbpo
Code for the paper "When to Trust Your Model: Model-Based Policy Optimization"
☆504 · Updated 2 years ago
google-research / batch_rl
Offline Reinforcement Learning (aka Batch Reinforcement Learning) on Atari 2600 games
☆553 · Updated 2 years ago
denisyarats / pytorch_sac
PyTorch implementation of Soft Actor-Critic (SAC)
☆555 · Updated 3 years ago
nikhilbarhate99 / Hierarchical-Actor-Critic-HAC-PyTorch
PyTorch implementation of Hierarchical Actor Critic (HAC) for OpenAI gym environments
☆318 · Updated 3 years ago
katerakelly / oyster
Implementation of Efficient Off-policy Meta-learning via Probabilistic Context Variables (PEARL)
☆496 · Updated 2 years ago
sfujim / TD3_BC
Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL
☆366 · Updated 3 years ago
toshikwa / sac-discrete.pytorch
PyTorch implementation of SAC-Discrete.
☆307 · Updated last year
rlcode / per
Prioritized Experience Replay (PER) implementation in PyTorch
☆345 · Updated 5 years ago
reinforcement-learning-kr / lets-do-irl
Inverse RL algorithms (APP, MaxEnt, GAIL, VAIL)
☆760 · Updated last year
AboudyKreidieh / h-baselines
A repository of high-performing hierarchical reinforcement learning models and algorithms.
☆316 · Updated 2 years ago
pranz24 / pytorch-soft-actor-critic
PyTorch implementation of soft actor critic
☆897 · Updated 2 weeks ago
ikostrikov / pytorch-trpo
PyTorch implementation of Trust Region Policy Optimization
☆441 · Updated 6 years ago
andrew-j-levy / Hierarchical-Actor-Critc-HAC-
This repository contains the code to implement the Hierarchical Actor-Critic (HAC) algorithm.
☆259 · Updated 5 years ago
mlii / mfrl
Mean Field Multi-Agent Reinforcement Learning
☆397 · Updated 5 years ago
StepNeverStop / RLs
Reinforcement Learning Algorithms Based on PyTorch
☆449 · Updated 3 years ago
zhangchuheng123 / Reinforcement-Implementation
Implementation of benchmark RL algorithms
☆467 · Updated 3 years ago
shariqiqbal2810 / MAAC
Code for "Actor-Attention-Critic for Multi-Agent Reinforcement Learning" ICML 2019
☆759 · Updated 3 years ago
rail-berkeley / softlearning
Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official imp…
☆1,321 · Updated last year
openai / safety-starter-agents
Basic constrained RL agents used in experiments for the "Benchmarking Safe Exploration in Deep Reinforcement Learning" paper.
☆435 · Updated 2 years ago
sisl / MADRL
Repo containing code for multi-agent deep reinforcement learning (MADRL).
☆709 · Updated 2 years ago
TianhongDai / hindsight-experience-replay
PyTorch implementation of Hindsight Experience Replay (HER), with experiments on all Fetch robotic environments.
☆433 · Updated 3 years ago
shariqiqbal2810 / maddpg-pytorch
PyTorch implementation of MADDPG (Lowe et al. 2017)
☆643 · Updated 5 years ago
ChenglongChen / pytorch-DRL
PyTorch implementations of various Deep Reinforcement Learning (DRL) algorithms for both single agent and multi-agent.
☆595 · Updated 7 years ago
keiohta / tf2rl
TensorFlow2 Reinforcement Learning
☆474 · Updated 3 years ago
ShawK91 / Evolutionary-Reinforcement-Learning
Codebase for Evolutionary Reinforcement Learning (ERL) from the paper "Evolution-Guided Policy Gradients in Reinforcement Learning" publi…
☆236 · Updated 4 years ago
Khrylx / PyTorch-RL
PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Lear…
☆1,233 · Updated 4 years ago
MorvanZhou / pytorch-A3C
Simple A3C implementation with pytorch + multiprocessing
☆649 · Updated 2 years ago
polixir / OfflineRL
A collection of offline reinforcement learning algorithms.
☆191 · Updated 8 months ago
toshikwa / fqf-iqn-qrdqn.pytorch
PyTorch implementation of FQF, IQN and QR-DQN.
☆182 · Updated last year