tesslerc/GAC

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/tesslerc/GAC)

tesslerc / GAC

Code accompanying NeurIPS 2019 paper: "Distributional Policy Optimization - An Alternative Approach for Continuous Control"

☆22

Alternatives and similar repositories for GAC

Users that are interested in GAC are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

RajGhugare19 / VE-principle-for-model-based-RL
View on GitHub
Repository for ML Reproducibility Challenge 2020 for the Neurips paper, "The Value Equivalence Principle for Model-Based Reinforcement Le…
☆18Apr 13, 2021Updated 5 years ago
PhilippeMorere / EMU-Q
View on GitHub
Exploring by Minimizing Uncertainty of Q values (EMU-Q) as presented in "Bayesian RL for Goal-Only Rewards" at CoRL'18.
☆10Nov 8, 2018Updated 7 years ago
ReedZyd / GenerativeReturnDecomposition
View on GitHub
Source code for Interpretable Reward Redistribution in Reinforcement Learning: A Causal Approach (NeurIPS 2023)
☆10Dec 12, 2023Updated 2 years ago
microsoft / oac-explore
View on GitHub
Code accompanying the paper "Better Exploration with Optimistic Actor Critic" (NeurIPS 2019)
☆70Aug 11, 2023Updated 2 years ago
marcbrittain / Prioritized-Sequence-Experience-Replay
View on GitHub
Prioritized Sequence Experience Replay
☆10Aug 16, 2021Updated 4 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
ming93 / Safe_reinforcement_learning
View on GitHub
Convergent Policy Optimization for Safe Reinforcement Learning
☆11Oct 26, 2019Updated 6 years ago
chenhongge / StateAdvDRL
View on GitHub
[NeurIPS 2020, Spotlight] Code for "Robust Deep Reinforcement Learning against Adversarial Perturbations on Observations"
☆145Nov 16, 2021Updated 4 years ago
jparkerholder / ASEBO
View on GitHub
Code to run the ASEBO algorithm from the paper: From Complexity to Simplicity: Adaptive ES-Active Subspaces for Blackbox Optimization... …
☆16Oct 14, 2020Updated 5 years ago
boschresearch / DD_OPG
View on GitHub
Implementation prototype of the Deep Deterministic Off-Policy Gradient (DD-OPG) method.
☆11Jun 12, 2019Updated 7 years ago
manantomar / Mirror-Descent-Policy-Optimization
View on GitHub
Mirror Descent Policy Optimization
☆43Oct 31, 2020Updated 5 years ago
uncharted-technologies / robust-domain-randomization
View on GitHub
Code associated with our paper "Robust Domain Randomization for Reinforcement Learning"
☆12Nov 22, 2022Updated 3 years ago
lns / dapo
View on GitHub
Source code for the paper "Divergence-Augmented Policy Optimization"
☆37Nov 28, 2019Updated 6 years ago
wangyuhuix / TRGPPO
View on GitHub
☆34Nov 21, 2022Updated 3 years ago
YangRui2015 / AWGCSL
View on GitHub
Code for ICLR 2022 paper Rethinking Goal-Conditioned Supervised Learning and Its Connection to Offline RL.
☆27Feb 21, 2022Updated 4 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
dspub99 / betazero
View on GitHub
Tabula Rasa Tic-Tac-Toe
☆10Jan 3, 2019Updated 7 years ago
Stilwell-Git / Randomized-Return-Decomposition
View on GitHub
TensorFlow implementation for our paper "Learning Long-Term Reward Redistribution via Randomized Return Decomposition"
☆19Mar 17, 2022Updated 4 years ago
AlirezaMorsali / MLP-Attention
View on GitHub
☆17Dec 19, 2024Updated last year
tesslerc / ActionRobustRL
View on GitHub
Code accompanying the paper "Action Robust Reinforcement Learning and Applications in Continuous Control" https://arxiv.org/abs/1901.0918…
☆48Apr 14, 2019Updated 7 years ago
roosephu / boots
View on GitHub
☆11Oct 14, 2019Updated 6 years ago
zwfightzw / Meta-Critic
View on GitHub
☆11Oct 19, 2020Updated 5 years ago
dannysdeng / dqn-pytorch
View on GitHub
PyTorch - Implicit Quantile Networks - Quantile Regression - C51
☆22Jul 26, 2019Updated 6 years ago
IndustAI / risk-and-uncertainty
View on GitHub
Code associated with our paper "Estimating Risk and Uncertainty in Reinforcement Learning"
☆11Oct 3, 2023Updated 2 years ago
YangRui2015 / Modular_HER
View on GitHub
Modular-HER is revised from OpenAI baselines and supports many improvements for Hindsight Experience Replay as modules.
☆17Jun 23, 2021Updated 5 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
CDMCH / gym-fetch-stack
View on GitHub
☆24Aug 9, 2022Updated 3 years ago
pairlab / d2rl
View on GitHub
Code for the paper "D2RL: Deep Dense Architectures for Reinforcement Learning"
☆40Jan 22, 2021Updated 5 years ago
jparkerholder / DvD_ES
View on GitHub
Code from the paper "Effective Diversity in Population Based Reinforcement Learning", presented as a spotlight at NeurIPS 2020. This is t…
☆46Oct 29, 2020Updated 5 years ago
rllab-snu / Spectral-Risk-Constrained-RL
View on GitHub
Official Github Repository for "Spectral-Risk Safe Reinforcement Learning with Convergence Guarantees". (NeurIPS 2024)
☆11Nov 30, 2025Updated 7 months ago
tedmoskovitz / TOP
View on GitHub
Implementation of Tactical Optimistic and Pessimistic value estimation
☆25Jul 18, 2023Updated 3 years ago
tesatory / hsp
View on GitHub
Hierarchical Self-Play
☆21Dec 5, 2018Updated 7 years ago
tmoer / a0c
View on GitHub
Code for the paper Alpha Zero in Continuous Action Space (A0C) (https://arxiv.org/pdf/1805.09613.pdf)
☆15Jan 19, 2021Updated 5 years ago
kindredresearch / arp
View on GitHub
Autoregressive policies for continuous control reinforcement learning
☆33May 15, 2019Updated 7 years ago
erwincoumans / ARS
View on GitHub
An implementation of the Augmented Random Search algorithm
☆14Jan 29, 2022Updated 4 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
ac-93 / soft-actor-critic
View on GitHub
Modified versions of the SAC algorithm from spinningup for discrete action spaces and image observations.
☆99Jun 22, 2020Updated 6 years ago
nuria95 / O-RAAC
View on GitHub
Offline Risk-Averse Actor-Critic (O-RAAC). A model-free RL algorithm for risk-averse RL in a fully offline setting
☆35Feb 9, 2021Updated 5 years ago
mansimov / acktr
View on GitHub
☆17Sep 15, 2017Updated 8 years ago
bonniesjli / DQN_SR
View on GitHub
Count based exploration with the successor representation for Unity ML's Pyramid
☆12Jun 19, 2019Updated 7 years ago
AnujMahajanOxf / VIREL
View on GitHub
Code for VIREL: A Variational Inference Framework for Reinforcement Learning
☆14Dec 1, 2019Updated 6 years ago
resibots / kaushik_2018_multi-dex
View on GitHub
Source code for "Multi-objective Model-based Policy Search for Data-efficient Learning with Sparse Rewards" (CoRL 2018)
☆13Oct 8, 2018Updated 7 years ago
ravi-lanka-4 / CoPiEr
View on GitHub
Co-training for Policy Learning
☆13Aug 8, 2019Updated 6 years ago