xtma/dsac

Distributional Soft Actor Critic

☆62

Alternatives and similar repositories for dsac

Users who are interested in dsac are comparing it to the repositories listed below.

BY571 / D4PG
PyTorch implementation of D4PG with the SOTA IQN Critic instead of C51. Implementation includes also the extensions Munchausen RL and D2R…
☆24 • Apr 7, 2021 • Updated 5 years ago

BY571 / IQN-and-Extensions
PyTorch Implementation of Implicit Quantile Networks (IQN) for Distributional Reinforcement Learning with additional extensions like PER,…
☆94 • Mar 4, 2023 • Updated 3 years ago

BY571 / FQF-and-Extensions
PyTorch implementation of the state-of-the-art distributional reinforcement learning algorithm Fully Parameterized Quantile Function (FQF…
☆34 • Oct 10, 2020 • Updated 5 years ago

toshikwa / fqf-iqn-qrdqn.pytorch
PyTorch implementation of FQF, IQN and QR-DQN.
☆191 • Jul 25, 2024 • Updated last year

nuria95 / O-RAAC
Offline Risk-Averse Actor-Critic (O-RAAC). A model-free RL algorithm for risk-averse RL in a fully offline setting
☆35 • Feb 9, 2021 • Updated 5 years ago

joeybose / FloRL
Implicit Normalizing Flows + Reinforcement Learning
☆62 • May 31, 2019 • Updated 6 years ago

zhougroup / IDAC
Implicit Distributional Actor Critic
☆11 • Dec 8, 2021 • Updated 4 years ago

deligentfool / dqn_zoo
The implement of all kinds of dqn reinforcement learning with Pytorch
☆97 • Mar 25, 2021 • Updated 5 years ago

xtma / apo
Average-Reward Reinforcement Learning with Trust Region Methods
☆11 • Oct 17, 2022 • Updated 3 years ago

deligentfool / SIDE
Codes for the paper "SIDE: State Inference for Partially Observable Cooperative Multi-Agent Reinforcement Learning"
☆11 • Jun 24, 2022 • Updated 3 years ago

tpet / rpz_planning
Gradient-based planning on RPZ subspace.
☆10 • Jun 15, 2023 • Updated 2 years ago

ThibautTheate / Unconstrained-Monotonic-Deep-Q-Network-algorithm
Official implementation of the UMDQN algorithm presented in the scientific research paper entitled "Distributional Reinforcement Learning…
☆11 • Jun 3, 2022 • Updated 3 years ago

deligentfool / HAVEN
Codes for the paper "HAVEN: Hierarchical Cooperative Multi-Agent Reinforcement Learning with Dual Coordination Mechanism"
☆27 • Oct 22, 2022 • Updated 3 years ago

mossr / CrossEntropyVariants.jl
Cross-entropy method variants for optimization in Julia
☆12 • Apr 29, 2021 • Updated 5 years ago

Jingliang-Duan / DSAC-v2
DSAC-v2; DSAC-T; DASC; Distributional Soft Actor-Critic
☆444 • Dec 1, 2025 • Updated 5 months ago

huanzhang12 / SA_PPO
[NeurIPS 2020 Spotlight] State-adversarial PPO for robust deep reinforcement learning
☆31 • Nov 18, 2021 • Updated 4 years ago

vluzko / dac-iclr-reproducibility
ICLR Reproducibility Challenge for Discriminator-Actor-Critic
☆20 • Jan 7, 2019 • Updated 7 years ago

carljohanhoel / EnsembleQuantileNetworks
☆25 • Jan 13, 2022 • Updated 4 years ago

boschresearch / DD_OPG
Implementation prototype of the Deep Deterministic Off-Policy Gradient (DD-OPG) method.
☆11 • Jun 12, 2019 • Updated 6 years ago

simsimiSION / pymarl-algorithm-extension-via-starcraft
☆13 • Aug 15, 2020 • Updated 5 years ago

jesbu1 / carl
Github Repo for CARL: Cautious Adaptation for RL in Safety Critical Settings
☆14 • Nov 22, 2022 • Updated 3 years ago

tgangwani / SelfImitationDiverse
Tensorflow code for "Learning Self-Imitating Diverse Policies" (ICLR 2019)
☆20 • Nov 26, 2020 • Updated 5 years ago

microsoft / oac-explore
Code accompanying the paper "Better Exploration with Optimistic Actor Critic" (NeurIPS 2019)
☆70 • Aug 11, 2023 • Updated 2 years ago

ido90 / CeSoR
☆19 • Nov 22, 2023 • Updated 2 years ago

dyne-submission / dynamics-aware-embeddings
☆16 • Sep 25, 2019 • Updated 6 years ago

polixir / NeoRL2
☆19 • Oct 27, 2025 • Updated 6 months ago

eugenevinitsky / robust_RL_multi_adversary
We investigate the effect of populations on finding good solutions to the robust MDP
☆29 • Mar 27, 2021 • Updated 5 years ago

BorealisAI / mtmfrl
Multi Type Mean Field Reinforcement Learning
☆31 • Jun 13, 2022 • Updated 3 years ago

TTomilin / COOM
COOM: Benchmarking Continual Reinforcement Learning on Doom
☆25 • Mar 5, 2026 • Updated 2 months ago

laonahongchen / Bilevel-Optimization-in-Coordination-Game
code implementation for 'Bi-level Actor-Critic for Multi-agent Coordination'(AAAI2020)
☆61 • Jun 17, 2020 • Updated 5 years ago

Cranial-XIX / marl-copa
PyTorch Implementation of COPA for coordinating teams that can dynamically change.
☆23 • Apr 16, 2022 • Updated 4 years ago

nnaisense / MAX
Code for reproducing experiments in Model-Based Active Exploration, ICML 2019
☆81 • Jul 23, 2019 • Updated 6 years ago

instance01 / fish-rl-alife
Running RL algorithms on the fish/shark aquarium environment to find unexpected biological insights.
☆10 • Nov 30, 2021 • Updated 4 years ago

StanfordASL / RSIRL
Risk-sensitive Inverse Reinforcement Learning
☆11 • Sep 11, 2019 • Updated 6 years ago

YiqinYang / VEM
Codes accompanying the paper "Offline Reinforcement Learning with Value-Based Episodic Memory" (ICLR 2022 https://arxiv.org/abs/2110.0979…
☆15 • Mar 9, 2022 • Updated 4 years ago

urosolia / MOMDP
solver for discrete Mixed Observable Markov Decision Processes
☆11 • Oct 30, 2020 • Updated 5 years ago

dnishio / DSAC
The implementation of Discriminator Soft Actor Critic
☆15 • Jan 25, 2020 • Updated 6 years ago

mxu34 / mbrl-gpmm
☆28 • Jun 23, 2020 • Updated 5 years ago

oxwhirl / opiq
Code for Optimistic Exploration even with a Pessimistic Initialisation
☆14 • Aug 4, 2020 • Updated 5 years ago