Distributional Soft Actor Critic
☆62Jun 6, 2020Updated 5 years ago
Alternatives and similar repositories for dsac
Users that are interested in dsac are comparing it to the libraries listed below.
- PyTorch implementation of D4PG with the SOTA IQN Critic instead of C51. The implementation also includes the extensions Munchausen RL and D2R…☆24Apr 7, 2021Updated 5 years ago
- PyTorch Implementation of Implicit Quantile Networks (IQN) for Distributional Reinforcement Learning with additional extensions like PER,…☆94Mar 4, 2023Updated 3 years ago
- PyTorch implementation of the state-of-the-art distributional reinforcement learning algorithm Fully Parameterized Quantile Function (FQF…☆34Oct 10, 2020Updated 5 years ago
- PyTorch implementation of FQF, IQN and QR-DQN.☆191Jul 25, 2024Updated last year
- Offline Risk-Averse Actor-Critic (O-RAAC). A model-free RL algorithm for risk-averse RL in a fully offline setting☆35Feb 9, 2021Updated 5 years ago
- Implicit Normalizing Flows + Reinforcement Learning☆62May 31, 2019Updated 6 years ago
- Implicit Distributional Actor Critic☆11Dec 8, 2021Updated 4 years ago
- Implementations of various DQN reinforcement learning algorithms in PyTorch☆97Mar 25, 2021Updated 5 years ago
- Average-Reward Reinforcement Learning with Trust Region Methods☆11Oct 17, 2022Updated 3 years ago
- Code for the paper "SIDE: State Inference for Partially Observable Cooperative Multi-Agent Reinforcement Learning"☆11Jun 24, 2022Updated 3 years ago
- Gradient-based planning on RPZ subspace.☆10Jun 15, 2023Updated 2 years ago
- Official implementation of the UMDQN algorithm presented in the scientific research paper entitled "Distributional Reinforcement Learning…☆11Jun 3, 2022Updated 3 years ago
- Code for the paper "HAVEN: Hierarchical Cooperative Multi-Agent Reinforcement Learning with Dual Coordination Mechanism"☆27Oct 22, 2022Updated 3 years ago
- Cross-entropy method variants for optimization in Julia☆12Apr 29, 2021Updated 5 years ago
- DSAC-v2; DSAC-T; DASC; Distributional Soft Actor-Critic☆444Dec 1, 2025Updated 5 months ago
- [NeurIPS 2020 Spotlight] State-adversarial PPO for robust deep reinforcement learning☆31Nov 18, 2021Updated 4 years ago
- ICLR Reproducibility Challenge for Discriminator-Actor-Critic☆20Jan 7, 2019Updated 7 years ago
- ☆25Jan 13, 2022Updated 4 years ago
- Implementation prototype of the Deep Deterministic Off-Policy Gradient (DD-OPG) method.☆11Jun 12, 2019Updated 6 years ago
- ☆13Aug 15, 2020Updated 5 years ago
- GitHub repo for CARL: Cautious Adaptation for RL in Safety Critical Settings☆14Nov 22, 2022Updated 3 years ago
- TensorFlow code for "Learning Self-Imitating Diverse Policies" (ICLR 2019)☆20Nov 26, 2020Updated 5 years ago
- Code accompanying the paper "Better Exploration with Optimistic Actor Critic" (NeurIPS 2019)☆70Aug 11, 2023Updated 2 years ago
- ☆19Nov 22, 2023Updated 2 years ago
- ☆16Sep 25, 2019Updated 6 years ago
- ☆19Oct 27, 2025Updated 6 months ago
- We investigate the effect of populations on finding good solutions to the robust MDP☆29Mar 27, 2021Updated 5 years ago
- Multi Type Mean Field Reinforcement Learning☆31Jun 13, 2022Updated 3 years ago
- COOM: Benchmarking Continual Reinforcement Learning on Doom☆25Mar 5, 2026Updated 2 months ago
- Code implementation for 'Bi-level Actor-Critic for Multi-agent Coordination' (AAAI 2020)☆61Jun 17, 2020Updated 5 years ago
- PyTorch Implementation of COPA for coordinating teams that can dynamically change.☆23Apr 16, 2022Updated 4 years ago
- Code for reproducing experiments in Model-Based Active Exploration, ICML 2019☆81Jul 23, 2019Updated 6 years ago
- Running RL algorithms on the fish/shark aquarium environment to find unexpected biological insights.☆10Nov 30, 2021Updated 4 years ago
- Risk-sensitive Inverse Reinforcement Learning☆11Sep 11, 2019Updated 6 years ago
- Code accompanying the paper "Offline Reinforcement Learning with Value-Based Episodic Memory" (ICLR 2022 https://arxiv.org/abs/2110.0979…☆15Mar 9, 2022Updated 4 years ago
- Solver for discrete Mixed Observable Markov Decision Processes☆11Oct 30, 2020Updated 5 years ago
- The implementation of Discriminator Soft Actor Critic☆15Jan 25, 2020Updated 6 years ago
- ☆28Jun 23, 2020Updated 5 years ago
- Code for Optimistic Exploration even with a Pessimistic Initialisation☆14Aug 4, 2020Updated 5 years ago