The implementation of Discriminator Soft Actor Critic
☆15Jan 25, 2020Updated 6 years ago
Alternatives and similar repositories for DSAC
Users that are interested in DSAC are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Count based exploration with the successor representation for Unity ML's Pyramid☆12Jun 19, 2019Updated 7 years ago
- A gym game for Contra that for reinforcement learning☆10Oct 18, 2021Updated 4 years ago
- Co-training for Policy Learning☆13Aug 8, 2019Updated 6 years ago
- ICLR Reproducibility Challenge for Discriminator-Actor-Critic☆20Jan 7, 2019Updated 7 years ago
- Code for Optimistic Exploration even with a Pessimistic Initialisation☆14Aug 4, 2020Updated 5 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Implicit Normalizing Flows + Reinforcement Learning☆62May 31, 2019Updated 7 years ago
- Code for reproducing experiments in Model-Based Active Exploration, ICML 2019☆81Jul 23, 2019Updated 6 years ago
- ☆11Oct 19, 2020Updated 5 years ago
- Attentional Mechanism incorporated in Asynchronous Advantage Actor Critic a3c/a2c deep mind☆10Jan 9, 2018Updated 8 years ago
- Mutual Information State Intrinsic Control (ICLR 2021 Spotlight)☆38Mar 1, 2021Updated 5 years ago
- NeurIPS 2019 Paper Implementation☆12Nov 22, 2022Updated 3 years ago
- ☆16Jan 21, 2022Updated 4 years ago
- ☆10Sep 9, 2022Updated 3 years ago
- Author's PyTorch Implementation of Deep Homomorphic Policy Gradient (DHPG) - NeurIPS 2022 and JMLR 2024☆24Apr 8, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Version 3.0.0 Pytorch implementations of DQN, DDQN, DDPG, SAC, Discrete SAC. With more features :)☆12Feb 16, 2023Updated 3 years ago
- ☆26Jun 14, 2022Updated 4 years ago
- Efficiently discovering algorithms via LLMs with evolutionary search and reinforcement learning.☆17Apr 22, 2025Updated last year
- Simulation of car parking in different parking lots using Unity ML-Agents☆13Dec 16, 2023Updated 2 years ago
- Distributed Priortized Experience Replay☆10Aug 8, 2018Updated 7 years ago
- Code for VIREL: A Variational Inference Framework for Reinforcement Learning☆14Dec 1, 2019Updated 6 years ago
- [ICML 2019] TensorFlow Code for Self-Supervised Exploration via Disagreement☆132Jun 11, 2019Updated 7 years ago
- Exploration by Random Network Distillation☆15Dec 30, 2018Updated 7 years ago
- Open source code combining implementations of Upside Down Reinforcement Learning and Reward Conditioned Policies☆19Mar 10, 2021Updated 5 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Model-Free-Episodic-Control implementation.☆17Jun 3, 2019Updated 7 years ago
- ☆26Mar 16, 2023Updated 3 years ago
- Repo to reproduce the First-Explore paper results☆39May 6, 2026Updated last month
- Gamepad API Content Kit☆14Jun 1, 2016Updated 10 years ago
- Pytorch implementation of Planar Flow☆18Dec 2, 2019Updated 6 years ago
- Code to reproduce Supervised Policy Update (ICLR 2019)☆17Dec 8, 2022Updated 3 years ago
- Learning Inverse Kinematics of a Barret WAM Robotic arm in Gazebo simulation☆11Jun 7, 2018Updated 8 years ago
- Collection of reinforcement learning algorithms☆16Sep 29, 2025Updated 9 months ago
- Related papers for offline reforcement learning (we mainly focus on representation and sequence modeling and conventional offline RL)☆18Apr 21, 2022Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Code for Sibling Rivalry and experiments presented in associated paper☆18May 1, 2025Updated last year
- Easy MDPs and grid worlds with accessible transition dynamics to do exact calculations☆49Apr 1, 2022Updated 4 years ago
- Code for abstracting, evaluating, and visualizing Markov Decision Processes.☆10Jan 12, 2017Updated 9 years ago
- Ranking Policy Gradient☆23Nov 27, 2019Updated 6 years ago
- Imitation Learning with the INTERACTION Dataset☆37Apr 1, 2024Updated 2 years ago
- Code for Latent Action Space for Offline Reinforcement Learning [CoRL 2020]☆54Oct 18, 2021Updated 4 years ago
- Pytorch implementation of "Maximum a Posteriori Policy Optimization" with Retrace for Discrete gym environments☆29Sep 10, 2020Updated 5 years ago