Revisiting Discrete Soft Actor-Critic. Accepted by Transactions on Machine Learning Research (TMLR).
☆27 · Nov 23, 2024 · Updated last year
Alternatives and similar repositories for SD-SAC
Users who are interested in SD-SAC compare it to the libraries listed below.
- Implementation of the Discrete Soft Actor-Critic algorithm with RNN policy in PyTorch ☆26 · Jan 7, 2023 · Updated 3 years ago
- Bayesian Soft Actor Critic ☆16 · Jan 6, 2023 · Updated 3 years ago
- ☆40 · Nov 17, 2021 · Updated 4 years ago
- ☆27 · Jun 6, 2024 · Updated last year
- ☆35 · Aug 17, 2022 · Updated 3 years ago
- ☆34 · Mar 24, 2023 · Updated 2 years ago
- RL Dresden Algorithm Suite ☆33 · Jul 22, 2024 · Updated last year
- ☆17 · Feb 1, 2026 · Updated last month
- Official implementation for the NeurIPS 2023 paper: "Reduced Policy Optimization for Continuous Control with Hard Constraints" ☆45 · Apr 1, 2024 · Updated last year
- Transactive Energy Service System ☆14 · Mar 4, 2023 · Updated 3 years ago
- Implementation of BIMRL: Brain Inspired Meta Reinforcement Learning - Roozbeh Razavi et al. (IROS 2022) ☆10 · Dec 1, 2022 · Updated 3 years ago
- Alpha mining with DEAP-based genetic programming. ☆11 · Jul 7, 2023 · Updated 2 years ago
- factory.ai FACTORY_API_KEY switch and query ☆27 · Dec 6, 2025 · Updated 3 months ago
- ☆11 · Nov 13, 2025 · Updated 3 months ago
- Accepted at WWW 25 Industrial Track (oral) ☆18 · Jun 6, 2025 · Updated 9 months ago
- Source code for "Congestion-aware Distributed Task Offloading in Wireless Multi-hop Networks Using Graph Neural Networks" ☆14 · Oct 23, 2024 · Updated last year
- Code for optimal execution ☆12 · Oct 29, 2020 · Updated 5 years ago
- ☆12 · Mar 14, 2024 · Updated last year
- State of the art time series forecasting method that has the FFORMA ensemble learn from the ESRNN hybrid model and others. ☆13 · Sep 7, 2022 · Updated 3 years ago
- ☆10 · May 5, 2021 · Updated 4 years ago
- ☆10 · Nov 4, 2019 · Updated 6 years ago
- ☆11 · Sep 10, 2022 · Updated 3 years ago
- Prioritized Sequence Experience Replay ☆10 · Aug 16, 2021 · Updated 4 years ago
- ☆11 · Sep 5, 2024 · Updated last year
- Authors' implementation of PEER ☆11 · Jul 13, 2023 · Updated 2 years ago
- Reimplementation of simple policy gradient algorithms such as REINFORCE and Actor-Critic methods. ☆16 · Aug 26, 2023 · Updated 2 years ago
- Heterogeneous Causal Metapath Graph Neural Network for Gene-Microbe-Disease Association Prediction ☆12 · Aug 19, 2024 · Updated last year
- Model-based Hindsight Experience Replay ☆10 · Jun 8, 2022 · Updated 3 years ago
- Implementing DQNClipped and DQNReg Algorithms ☆10 · Mar 2, 2021 · Updated 5 years ago
- ☆12 · Jun 2, 2024 · Updated last year
- ☆11 · Jul 10, 2025 · Updated 7 months ago
- Testing and implementation of ML algorithms for the analysis of cryptocurrency trends. ☆11 · Feb 20, 2024 · Updated 2 years ago
- Explainability of Deep RL algorithms using graph networks and layer-wise relevance propagation. ☆11 · Aug 20, 2024 · Updated last year
- ITU-T Rec. P.1203 Codec Extension to VP9 and HEVC ☆14 · Mar 16, 2020 · Updated 5 years ago
- Implementation prototype of the Deep Deterministic Off-Policy Gradient (DD-OPG) method. ☆11 · Jun 12, 2019 · Updated 6 years ago
- ☆11 · May 13, 2019 · Updated 6 years ago
- CFR-based Texas Hold'em AI ☆11 · Jan 30, 2021 · Updated 5 years ago
- Official implementation for "Let Offline RL Flow: Training Conservative Agents in the Latent Space of Normalizing Flows", NeurIPS 2022, O… ☆12 · Jan 31, 2023 · Updated 3 years ago
- Source code for Interpretable Reward Redistribution in Reinforcement Learning: A Causal Approach (NeurIPS 2023) ☆10 · Dec 12, 2023 · Updated 2 years ago