Revisiting Discrete Soft Actor-Critic Accepted by Transactions on Machine Learning Research (TMLR)
☆29Nov 23, 2024Updated last year
Alternatives and similar repositories for SD-SAC
Users that are interested in SD-SAC are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation of the Discrete Soft Actor-Critic algorithm with RNN policy in PyTorch☆26Jan 7, 2023Updated 3 years ago
- [AAMAS 2023] Code for the paper "Automatic Noise Filtering with Dynamic Sparse Training in Deep Reinforcement Learning"☆12Feb 22, 2024Updated 2 years ago
- Bayesian Soft Actor Critic☆16Jan 6, 2023Updated 3 years ago
- ☆12Jan 4, 2023Updated 3 years ago
- ☆40Nov 17, 2021Updated 4 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Source code for "Congestion-aware Distributed Task Offloading in Wireless Multi-hop Networks Using Graph Neural Networks"☆14Oct 23, 2024Updated last year
- Seq-HGNN: Learning Sequential Node Representation on Heterogeneous Graph☆12Aug 2, 2023Updated 2 years ago
- Explainability of Deep RL algorithms using graph networks and layer-wise relevance propagation.☆12Aug 20, 2024Updated last year
- 数据科学与人工智能中文讲义☆14May 13, 2026Updated last week
- Collection of open source hypervolume codes that have been standardized to work with the MOEA Framework.☆13Apr 6, 2024Updated 2 years ago
- ☆15Apr 21, 2024Updated 2 years ago
- Heterogeneous Causal Metapath Graph Neural Network for Gene-Microbe-Disease Association Prediction☆12Aug 19, 2024Updated last year
- The implementation of ICLR 2023 paper "Discovering Generalizable Multi-agent Coordination Skills from Multi-task Offline Data".☆46Oct 31, 2024Updated last year
- Adopting reasonable strategies is challenging but crucial for an intelligent agent with limited resources working in hazardous, unstructu…☆13Dec 28, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆35Aug 17, 2022Updated 3 years ago
- Mobility-aware Dynamic Joint Power Control and Resource Allocation for D2D underlaying cellular networks☆14Sep 6, 2020Updated 5 years ago
- Minimum Energy Resource Allocation Strategy with partial offloading☆10Jan 17, 2022Updated 4 years ago
- Experiment for Understanding the Effects of Dataset Characteristics on Offline Reinforcement Learning☆26Jan 16, 2023Updated 3 years ago
- ☆14Nov 17, 2023Updated 2 years ago
- ☆10Jun 22, 2020Updated 5 years ago
- Code to related to my NIPS 2016 paper☆10Dec 4, 2016Updated 9 years ago
- ☆34Mar 24, 2023Updated 3 years ago
- ☆15Jun 2, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆10Nov 5, 2024Updated last year
- A collection of matrix games in JAX☆13Apr 13, 2026Updated last month
- ☆29Jun 6, 2024Updated last year
- official implementation for our paper Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning (NeurIPS 2023)☆121Jul 31, 2024Updated last year
- ☆12Mar 14, 2024Updated 2 years ago
- cd with an alias name☆12Jan 24, 2019Updated 7 years ago
- Transactive Energy Service System☆14Mar 4, 2023Updated 3 years ago
- Environment to train and compare irrigation scheduling strategies with AquaCrop-OSPy☆16Jun 6, 2022Updated 3 years ago
- ☆12Mar 12, 2018Updated 8 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆13Jun 17, 2022Updated 3 years ago
- ☆11Sep 10, 2022Updated 3 years ago
- Java Code for Paper: Variable-Length Particle Swarm Optimisation for Feature Selection on High-Dimensional Classification☆10Jul 2, 2020Updated 5 years ago
- RL Dresden Algorithm Suite☆36Jul 22, 2024Updated last year
- code☆14Feb 22, 2023Updated 3 years ago
- A LLM-powered agent for NetHack☆23Nov 4, 2024Updated last year
- docker image for reinforcement learning including Open AI roboschool☆13Jun 16, 2019Updated 6 years ago