Revisiting Discrete Soft Actor-Critic Accepted by Transactions on Machine Learning Research (TMLR)
☆29Nov 23, 2024Updated last year
Alternatives and similar repositories for SD-SAC
Users that are interested in SD-SAC are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation of the Discrete Soft Actor-Critic algorithm with RNN policy in PyTorch☆26Jan 7, 2023Updated 3 years ago
- [AAMAS 2023] Code for the paper "Automatic Noise Filtering with Dynamic Sparse Training in Deep Reinforcement Learning"☆12Feb 22, 2024Updated 2 years ago
- Bayesian Soft Actor Critic☆16Jan 6, 2023Updated 3 years ago
- ☆12Jan 4, 2023Updated 3 years ago
- ☆40Nov 17, 2021Updated 4 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Source code for "Congestion-aware Distributed Task Offloading in Wireless Multi-hop Networks Using Graph Neural Networks"☆14Oct 23, 2024Updated last year
- Seq-HGNN: Learning Sequential Node Representation on Heterogeneous Graph☆12Aug 2, 2023Updated 2 years ago
- Explainability of Deep RL algorithms using graph networks and layer-wise relevance propagation.☆12Aug 20, 2024Updated last year
- 数据科学与人工智能中文讲义☆14May 13, 2026Updated last month
- ☆11May 13, 2019Updated 7 years ago
- ☆15Apr 21, 2024Updated 2 years ago
- Reinforcement learning for batch bioprocess optimization (Computers & Chemical Engineering, 2020)☆16Jun 14, 2022Updated 4 years ago
- [ICML 2022] Robust Deep Reinforcement Learning through Bootstrapped Opportunistic Curriculum☆12Jul 15, 2022Updated 3 years ago
- Heterogeneous Causal Metapath Graph Neural Network for Gene-Microbe-Disease Association Prediction☆12Aug 19, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- The implementation of ICLR 2023 paper "Discovering Generalizable Multi-agent Coordination Skills from Multi-task Offline Data".☆45Oct 31, 2024Updated last year
- Adopting reasonable strategies is challenging but crucial for an intelligent agent with limited resources working in hazardous, unstructu…☆13Dec 28, 2022Updated 3 years ago
- CFR-based Texas Hold'em AI☆11Jan 30, 2021Updated 5 years ago
- ☆35Aug 17, 2022Updated 3 years ago
- Mobility-aware Dynamic Joint Power Control and Resource Allocation for D2D underlaying cellular networks☆14Sep 6, 2020Updated 5 years ago
- Minimum Energy Resource Allocation Strategy with partial offloading☆10Jan 17, 2022Updated 4 years ago
- Experiment for Understanding the Effects of Dataset Characteristics on Offline Reinforcement Learning☆26Jan 16, 2023Updated 3 years ago
- ☆14Nov 17, 2023Updated 2 years ago
- ☆10Jun 22, 2020Updated 5 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Code to related to my NIPS 2016 paper☆10Dec 4, 2016Updated 9 years ago
- ☆34Mar 24, 2023Updated 3 years ago
- ☆15Jun 2, 2024Updated 2 years ago
- ☆10Nov 5, 2024Updated last year
- A collection of matrix games in JAX☆14Apr 13, 2026Updated 2 months ago
- ☆29Jun 6, 2024Updated 2 years ago
- the source code of UNMAS: Multi-Agent Reinforcement Learning for Unshaped Cooperative Scenarios☆42Mar 26, 2022Updated 4 years ago
- official implementation for our paper Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning (NeurIPS 2023)☆121Jul 31, 2024Updated last year
- cd with an alias name☆12Jan 24, 2019Updated 7 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Transactive Energy Service System☆14Mar 4, 2023Updated 3 years ago
- A drop-in replacement for Rosetta Relax☆29Jan 30, 2026Updated 4 months ago
- The official Python library for Formulaic☆18Apr 25, 2024Updated 2 years ago
- Environment to train and compare irrigation scheduling strategies with AquaCrop-OSPy☆16Jun 6, 2022Updated 4 years ago
- ☆14Jun 20, 2019Updated 6 years ago
- ☆13Jun 17, 2022Updated 3 years ago
- ☆11Sep 10, 2022Updated 3 years ago