Revisiting Discrete Soft Actor-Critic Accepted by Transactions on Machine Learning Research (TMLR)
☆28Nov 23, 2024Updated last year
Alternatives and similar repositories for SD-SAC
Users that are interested in SD-SAC are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation of the Discrete Soft Actor-Critic algorithm with RNN policy in PyTorch☆26Jan 7, 2023Updated 3 years ago
- [AAMAS 2023] Code for the paper "Automatic Noise Filtering with Dynamic Sparse Training in Deep Reinforcement Learning"☆12Feb 22, 2024Updated 2 years ago
- Bayesian Soft Actor Critic☆16Jan 6, 2023Updated 3 years ago
- ☆12Jan 4, 2023Updated 3 years ago
- Source code for "Congestion-aware Distributed Task Offloading in Wireless Multi-hop Networks Using Graph Neural Networks"☆14Oct 23, 2024Updated last year
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- ☆12Nov 21, 2023Updated 2 years ago
- Seq-HGNN: Learning Sequential Node Representation on Heterogeneous Graph☆12Aug 2, 2023Updated 2 years ago
- Explainability of Deep RL algorithms using graph networks and layer-wise relevance propagation.☆12Aug 20, 2024Updated last year
- 数据科学与人工智能中文讲义☆14Apr 6, 2026Updated last month
- ☆11May 13, 2019Updated 6 years ago
- Reinforcement learning for batch bioprocess optimization (Computers & Chemical Engineering, 2020)☆16Jun 14, 2022Updated 3 years ago
- [ICML 2022] Robust Deep Reinforcement Learning through Bootstrapped Opportunistic Curriculum☆11Jul 15, 2022Updated 3 years ago
- Heterogeneous Causal Metapath Graph Neural Network for Gene-Microbe-Disease Association Prediction☆12Aug 19, 2024Updated last year
- The implementation of ICLR 2023 paper "Discovering Generalizable Multi-agent Coordination Skills from Multi-task Offline Data".☆46Oct 31, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Adopting reasonable strategies is challenging but crucial for an intelligent agent with limited resources working in hazardous, unstructu…☆13Dec 28, 2022Updated 3 years ago
- CFR-based Texas Hold'em AI☆11Jan 30, 2021Updated 5 years ago
- Mobility-aware Dynamic Joint Power Control and Resource Allocation for D2D underlaying cellular networks☆14Sep 6, 2020Updated 5 years ago
- ☆11Oct 9, 2019Updated 6 years ago
- Minimum Energy Resource Allocation Strategy with partial offloading☆10Jan 17, 2022Updated 4 years ago
- Experiment for Understanding the Effects of Dataset Characteristics on Offline Reinforcement Learning☆26Jan 16, 2023Updated 3 years ago
- ☆14Nov 17, 2023Updated 2 years ago
- ☆10Jun 22, 2020Updated 5 years ago
- Code to related to my NIPS 2016 paper☆10Dec 4, 2016Updated 9 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆34Mar 24, 2023Updated 3 years ago
- ☆15Jun 2, 2024Updated last year
- ☆10Nov 5, 2024Updated last year
- A collection of matrix games in JAX☆13Apr 13, 2026Updated 3 weeks ago
- ☆29Jun 6, 2024Updated last year
- the source code of UNMAS: Multi-Agent Reinforcement Learning for Unshaped Cooperative Scenarios☆42Mar 26, 2022Updated 4 years ago
- official implementation for our paper Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning (NeurIPS 2023)☆121Jul 31, 2024Updated last year
- ☆12Mar 14, 2024Updated 2 years ago
- cd with an alias name☆12Jan 24, 2019Updated 7 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Environment to train and compare irrigation scheduling strategies with AquaCrop-OSPy☆16Jun 6, 2022Updated 3 years ago
- ☆14Jun 20, 2019Updated 6 years ago
- ☆12Mar 12, 2018Updated 8 years ago
- ☆13Jun 17, 2022Updated 3 years ago
- ☆11Sep 10, 2022Updated 3 years ago
- RL Dresden Algorithm Suite☆36Jul 22, 2024Updated last year
- A LLM-powered agent for NetHack☆22Nov 4, 2024Updated last year