☆40Nov 17, 2021Updated 4 years ago
Alternatives and similar repositories for DiscreteSAC
Users that are interested in DiscreteSAC are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- PyTorch implementation of the discrete Soft-Actor-Critic algorithm.☆58Oct 1, 2021Updated 4 years ago
- A clean and robust Pytorch implementation of SAC on discrete action space☆43Oct 23, 2024Updated last year
- Revisiting Discrete Soft Actor-Critic Accepted by Transactions on Machine Learning Research (TMLR)☆28Nov 23, 2024Updated last year
- PyTorch implementation of discrete version of Soft Actor-Critic.☆37Sep 19, 2021Updated 4 years ago
- Re-produce DQN, REINFORCE, REINFORCE with baseline, one-step AC, QAC, QAC with shared network, PPO2, DDPG, TD3, SAC, SAC discrete,A2C,A3C☆21Jul 27, 2020Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Pytorch implementation of BEAR in "Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction"☆11Oct 29, 2019Updated 6 years ago
- Reference code for the paper ""Centroid-Guided Target-Driven Topology Control Method for UAV Ad-Hoc Networks Based on Tiny Deep Reinforce…☆11Oct 21, 2024Updated last year
- Benchmark for evaluating the generalization capabilities of Multi-Objective Reinforcement Learning (MORL) algorithms.☆27Jun 6, 2025Updated 11 months ago
- Multi-Agent training using Deep Deterministic Policy Gradient Networks, Solving the Tennis Environment☆11Oct 20, 2018Updated 7 years ago
- Implementing different learning algorithms and analyzing their performance in a Markov game model called the Soccer Game☆23Jan 29, 2023Updated 3 years ago
- 🧶 Minimal PyTorch Soft Actor Critic (SAC) implementation☆39Feb 19, 2022Updated 4 years ago
- solve LASSO formulation with Proximal Gradient Descent, Accelerated Gradient Descent, and Coordinate Gradient Descent☆21Dec 31, 2014Updated 11 years ago
- ☆33Jun 16, 2023Updated 2 years ago
- "Adaptive Cruise Control for a Hybrid Vehicle with Deep Policy Gradients". Final project for ECE 517/414 Reinforcement Learning.☆13Dec 8, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Open Source Reinforcement Learning Framework for Routing and Spectrum Assignment☆10Mar 18, 2021Updated 5 years ago
- Codebase for BRDiv: Diverse teammate generation for ad hoc teamwork☆13May 2, 2024Updated 2 years ago
- An Implementation of Recurrent Experience Replay in Distributed Reinforcement Learning (Kapturowski et al. 2019) in PyTorch☆54Jul 19, 2022Updated 3 years ago
- Multi Agent SAC and DDPG applied to path finding in a 3-dimensional grid☆15Aug 8, 2021Updated 4 years ago
- Multi-view Reinforcement Learning☆11Feb 9, 2020Updated 6 years ago
- Source code for journal paper "Multiagent Reinforcement Learning With Sparse Interactions by Negotiation and Knowledge Transfer"☆13Dec 26, 2017Updated 8 years ago
- ☆10Apr 2, 2023Updated 3 years ago
- PyTorch implementation of Soft Actor-Critic(SAC).☆107Jun 9, 2020Updated 5 years ago
- Calculation of the entropy of the batch of images (whole image or patches)☆10Oct 15, 2021Updated 4 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Soccer toy example simulator used in Reinforcement Learning☆12Mar 11, 2018Updated 8 years ago
- Trust Management for Vehicular Networks☆11Aug 6, 2025Updated 9 months ago
- Highway-Env Agent using DQN☆19May 29, 2022Updated 3 years ago
- ☆14Sep 25, 2023Updated 2 years ago
- Triangulated irregular network☆11Mar 29, 2015Updated 11 years ago
- multi-workflow scheduling☆15Dec 30, 2021Updated 4 years ago
- ppo+action mask for atari tennis agent☆12Mar 2, 2023Updated 3 years ago
- ☆10Aug 8, 2021Updated 4 years ago
- Code to related to my NIPS 2016 paper☆10Dec 4, 2016Updated 9 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Multi-agent Deep Reinforcement Learning for Efficient Computation Offloading in Mobile Edge Computing☆14Jun 7, 2023Updated 2 years ago
- Python API for the SUMO environment of Plymouth Rd.☆14Feb 1, 2021Updated 5 years ago
- Asynchronous Advantage Actor-Critic using Generalized Advantage Estimation (PyTorch)☆10Oct 11, 2019Updated 6 years ago
- ☆15May 4, 2025Updated last year
- ☆14Feb 28, 2021Updated 5 years ago
- Blockchain Based Approach for Trust Management in Intelligent Transportation Systems with Smart Contracts☆13Jul 19, 2022Updated 3 years ago
- The test code for the paper "Attention-based advantage actor-critic algorithm with prioritized experience replay for complex 2-D robotic …☆10Aug 7, 2022Updated 3 years ago