Soft Actor-Critic with advanced features
☆52May 29, 2026Updated 2 weeks ago
Alternatives and similar repositories for Advanced-Soft-Actor-Critic
Users that are interested in Advanced-Soft-Actor-Critic are comparing it to the libraries listed below.
- PyTorch implementation of Soft-Actor-Critic and Prioritized Experience Replay (PER) + Emphasizing Recent Experience (ERE) + Munchausen RL…☆296Feb 24, 2021Updated 5 years ago
- Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21)☆92Nov 21, 2023Updated 2 years ago
- PyTorch implementation of Munchausen Reinforcement Learning based on DQN and SAC. Handles discrete and continuous action spaces☆15Oct 3, 2021Updated 4 years ago
- Implementations of Stable Contrastive RL☆22Apr 13, 2025Updated last year
- Reinforcement Learning Algorithms with Unity 3D Environments☆18Jul 15, 2019Updated 6 years ago
- official implementation for our paper Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning (NeurIPS 2023)☆121Jul 31, 2024Updated last year
- Gym implementation of connector to Deepmind lab☆12Mar 26, 2019Updated 7 years ago
- This is code of paper entitled "AI-based Radio Resource and Transmission Opportunity Allocation for 5G-V2X HetNets: NR and NR-U networks…☆16Sep 8, 2023Updated 2 years ago
- OpenRAN Gym website☆14Jun 3, 2026Updated last week
- Official implementation of the algorithmic approach presented in the research paper entitled "Risk-Sensitive Policy with Distributional R…☆15Dec 19, 2022Updated 3 years ago
- A Python implementation of the non-dominated sorting.☆13Jul 17, 2023Updated 2 years ago
- Advantage weighted Actor Critic for Offline RL☆53Aug 27, 2022Updated 3 years ago
- ☆11Jun 27, 2025Updated 11 months ago
- 🤖 Reinforcement Learning paper summaries, notebooks, and articles.☆26Apr 16, 2020Updated 6 years ago
- Ranking Policy Gradient☆23Nov 27, 2019Updated 6 years ago
- 🤖 Elegant implementations of offline safe RL algorithms in PyTorch☆242Sep 13, 2024Updated last year
- Repository for 5G-Monarch paper☆12Jan 16, 2026Updated 4 months ago
- Unofficial Code for NeurIPS 2021 paper "Regret Minimization Experience Replay in Off-policy Reinforcement Learning"☆14May 24, 2021Updated 5 years ago
- Official PyTorch code for "Sample Efficient Offline-to-Online Reinforcement Learning" in TKDE'23.☆16Aug 14, 2023Updated 2 years ago
- PyTorch implementation of D4PG with the SOTA IQN Critic instead of C51. Implementation includes also the extensions Munchausen RL and D2R…☆24Apr 7, 2021Updated 5 years ago
- ☆23Oct 7, 2018Updated 7 years ago
- Implementation of [Learning Continuous Control Policies by Stochastic Value Gradients](https://arxiv.org/abs/1510.09142)☆25Jan 15, 2022Updated 4 years ago
- Code for the paper "Phasic Policy Gradient"☆267Apr 2, 2023Updated 3 years ago
- Code release for "Supported Policy Optimization for Offline Reinforcement Learning" (NeurIPS 2022), https://arxiv.org/abs/2202.06239☆22Jun 24, 2023Updated 2 years ago
- ☆28Dec 16, 2022Updated 3 years ago
- Adaptive traffic signal control using deep reinforcement learning built using SUMO☆13Mar 5, 2022Updated 4 years ago
- ☆13Mar 25, 2021Updated 5 years ago
- Reads http://go.drawthe.net YAML files and recreates a GNS3 topology☆16May 21, 2020Updated 6 years ago
- Author's PyTorch implementation of BCQ for continuous and discrete actions☆663Apr 6, 2021Updated 5 years ago
- Attentional Mechanism incorporated in Asynchronous Advantage Actor Critic a3c/a2c deep mind☆10Jan 9, 2018Updated 8 years ago
- Code for "Dream and Search to Control: Latent Space Planning for Continuous Control"☆12Jul 12, 2021Updated 4 years ago
- Reinforcement Learning Algorithms Based on PyTorch☆452Oct 21, 2021Updated 4 years ago
- ☆54Mar 3, 2020Updated 6 years ago
- TACO-RL: Latent Plans for Task-Agnostic Offline Reinforcement Learning☆32Jan 26, 2023Updated 3 years ago
- ☆55Feb 28, 2024Updated 2 years ago
- Codes for "Efficient Offline Policy Optimization with a Learned Model", ICLR2023☆30Jul 18, 2023Updated 2 years ago
- PGQ is an approach to combine Policy Gradient and Q-Learning. This repository will contain an implementation of PGQ.☆15Mar 9, 2017Updated 9 years ago
- [deprecated] Engine Agnostic Gym Environment for Robotics☆17Feb 10, 2022Updated 4 years ago
- ☆99Mar 24, 2023Updated 3 years ago