Soft Actor-Critic with advanced features
☆51 · Mar 2, 2026 · Updated last month
Alternatives and similar repositories for Advanced-Soft-Actor-Critic
Users that are interested in Advanced-Soft-Actor-Critic are comparing it to the libraries listed below.
- PyTorch implementation of Soft-Actor-Critic and Prioritized Experience Replay (PER) + Emphasizing Recent Experience (ERE) + Munchausen RL… · ☆295 · Feb 24, 2021 · Updated 5 years ago
- Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21) · ☆91 · Nov 21, 2023 · Updated 2 years ago
- PyTorch implementation of Munchausen Reinforcement Learning based on DQN and SAC. Handles discrete and continuous action spaces · ☆15 · Oct 3, 2021 · Updated 4 years ago
- Implementations of Stable Contrastive RL · ☆22 · Apr 13, 2025 · Updated last year
- OpenRAN Gym website · ☆13 · Mar 30, 2026 · Updated 2 weeks ago
- Reinforcement Learning Algorithms with Unity 3D Environments · ☆18 · Jul 15, 2019 · Updated 6 years ago
- official implementation for our paper Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning (NeurIPS 2023) · ☆121 · Jul 31, 2024 · Updated last year
- Gym implementation of connector to Deepmind lab · ☆12 · Mar 26, 2019 · Updated 7 years ago
- This is code of paper entitled "AI-based Radio Resource and Transmission Opportunity Allocation for 5G-V2X HetNets: NR and NR-U networks… · ☆15 · Sep 8, 2023 · Updated 2 years ago
- Official implementation of the algorithmic approach presented in the research paper entitled "Risk-Sensitive Policy with Distributional R… · ☆15 · Dec 19, 2022 · Updated 3 years ago
- A Python implementation of the non-dominated sorting. · ☆13 · Jul 17, 2023 · Updated 2 years ago
- Advantage weighted Actor Critic for Offline RL · ☆53 · Aug 27, 2022 · Updated 3 years ago
- ☆10 · Jun 27, 2025 · Updated 9 months ago
- Ranking Policy Gradient · ☆23 · Nov 27, 2019 · Updated 6 years ago
- 🤖 Elegant implementations of offline safe RL algorithms in PyTorch · ☆237 · Sep 13, 2024 · Updated last year
- Repository for 5G-Monarch paper · ☆12 · Jan 16, 2026 · Updated 2 months ago
- Unofficial Code for NeurIPS 2021 paper "Regret Minimization Experience Replay in Off-policy Reinforcement Learning" · ☆14 · May 24, 2021 · Updated 4 years ago
- Official PyTorch code for "Sample Efficient Offline-to-Online Reinforcement Learning" in TKDE'23. · ☆16 · Aug 14, 2023 · Updated 2 years ago
- PyTorch implementation of D4PG with the SOTA IQN Critic instead of C51. Implementation includes also the extensions Munchausen RL and D2R… · ☆24 · Apr 7, 2021 · Updated 5 years ago
- ☆23 · Oct 7, 2018 · Updated 7 years ago
- Code for the paper "Phasic Policy Gradient" · ☆268 · Apr 2, 2023 · Updated 3 years ago
- Implementation of "Learning Continuous Control Policies by Stochastic Value Gradients" (https://arxiv.org/abs/1510.09142) · ☆25 · Jan 15, 2022 · Updated 4 years ago
- ☆28 · Dec 16, 2022 · Updated 3 years ago
- Adaptive traffic signal control using deep reinforcement learning built using SUMO · ☆13 · Mar 5, 2022 · Updated 4 years ago
- ☆13 · Mar 25, 2021 · Updated 5 years ago
- Author's PyTorch implementation of BCQ for continuous and discrete actions · ☆662 · Apr 6, 2021 · Updated 5 years ago
- Attentional Mechanism incorporated in Asynchronous Advantage Actor Critic a3c/a2c deep mind · ☆10 · Jan 9, 2018 · Updated 8 years ago
- Code for "Dream and Search to Control: Latent Space Planning for Continuous Control" · ☆12 · Jul 12, 2021 · Updated 4 years ago
- Reinforcement Learning Algorithms Based on PyTorch · ☆452 · Oct 21, 2021 · Updated 4 years ago
- TACO-RL: Latent Plans for Task-Agnostic Offline Reinforcement Learning · ☆31 · Jan 26, 2023 · Updated 3 years ago
- Codes for "Efficient Offline Policy Optimization with a Learned Model", ICLR2023 · ☆30 · Jul 18, 2023 · Updated 2 years ago
- ☆55 · Feb 28, 2024 · Updated 2 years ago
- yolov3-tensorflow-c++ · ☆10 · Mar 2, 2019 · Updated 7 years ago
- PGQ is an approach to combine Policy Gradient and Q-Learning. This repository will contain an implementation of PGQ. · ☆15 · Mar 9, 2017 · Updated 9 years ago
- [deprecated] Engine Agnostic Gym Environment for Robotics · ☆17 · Feb 10, 2022 · Updated 4 years ago
- ☆99 · Mar 24, 2023 · Updated 3 years ago
- ☆32 · Mar 19, 2024 · Updated 2 years ago
- Keras implementation of guide actor-critic for continuous control · ☆11 · Mar 12, 2018 · Updated 8 years ago
- Crazyflie UAV simulation based on the PyFlyt library · ☆22 · Aug 29, 2023 · Updated 2 years ago