Soft Actor-Critic with advanced features
☆51Jan 4, 2026Updated 2 months ago
Alternatives and similar repositories for Advanced-Soft-Actor-Critic
Users who are interested in Advanced-Soft-Actor-Critic compare it to the libraries listed below
- PyTorch implementation of Munchausen Reinforcement Learning based on DQN and SAC. Handles discrete and continuous action spaces☆15Oct 3, 2021Updated 4 years ago
- Code for the paper "AI-based Radio Resource and Transmission Opportunity Allocation for 5G-V2X HetNets: NR and NR-U networks…☆15Sep 8, 2023Updated 2 years ago
- Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21)☆90Nov 21, 2023Updated 2 years ago
- Implementations of Stable Contrastive RL☆23Apr 13, 2025Updated 10 months ago
- PyTorch implementation of Soft-Actor-Critic and Prioritized Experience Replay (PER) + Emphasizing Recent Experience (ERE) + Munchausen RL…☆295Feb 24, 2021Updated 5 years ago
- ☆23Oct 7, 2018Updated 7 years ago
- Master Thesis☆10Jan 28, 2023Updated 3 years ago
- Ranking Policy Gradient☆23Nov 27, 2019Updated 6 years ago
- Official implementation for "Let Offline RL Flow: Training Conservative Agents in the Latent Space of Normalizing Flows", NeurIPS 2022, O…☆12Jan 31, 2023Updated 3 years ago
- Gym connector for DeepMind Lab☆12Mar 26, 2019Updated 6 years ago
- Attention mechanism incorporated into Asynchronous Advantage Actor-Critic (A3C/A2C, DeepMind)☆10Jan 9, 2018Updated 8 years ago
- OpenRAN Gym website☆12Dec 11, 2025Updated 2 months ago
- ☆10Jun 27, 2025Updated 8 months ago
- Exploring the use of options in creating small worlds for faster learning in RL Domains☆16Jan 23, 2012Updated 14 years ago
- 🤖 Reinforcement Learning paper summaries, notebooks, and articles.☆26Apr 16, 2020Updated 5 years ago
- Code for "Dream and Search to Control: Latent Space Planning for Continuous Control"☆12Jul 12, 2021Updated 4 years ago
- End-2-end V2X latency models in 5G networks☆11Jan 17, 2023Updated 3 years ago
- Repository for 5G-Monarch paper☆13Jan 16, 2026Updated last month
- ☆11Oct 19, 2020Updated 5 years ago
- Official implementation of the algorithmic approach presented in the research paper entitled "Risk-Sensitive Policy with Distributional R…☆15Dec 19, 2022Updated 3 years ago
- PyTorch implementation of D4PG with the SOTA IQN critic instead of C51. The implementation also includes the extensions Munchausen RL and D2R…☆24Apr 7, 2021Updated 4 years ago
- CS234 Reinforcement Learning: Keras implementation of Recurrent Deterministic Policy Gradient (https://arxiv.org/abs/1512.04455)☆10Jun 10, 2017Updated 8 years ago
- Code implementing the algorithm and the benchmark of the paper "Power Minimization of Downlink Spectrum Slicing for eMBB and URLLC Users"☆13Dec 1, 2022Updated 3 years ago
- ☆16Jan 1, 2023Updated 3 years ago
- Keras implementation of guide actor-critic for continuous control☆11Mar 12, 2018Updated 7 years ago
- Code for ICLR 2024 paper "When should we prefer Decision Transformers for Offline Reinforcement Learning?"☆17Jan 31, 2024Updated 2 years ago
- Unofficial Code for NeurIPS 2021 paper "Regret Minimization Experience Replay in Off-policy Reinforcement Learning"☆14May 24, 2021Updated 4 years ago
- ☆15Jan 7, 2022Updated 4 years ago
- 🤖 Elegant implementations of offline safe RL algorithms in PyTorch☆232Sep 13, 2024Updated last year
- myGym is a robotic simulator. It allows creating novel multi-step, long-horizon tasks without coding. There are automatic task building…☆53Updated this week
- Implementation of "Learning Continuous Control Policies by Stochastic Value Gradients" (https://arxiv.org/abs/1510.09142)☆25Jan 15, 2022Updated 4 years ago
- Advantage weighted Actor Critic for Offline RL☆52Aug 27, 2022Updated 3 years ago
- Code for the paper "Phasic Policy Gradient"☆267Apr 2, 2023Updated 2 years ago
- TACO-RL: Latent Plans for Task-Agnostic Offline Reinforcement Learning☆30Jan 26, 2023Updated 3 years ago
- Official PyTorch code for "Sample Efficient Offline-to-Online Reinforcement Learning" in TKDE'23.☆16Aug 14, 2023Updated 2 years ago
- ☆13Mar 25, 2021Updated 4 years ago
- A Python implementation of non-dominated sorting.☆13Jul 17, 2023Updated 2 years ago
- Convert DeepMind Control Suite to OpenAI gym environments.☆87Jan 31, 2020Updated 6 years ago
- This project was moved to: https://github.com/coax-dev/coax☆161Nov 28, 2022Updated 3 years ago