soft q learning and soft actor critic
☆16Dec 23, 2018Updated 7 years ago
Alternatives and similar repositories for Entropy-Regularized-RL
Users that are interested in Entropy-Regularized-RL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- DQN with freezing target network in tensorflow on pygame FlappyBird☆11Dec 19, 2018Updated 7 years ago
- Authors' implementation of PEER☆11Jul 13, 2023Updated 2 years ago
- The Reinforcement-Learning-Related Papers of ICLR 2019☆47May 28, 2019Updated 7 years ago
- Modified versions of the Soft Actor-Critic algorithm for Atari games from https://github.com/ac-93/soft-actor-critic.☆20May 18, 2020Updated 6 years ago
- [ICRA 2021] Learning Robot Trajectories subject to Kinematic Joint Constraints☆12Jul 29, 2025Updated 10 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Javakuka is an open-source project for creating Kuka Robot Language (KRL) codes in Java.☆10May 15, 2024Updated 2 years ago
- Actor Prioritized Experience Replay☆19Nov 20, 2023Updated 2 years ago
- Official pytorch implementation for our ICLR 2023 paper "Latent State Marginalization as a Low-cost Approach for Improving Exploration".☆24Feb 9, 2023Updated 3 years ago
- PyTorch implementation of SAC-Q Reinforcement Learning Algorithm (tested on OpenAI Gym environments)☆39Feb 13, 2021Updated 5 years ago
- practice☆11Jun 30, 2020Updated 5 years ago
- Application of the Industrial Robotic Arm KR6 R900 sixx in 3D Milling that includes developing post-processing tools to convert any conve…☆12Oct 5, 2022Updated 3 years ago
- Python library used to communicate and control Kuka manipulator (tested on KR16 model). Under continuous development.☆12Aug 9, 2018Updated 7 years ago
- Dataset collection and training code for "Ask Your Humans: Using Human Instructions to Improve Generalization in Reinforcement Learning"☆11Apr 8, 2025Updated last year
- TF2 Implementation of the Soft Actor-Critic Algorithm☆43Dec 8, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆16Sep 25, 2020Updated 5 years ago
- Jaxplorer is a Jax reinforcement learning (RL) framework for exploring new ideas.☆13Jul 19, 2024Updated last year
- Quadratic Programming for Continuous Control of Safety-Critical Multi-Agent Systems Under Uncertainty☆14Sep 7, 2024Updated last year
- Implementation of Continuous Control RL Algorithms☆11Dec 8, 2022Updated 3 years ago
- Waste Sorting with Robot Arm Tossing☆10Sep 19, 2023Updated 2 years ago
- Reinforcement Learning with Deep Energy-Based Policies☆438Nov 28, 2023Updated 2 years ago
- The continuous mountain car problem solved with DDPG☆13Apr 19, 2020Updated 6 years ago
- Designing an optimized path for multiple robots in a warehouse for picking and delivery operations using A* algorithm (shortest path) and…☆11Jul 28, 2023Updated 2 years ago
- PyTorch implementation of D4PG with the SOTA IQN Critic instead of C51. Implementation includes also the extensions Munchausen RL and D2R…☆24Apr 7, 2021Updated 5 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- PIC: Permutation Invariant Critic for Multi-Agent Deep Reinforcement Learning☆51Jun 7, 2021Updated 5 years ago
- ☆20Feb 18, 2022Updated 4 years ago
- AI-powered cryptocurrency trading bot built using deep reinforcement learning (DRL). The bot is designed as a research platform for devel…☆11Jan 18, 2025Updated last year
- Aerial Combat environment build around PyFlyt☆12Aug 12, 2023Updated 2 years ago
- Code for paper "Learning to Guide: Guidance Law Based on Deep Meta-learning and Model Predictive Path Integral Control"☆11May 26, 2019Updated 7 years ago
- [Humanoids 2022] Learning Collision-free and Torque-limited Robot Trajectories based on Alternative Safe Behaviors☆23Dec 18, 2025Updated 5 months ago
- Pytorch implementation of large network design in continous control RL.☆19Jan 5, 2022Updated 4 years ago
- Represented Value Function Approach for Large Scale Multi Agent Reinforcement Learning☆17Mar 11, 2020Updated 6 years ago
- Implementation of CoDAIL in the ICLR 2020 paper <Multi-Agent Interactions Modeling with Correlated Policies>☆19Jun 17, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Codes for the paper "SIDE: State Inference for Partially Observable Cooperative Multi-Agent Reinforcement Learning"☆11Jun 24, 2022Updated 3 years ago
- A python implemenation of tabular MuZero for educational purposes☆21Dec 11, 2019Updated 6 years ago
- Uncertainty-Aware DRL for Autonomous Vehicle Crowd Navigation in Shared Space (IEEE-IV-2024)☆32Jun 22, 2024Updated last year
- PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT…☆16Nov 18, 2020Updated 5 years ago
- This is the official implementation of the voxel-based humanoid locomotion in "Gallant: Voxel Grid-based Humanoid Locomotion and Local-na…☆69Apr 24, 2026Updated last month
- PyTorch implementation of the Munchausen Reinforcement Learning Algorithms M-DQN and M-IQN☆46Oct 4, 2020Updated 5 years ago
- A python implementation of PROCLUS: PROjected CLUStering algorithm.☆10Jan 12, 2015Updated 11 years ago