soft q learning and soft actor critic
☆16Dec 23, 2018Updated 7 years ago
Alternatives and similar repositories for Entropy-Regularized-RL
Users that are interested in Entropy-Regularized-RL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- DQN with freezing target network in tensorflow on pygame FlappyBird☆11Dec 19, 2018Updated 7 years ago
- A small library intended for controlling KUKA robots using KRC4 over KUKA RSI (Robot Sensor Interface) from Simulink.☆11Mar 30, 2017Updated 9 years ago
- Authors' implementation of PEER☆11Jul 13, 2023Updated 2 years ago
- The Reinforcement-Learning-Related Papers of ICLR 2019☆47May 28, 2019Updated 6 years ago
- Javakuka is an open-source project for creating Kuka Robot Language (KRL) codes in Java.☆10May 15, 2024Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- TobotAI Slam Android手机端建图应用软件☆17Nov 29, 2023Updated 2 years ago
- A converter for Euler Angle,Axis Angle,Quaternion,Rotation Matrix.☆16Jun 9, 2021Updated 4 years ago
- Actor Prioritized Experience Replay☆19Nov 20, 2023Updated 2 years ago
- PyTorch implementation of SAC-Q Reinforcement Learning Algorithm (tested on OpenAI Gym environments)☆38Feb 13, 2021Updated 5 years ago
- practice☆10Jun 30, 2020Updated 5 years ago
- Python library used to communicate and control Kuka manipulator (tested on KR16 model). Under continuous development.☆12Aug 9, 2018Updated 7 years ago
- Dataset collection and training code for "Ask Your Humans: Using Human Instructions to Improve Generalization in Reinforcement Learning"☆11Apr 8, 2025Updated last year
- TF2 Implementation of the Soft Actor-Critic Algorithm☆43Dec 8, 2022Updated 3 years ago
- Jaxplorer is a Jax reinforcement learning (RL) framework for exploring new ideas.☆13Jul 19, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Quadratic Programming for Continuous Control of Safety-Critical Multi-Agent Systems Under Uncertainty☆14Sep 7, 2024Updated last year
- Waste Sorting with Robot Arm Tossing☆10Sep 19, 2023Updated 2 years ago
- Implementation of Continuous Control RL Algorithms☆11Dec 8, 2022Updated 3 years ago
- Reinforcement Learning with Deep Energy-Based Policies☆438Nov 28, 2023Updated 2 years ago
- The continuous mountain car problem solved with DDPG☆13Apr 19, 2020Updated 6 years ago
- Designing an optimized path for multiple robots in a warehouse for picking and delivery operations using A* algorithm (shortest path) and…☆11Jul 28, 2023Updated 2 years ago
- discrete soft Q learning(SQL) and soft Q imitation learning(SQIL) implementation in pytorch, simple!☆58Oct 18, 2022Updated 3 years ago
- PyTorch implementation of D4PG with the SOTA IQN Critic instead of C51. Implementation includes also the extensions Munchausen RL and D2R…☆24Apr 7, 2021Updated 5 years ago
- PIC: Permutation Invariant Critic for Multi-Agent Deep Reinforcement Learning☆51Jun 7, 2021Updated 4 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- ☆19Feb 18, 2022Updated 4 years ago
- behavior cloning from observation☆38Dec 14, 2020Updated 5 years ago
- Code for paper "Learning to Guide: Guidance Law Based on Deep Meta-learning and Model Predictive Path Integral Control"☆11May 26, 2019Updated 6 years ago
- [Humanoids 2022] Learning Collision-free and Torque-limited Robot Trajectories based on Alternative Safe Behaviors☆23Dec 18, 2025Updated 5 months ago
- Pytorch implementation of large network design in continous control RL.☆19Jan 5, 2022Updated 4 years ago
- Represented Value Function Approach for Large Scale Multi Agent Reinforcement Learning☆17Mar 11, 2020Updated 6 years ago
- Implementation of CoDAIL in the ICLR 2020 paper <Multi-Agent Interactions Modeling with Correlated Policies>☆19Jun 17, 2021Updated 4 years ago
- Codes for the paper "SIDE: State Inference for Partially Observable Cooperative Multi-Agent Reinforcement Learning"☆11Jun 24, 2022Updated 3 years ago
- Uncertainty-Aware DRL for Autonomous Vehicle Crowd Navigation in Shared Space (IEEE-IV-2024)☆30Jun 22, 2024Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- reinforcement learning, navigation, unitree, velodyne, slam, collision avoidance☆29Jul 8, 2025Updated 10 months ago
- This is the official implementation of the voxel-based humanoid locomotion in "Gallant: Voxel Grid-based Humanoid Locomotion and Local-na…☆66Apr 24, 2026Updated 3 weeks ago
- PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT…☆16Nov 18, 2020Updated 5 years ago
- Deep RL agents with PyTorch☆36Sep 25, 2021Updated 4 years ago
- A python implementation of PROCLUS: PROjected CLUStering algorithm.☆10Jan 12, 2015Updated 11 years ago
- Recommendation engine and it's algorithms in python , R .☆12Oct 26, 2018Updated 7 years ago
- Multi-Agent Determinantal Q-Learning☆43Nov 22, 2022Updated 3 years ago