Implementation of Algorithms from the Policy Gradient Family. Currently includes: A2C, A3C, DDPG, TD3, SAC
☆100Jul 23, 2019Updated 6 years ago
Alternatives and similar repositories for Policy-Gradient-Methods
Users that are interested in Policy-Gradient-Methods are comparing it to the libraries listed below.
- A library for ready-made reinforcement learning agents and reusable components for neat prototyping☆301Feb 13, 2024Updated 2 years ago
- Implementations of algorithms from the Q-learning family. Implementations include: DQN, DDQN, Dueling DQN, PER+DQN, Noisy DQN, C51☆322Mar 9, 2020Updated 6 years ago
- Implementation of the TD3 algorithm written in Pytorch☆12Dec 8, 2022Updated 3 years ago
- Implementation of vanilla stochastic (categorical) policy gradient algorithm to play cartpole.☆16Apr 1, 2021Updated 5 years ago
- Modular PyTorch implementation of policy gradient methods☆24Nov 15, 2018Updated 7 years ago
- Simple A3C implementation with pytorch + multiprocessing☆657Mar 10, 2023Updated 3 years ago
- Implementation of Multi-Agent Reinforcement Learning algorithm(s). Currently includes: MADDPG☆67Jul 9, 2019Updated 6 years ago
- ☆14Aug 26, 2018Updated 7 years ago
- ROS2 template for inverse kinematics for leader-follower teleoperation using Pinocchio☆18Jun 13, 2024Updated last year
- ☆18Aug 14, 2023Updated 2 years ago
- [Review] Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environment☆10Dec 22, 2018Updated 7 years ago
- Implementation of robust adaptive control methods for the linear quadratic regulator☆36Dec 13, 2021Updated 4 years ago
- Code to reproduce results in peer-reviewed publications☆23Dec 19, 2025Updated 5 months ago
- PyTorch version of the MPC in model-based reinforcement learning (MBRL), currently only tested in the CartPole-swing-up environment☆89Jul 25, 2020Updated 5 years ago
- PyTorch implementation of Soft Actor-Critic + Autoencoder(SAC+AE)☆256May 3, 2020Updated 6 years ago
- PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....☆4,628Mar 24, 2023Updated 3 years ago
- planning trajectories for UAVs☆12Mar 21, 2021Updated 5 years ago
- PyTorch Implementation of Ape-X (Distributed prioritized experience replay) architecture with DQN learner☆28Sep 5, 2020Updated 5 years ago
- Machine Learning Course From Scratch☆13Jul 24, 2024Updated last year
- Models and code for the ICLR 2020 workshop paper "Towards Understanding Normalization in Neural ODEs"☆16Apr 27, 2020Updated 6 years ago
- Host directory for all GradientCrescent and TowardsDataScience Deep Learning projects☆37Sep 12, 2020Updated 5 years ago
- RLCodebase: PyTorch Codebase For Deep Reinforcement Learning Algorithms☆28Jul 23, 2023Updated 2 years ago
- Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning☆11Jun 16, 2022Updated 3 years ago
- ☆34Oct 16, 2021Updated 4 years ago
- Soft Actor-Critic☆1,260Nov 29, 2023Updated 2 years ago
- Author's PyTorch implementation of TD3 for OpenAI gym tasks☆2,075Jul 14, 2023Updated 2 years ago
- Unofficial Pytorch code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"☆197Dec 8, 2022Updated 3 years ago
- Network Randomization: A Simple Technique for Generalization in Deep Reinforcement Learning / ICLR 2020☆56Apr 27, 2020Updated 6 years ago
- ☆24Mar 1, 2024Updated 2 years ago
- SIA - C++/Python library for model-based stochastic estimation and optimal control☆22Apr 3, 2024Updated 2 years ago
- UAV PATH TRACKING AND DYNAMIC AVOIDANCE BASED ON ADS-B AND DEEP REINFORCEMENT LEARNING for University of Bristol RP3 final☆12Apr 18, 2023Updated 3 years ago
- ☆16Oct 9, 2021Updated 4 years ago
- ☆14Sep 1, 2021Updated 4 years ago
- Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL☆408Dec 18, 2021Updated 4 years ago
- An implementation of effective policy ensemble.☆16Jul 5, 2023Updated 2 years ago
- This repository contains PyTorch implementations of deep reinforcement learning algorithms and environments for Robotics and Controls. T…☆19Mar 20, 2022Updated 4 years ago
- Reproduction of the paper "Soft Q-Learning with Mutual Information Regularization" CoRL 2019.☆10Jan 10, 2019Updated 7 years ago
- Map-Elites based on Evolution Strategies☆33Feb 11, 2022Updated 4 years ago
- A toolbox for trajectory optimization of dynamical systems☆56Jun 16, 2022Updated 3 years ago