Implementation of Algorithms from the Policy Gradient Family. Currently includes: A2C, A3C, DDPG, TD3, SAC
☆100Jul 23, 2019Updated 6 years ago
Alternatives and similar repositories for Policy-Gradient-Methods
Users that are interested in Policy-Gradient-Methods are comparing it to the libraries listed below.
- A library for ready-made reinforcement learning agents and reusable components for neat prototyping☆301Feb 13, 2024Updated 2 years ago
- Implementations of algorithms from the Q-learning family. Implementations include: DQN, DDQN, Dueling DQN, PER+DQN, Noisy DQN, C51☆319Mar 9, 2020Updated 6 years ago
- Implementation of the TD3 algorithm written in Pytorch☆12Dec 8, 2022Updated 3 years ago
- Implementation of vanilla stochastic (categorical) policy gradient algorithm to play cartpole.☆16Apr 1, 2021Updated 4 years ago
- Modular PyTorch implementation of policy gradient methods☆25Nov 15, 2018Updated 7 years ago
- Simple A3C implementation with pytorch + multiprocessing☆656Mar 10, 2023Updated 3 years ago
- Implementation of Multi-Agent Reinforcement Learning algorithm(s). Currently includes: MADDPG☆67Jul 9, 2019Updated 6 years ago
- ☆17Jul 18, 2022Updated 3 years ago
- ☆18Aug 14, 2023Updated 2 years ago
- Implementation of robust adaptive control methods for the linear quadratic regulator☆36Dec 13, 2021Updated 4 years ago
- Pytorch version of the MPC in model-based reinforcement learning (MBRL), currently only tested in the CartPole-swing-up environment☆90Jul 25, 2020Updated 5 years ago
- PyTorch implementation of Soft Actor-Critic + Autoencoder(SAC+AE)☆254May 3, 2020Updated 5 years ago
- Code for ICML 2019 paper on "Fast and Simple Natural-Gradient Variational Inference with Mixture of Exponential-family Approximations"☆19Jan 2, 2021Updated 5 years ago
- PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....☆4,593Mar 24, 2023Updated 3 years ago
- planning trajectories for UAVs☆12Mar 21, 2021Updated 5 years ago
- Models and code for the ICLR 2020 workshop paper "Towards Understanding Normalization in Neural ODEs"☆16Apr 27, 2020Updated 5 years ago
- ☆32Mar 12, 2024Updated 2 years ago
- Machine Learning Course From Scratch☆13Jul 24, 2024Updated last year
- Official code for ICLR 2024 paper, SEABO: A Simple Search-Based Method for Offline Imitation Learning☆12Jan 19, 2024Updated 2 years ago
- RL algorithm implementations from scratch.☆17Nov 22, 2020Updated 5 years ago
- ☆31Oct 16, 2021Updated 4 years ago
- Soft Actor-Critic☆1,230Nov 29, 2023Updated 2 years ago
- Author's PyTorch implementation of TD3 for OpenAI gym tasks☆2,044Jul 14, 2023Updated 2 years ago
- Network Randomization: A Simple Technique for Generalization in Deep Reinforcement Learning / ICLR 2020☆56Apr 27, 2020Updated 5 years ago
- Unofficial Pytorch code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"☆198Dec 8, 2022Updated 3 years ago
- ☆23Mar 1, 2024Updated 2 years ago
- SIA - C++/Python library for model-based stochastic estimation and optimal control☆23Apr 3, 2024Updated last year
- Some AI planners that work with PDDL, file examples and integration in ROSPlan.☆25Apr 27, 2018Updated 7 years ago
- UAV PATH TRACKING AND DYNAMIC AVOIDANCE BASED ON ADS-B AND DEEP REINFORCEMENT LEARNING for University of Bristol RP3 final☆12Apr 18, 2023Updated 2 years ago
- ☆14Sep 1, 2021Updated 4 years ago
- Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL☆400Dec 18, 2021Updated 4 years ago
- Hello😜☆29Nov 8, 2020Updated 5 years ago
- An implementation of effective policy ensemble.☆16Jul 5, 2023Updated 2 years ago
- This repository contains PyTorch implementations of deep reinforcement learning algorithms and environments for Robotics and Controls. T…☆19Mar 20, 2022Updated 4 years ago
- Map-Elites based on Evolution Strategies☆33Feb 11, 2022Updated 4 years ago
- A toolbox for trajectory optimization of dynamical systems☆56Jun 16, 2022Updated 3 years ago
- [ICML 2022] Robust Deep Reinforcement Learning through Bootstrapped Opportunistic Curriculum☆11Jul 15, 2022Updated 3 years ago
- a simple demo of pendulum control using DDPG algorithm☆13Jan 27, 2018Updated 8 years ago
- Online informative path planning for active information gathering of a 3d surface☆17Dec 20, 2021Updated 4 years ago