Policy Gradient Actor-Critic PyTorch | Lunar Lander v2
☆75May 7, 2019Updated 6 years ago
Alternatives and similar repositories for Actor-Critic-PyTorch
Users that are interested in Actor-Critic-PyTorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Actor Critic model to play Cartpole game☆53Aug 4, 2018Updated 7 years ago
- Deep Q-learning approach to OpenAI Gym's Lunar Lander☆15Jul 27, 2017Updated 8 years ago
- PyTorch implementation of DDPG algorithm for continuous action reinforcement learning problem.☆422Mar 17, 2021Updated 5 years ago
- Twin Delayed DDPG (TD3) PyTorch solution for Roboschool and Box2d environment☆107Jun 7, 2019Updated 6 years ago
- [NeurIPS 2024] Maximum Entropy Reinforcement Learning via Energy-Based Normalizing Flow☆43Nov 2, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆10Aug 8, 2021Updated 4 years ago
- Experiments of the three PPO-Algorithms (PPO, clipped PPO, PPO with KL-penalty) proposed by John Schulman et al. on the 'Cartpole-v1' env…☆13Nov 14, 2021Updated 4 years ago
- Implementation of Receding Horizon Curiosity Algrithm☆13Mar 24, 2023Updated 3 years ago
- Accepted by AROB 2021. A car-agent navigates in complex traffic conditions by Mixed_Input_PPO_CNN_LSTM model.☆14May 22, 2021Updated 4 years ago
- Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch☆2,336Jul 9, 2024Updated last year
- ☆13Jan 14, 2020Updated 6 years ago
- A simple example of how to implement vector based DDPG using PyTorch and a ML-Agents environment.☆18Dec 23, 2018Updated 7 years ago
- ☆10Jun 21, 2021Updated 4 years ago
- This is MPE-pytorch, fix some bugs.☆11Apr 26, 2020Updated 5 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- A library for ready-made reinforcement learning agents and reusable components for neat prototyping☆301Feb 13, 2024Updated 2 years ago
- Official implementation of paper "Neural Combinatorial Optimization for Multiobjective Task Offloading in Mobile Edge Computing"☆18Aug 26, 2025Updated 7 months ago
- Deep Reinforcement Learning by using Phasic Policy Gradient in Pytorch & Tensorflow☆20Oct 5, 2021Updated 4 years ago
- ☆22Dec 3, 2025Updated 4 months ago
- This Python code implements an atmospheric dispersion model for estimating contaminant concentration using the Gaussian plume solution, s…☆14Aug 19, 2025Updated 8 months ago
- Quantum Principal Component Analysis (QPCA) as a generative model☆13Apr 5, 2022Updated 4 years ago
- Optimization models and their applications in power systems☆14Jan 11, 2018Updated 8 years ago
- This is our attempt at replicating the results of the famous ICRA 2015 paper on Intention aware Online POMDP planning for autonomous syst…☆16May 21, 2020Updated 5 years ago
- Bidirectionally-Coordinated Net Implements with PyTorch 1.0☆15Apr 10, 2019Updated 7 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆21Jul 4, 2019Updated 6 years ago
- A toy example of Policy Gradient implemented in Pytorch☆95Jan 24, 2018Updated 8 years ago
- Self-Labeling the Job Shop Scheduling Problem☆21Jun 26, 2024Updated last year
- Sumo OSM short usage tutorial☆15Feb 7, 2018Updated 8 years ago
- Reinforcement Learning Benchmark☆13Sep 9, 2020Updated 5 years ago
- Accompanying repository for Unsupervised Active Domain Randomization in Goal-Directed RL☆12Aug 4, 2020Updated 5 years ago
- Qiskit camp 2019 hackathon: Using QAOA for solving the graph coloring problem☆11May 21, 2019Updated 6 years ago
- ☆24Feb 22, 2023Updated 3 years ago
- PyTorch implementation for "Temperature as Uncertainty in Contrastive Learning" (https://arxiv.org/abs/2110.04403).☆16Oct 19, 2021Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Solutions for different Reinforcement Learning environments☆26Aug 2, 2024Updated last year
- Reinforcement Learning Environments for Omniverse Isaac Gym☆10May 9, 2023Updated 2 years ago
- TensorFlow implementation of Deep Reinforcement Learning papers☆28Dec 31, 2016Updated 9 years ago
- advantage actor-critic reinforcement learning for openai gym cartpole☆66Jul 13, 2017Updated 8 years ago
- Separating value functions across time-scales.☆17May 13, 2019Updated 6 years ago
- Suggestions for those interested in developing audio applications of machine learning☆14Jan 10, 2020Updated 6 years ago
- Codes used to perform the experiments described in this work: https://arxiv.org/abs/1904.05803☆12Aug 29, 2019Updated 6 years ago