Policy Gradient Actor-Critic PyTorch | Lunar Lander v2
☆76May 7, 2019Updated 7 years ago
Alternatives and similar repositories for Actor-Critic-PyTorch
Users that are interested in Actor-Critic-PyTorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- OpenAI Gym's LunarLander-v2 Implementation☆41Apr 27, 2024Updated 2 years ago
- Actor Critic model to play Cartpole game☆53Aug 4, 2018Updated 7 years ago
- Deep Q-learning approach to OpenAI Gym's Lunar Lander☆15Jul 27, 2017Updated 8 years ago
- Twin Delayed DDPG (TD3) PyTorch solution for Roboschool and Box2d environment☆107Jun 7, 2019Updated 6 years ago
- ☆10Aug 8, 2021Updated 4 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Experiments of the three PPO-Algorithms (PPO, clipped PPO, PPO with KL-penalty) proposed by John Schulman et al. on the 'Cartpole-v1' env…☆13Nov 14, 2021Updated 4 years ago
- PyData Boston 2013 talks: "Intro to scikit-learn" & "Realtime Predictive Analytics: Using scikit-learn and RabbitMQ"☆11Jan 5, 2014Updated 12 years ago
- Minimal Implementation of Deep RL Algorithms in PyTorch☆25May 10, 2020Updated 6 years ago
- Translation and understanding of the Pop-art paper.☆18Oct 21, 2019Updated 6 years ago
- Accepted by AROB 2021. A car-agent navigates in complex traffic conditions by Mixed_Input_PPO_CNN_LSTM model.☆14May 22, 2021Updated 5 years ago
- ☆13Jan 14, 2020Updated 6 years ago
- A library for ready-made reinforcement learning agents and reusable components for neat prototyping☆301Feb 13, 2024Updated 2 years ago
- Official implementation of paper "Neural Combinatorial Optimization for Multiobjective Task Offloading in Mobile Edge Computing"☆20Aug 26, 2025Updated 9 months ago
- Deep Reinforcement Learning by using Phasic Policy Gradient in Pytorch & Tensorflow☆20Oct 5, 2021Updated 4 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Quantum Principal Component Analysis (QPCA) as a generative model☆13Apr 5, 2022Updated 4 years ago
- Automatic code generator for training Reinforcement Learning policies☆11Jan 3, 2021Updated 5 years ago
- ☆21Jul 4, 2019Updated 6 years ago
- Bidirectionally-Coordinated Net Implements with PyTorch 1.0☆15Apr 10, 2019Updated 7 years ago
- A toy example of Policy Gradient implemented in Pytorch☆95Jan 24, 2018Updated 8 years ago
- This is pytorch version of maddpg.☆10Jun 23, 2020Updated 5 years ago
- Self-Labeling the Job Shop Scheduling Problem☆22Jun 26, 2024Updated last year
- Compact LaTeX Template for the standard institute format. This is a modification of MR Bharath's LaTeX template. I've made it more compac…☆11Dec 28, 2016Updated 9 years ago
- Python-based tool to generate anthropometric human whole-body models in a URDF format☆33Jun 20, 2025Updated 11 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Account name changed!☆21Apr 13, 2012Updated 14 years ago
- Qiskit camp 2019 hackathon: Using QAOA for solving the graph coloring problem☆11May 21, 2019Updated 7 years ago
- ☆24Feb 22, 2023Updated 3 years ago
- Solutions for different Reinforcement Learning environments☆26Aug 2, 2024Updated last year
- Reinforcement Learning Environments for Omniverse Isaac Gym☆10May 9, 2023Updated 3 years ago
- Semantic-Aware Fine-Grained Correspondence, at ECCV 2022 (Oral)☆14Oct 29, 2022Updated 3 years ago
- TensorFlow implementation of Deep Reinforcement Learning papers☆28Dec 31, 2016Updated 9 years ago
- advantage actor-critic reinforcement learning for openai gym cartpole☆66Jul 13, 2017Updated 8 years ago
- Separating value functions across time-scales.☆17May 13, 2019Updated 7 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Suggestions for those interested in developing audio applications of machine learning☆14Jan 10, 2020Updated 6 years ago
- ☆15Mar 10, 2021Updated 5 years ago
- Official Implementation of Memento☆21Nov 19, 2024Updated last year
- code for our paper "Understanding by Understanding Not: Modeling Negation in Language Models"☆16Aug 15, 2022Updated 3 years ago
- ☆12Aug 24, 2021Updated 4 years ago
- KoRean based ELECTRA pre-trained models (KR-ELECTRA) for Tensorflow and PyTorch☆15Feb 13, 2022Updated 4 years ago
- 在PyTorch上重构multi-agent deep deterministic policy gradient(MADDPG),将https://github.com/xuemei-ye/maddpg-mpe 修改到自己电脑上可运行。因为本人笔记本没有CUDA,实验速度…☆14May 10, 2019Updated 7 years ago