Implementation of PPO in Pytorch
☆41Dec 6, 2017Updated 8 years ago
Alternatives and similar repositories for PPO-Pytorch
Users that are interested in PPO-Pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- PyTorch implementation of Proximal Policy Optimization☆53Dec 20, 2017Updated 8 years ago
- An unofficial PyTorch implementation of the HAN and AdaHAN models presented in the "Learning Visual Question Answering by Bootstrapping H…☆54Sep 1, 2018Updated 7 years ago
- ICML 2018 Self-Imitation Learning☆276Apr 18, 2020Updated 6 years ago
- Charades Object Detection Dataset (ICCV 2017)☆31May 30, 2018Updated 7 years ago
- Experiments with differentiable stacks and queues in PyTorch☆145Oct 7, 2019Updated 6 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Ape-X DQN & DDPG with pytorch & tensorboard☆102Jun 18, 2019Updated 6 years ago
- Decoupling Dynamics and Reward for Transfer Learning☆16Sep 7, 2018Updated 7 years ago
- Avoiding catastrophic failures in reinforcement learning by learning to shape rewards.☆10Nov 13, 2017Updated 8 years ago
- Pytorch implementation of Distributed Proximal Policy Optimization: https://arxiv.org/abs/1707.02286☆184Mar 25, 2018Updated 8 years ago
- A collection of pytorch models and calibration types for generating discrete objects: equations, molecules, etc☆11Jul 6, 2023Updated 2 years ago
- Pytorch Implementation of Proximal Policy Optimization Algorithm☆20Mar 7, 2018Updated 8 years ago
- A working implementation of the Categorical DQN (Distributional RL).☆95Apr 7, 2018Updated 8 years ago
- Code to reproduce Supervised Policy Update (ICLR 2019)☆17Dec 8, 2022Updated 3 years ago
- OpenAI Gym Environment for ROS.☆13Nov 1, 2017Updated 8 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Code for Discriminability objective for training descriptive captions(CVPR 2018)☆109Nov 21, 2019Updated 6 years ago
- Contrast between ShuffleNet V2 and MnasNet.(Non-official implement In PyTorch)☆12Oct 25, 2018Updated 7 years ago
- ImageNet training code that implements academic defaults☆12Jul 15, 2021Updated 4 years ago
- for learning reinforcement learning using PyTorch.☆64Oct 2, 2019Updated 6 years ago
- PyTorch implementation of CVPR'18 - Perturbative Neural Networks http://xujuefei.com/pnn.html☆57Oct 8, 2018Updated 7 years ago
- Random Network Distillation(RND) algo in Pytorch☆51Feb 26, 2019Updated 7 years ago
- ☆19Jun 11, 2022Updated 3 years ago
- ☆58Aug 28, 2018Updated 7 years ago
- NYU GSAS PhD thesis template☆11May 14, 2020Updated 6 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Distributed A3C☆34Dec 22, 2017Updated 8 years ago
- Pytorch implementation of HDGan☆148Nov 1, 2018Updated 7 years ago
- Implements pytorch code for the Accelerated SGD algorithm.☆217Mar 10, 2018Updated 8 years ago
- Implementation of Poincare Embedding in PyTorch☆13Jul 27, 2017Updated 8 years ago
- E2C implementation in PyTorch☆43Jul 5, 2017Updated 8 years ago
- Library for model based RL in robotics☆37Sep 10, 2018Updated 7 years ago
- Source code for OpenAI Retro Contest for Sonic the Hedgehog☆31Aug 20, 2018Updated 7 years ago
- Pytorch NN helpers☆20May 3, 2024Updated 2 years ago
- MRI analysis using PyTorch and MedicalTorch☆64Oct 16, 2018Updated 7 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Unsupervised instance segmentation via active robot interaction☆76Jul 1, 2022Updated 3 years ago
- Code Released for NeurIPS 2018 paper: Synthesized Policies for Transfer and Adaptation across Tasks and Environments☆16Apr 17, 2019Updated 7 years ago
- PyTorch C++ Reinforcement Learning☆534May 3, 2020Updated 6 years ago
- replicate the results of rule extract lstm☆16Jun 9, 2017Updated 8 years ago
- Implement Conditional VAE and train on MNIST by tensorflow 1.3.0.☆10Nov 7, 2017Updated 8 years ago
- ☆13Mar 26, 2019Updated 7 years ago
- Implementation of AlphaZero in PyTorch.☆10Apr 19, 2019Updated 7 years ago