Simple, readable, yet full-featured implementation of PPO in Pytorch
☆52Apr 25, 2025Updated 11 months ago
Alternatives and similar repositories for pytorch-ppo
Users that are interested in pytorch-ppo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Repository containing Robot Servers ROS packages☆30Jul 22, 2025Updated 8 months ago
- This project was created for Unity ML-Agents Challenge - https://connect.unity.com/challenges/ml-agents-1☆12Aug 15, 2020Updated 5 years ago
- Modular Object-Oriented Games (MOOG): Python-based game engine for reinforcement learning, psychology, and neurophysiology.☆39Sep 4, 2025Updated 7 months ago
- ppo-lstm-parallel☆49Mar 26, 2019Updated 7 years ago
- Implementation of HER algorithm in the bit-flipping environment.☆17Feb 20, 2018Updated 8 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- A MATLAB simple interactive Reinforcement Learning environment for Evolutionary Neural Network-based car with a proximity sensor☆14Apr 11, 2019Updated 7 years ago
- Example using package.xml to set gazebo model paths☆12Sep 30, 2018Updated 7 years ago
- implementation of https://arxiv.org/pdf/2312.09299☆21Jul 3, 2024Updated last year
- NeurIPS Reproducibility Challenge 2019☆21Feb 25, 2020Updated 6 years ago
- 4-bit Shampoo for Memory-Efficient Network Training (NeurIPS 2024)☆13Feb 13, 2025Updated last year
- ☆12Jan 18, 2022Updated 4 years ago
- Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch☆2,336Jul 9, 2024Updated last year
- KANs and MLPs☆12Jun 7, 2024Updated last year
- Files from the published Alpha Star paper by DeepMind☆18Nov 14, 2019Updated 6 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Gymnasium environment for reinforcement learning with multicopters☆32Jun 4, 2024Updated last year
- ☆14Oct 11, 2022Updated 3 years ago
- ☆14Jun 22, 2025Updated 9 months ago
- Graph convolutional memory☆16May 26, 2022Updated 3 years ago
- Train to 94% on CIFAR-10 in 4.4 seconds on a single A100☆12Dec 30, 2023Updated 2 years ago
- Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers.☆19Jul 24, 2025Updated 8 months ago
- Maximal Update Parametrization (μP) with Flax & Optax.☆16Dec 27, 2023Updated 2 years ago
- Visual Reaction: Learning to Play Catch with Your Drone☆13Jul 23, 2023Updated 2 years ago
- An implementation of DecorrelatedBN by tensorflow☆13Jun 30, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- pytest support for ROS☆16Mar 7, 2023Updated 3 years ago
- Official implementation for the paper: "Shallow Updates for Deep Reinforcement Learning"☆18Nov 2, 2017Updated 8 years ago
- A URSim (Universal Robots Simulator) Docker Container with a Browser Accessible Interface☆16Nov 11, 2019Updated 6 years ago
- ☆21Sep 6, 2021Updated 4 years ago
- An autonomous grasping solution for the Emika Franka Panda robot.☆15May 22, 2023Updated 2 years ago
- ☆21Jan 23, 2024Updated 2 years ago
- Class project for COMP-781, Robotics. This is a CUDA-based collision detector for motion planning.☆13Apr 29, 2019Updated 6 years ago
- PGQ is an approach to combine Policy Gradient and Q-Learning. This repository will contain an implementation of PGQ.☆15Mar 9, 2017Updated 9 years ago
- Orthogonal Matching Pursuit, parallelized on both CPU and GPU. 100x+ Speedup☆16Mar 30, 2026Updated 2 weeks ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Broodwar replays dumper (using BWAPI)☆13Apr 19, 2012Updated 13 years ago
- Replicating Imagination-Augmented Agents for Deep Reinforcement Learning☆20Dec 17, 2017Updated 8 years ago
- [ICCV 2021] GarmentNets: Category-Level Pose Estimation for Garments via Canonical Space Shape Completion☆66Mar 6, 2023Updated 3 years ago
- Code for "SCL-RAI: Span-based Contrastive Learning with Retrieval Augmented Inference for Unlabeled Entity Problem in NER" @COLING-2022☆11Aug 20, 2022Updated 3 years ago
- ☆99Mar 24, 2023Updated 3 years ago
- ☆12Mar 28, 2019Updated 7 years ago
- An implementation of the Augmented Random Search algorithm☆14Jan 29, 2022Updated 4 years ago