Simple, readable, yet full-featured implementation of PPO in Pytorch
☆51Apr 25, 2025Updated 11 months ago
Alternatives and similar repositories for pytorch-ppo
Users that are interested in pytorch-ppo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Repository containing Robot Servers ROS packages☆30Jul 22, 2025Updated 8 months ago
- This project was created for Unity ML-Agents Challenge - https://connect.unity.com/challenges/ml-agents-1☆12Aug 15, 2020Updated 5 years ago
- ppo-lstm-parallel☆49Mar 26, 2019Updated 7 years ago
- Implementation of HER algorithm in the bit-flipping environment.☆17Feb 20, 2018Updated 8 years ago
- Reinforcement Learning Algorithms with Unity 3D Environments☆18Jul 15, 2019Updated 6 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A MATLAB simple interactive Reinforcement Learning environment for Evolutionary Neural Network-based car with a proximity sensor☆14Apr 11, 2019Updated 6 years ago
- D3QN framework for distributed resource allocation☆18Jul 17, 2024Updated last year
- A series of improved methods are used for visual tracking☆10Nov 29, 2025Updated 3 months ago
- implementation of https://arxiv.org/pdf/2312.09299☆21Jul 3, 2024Updated last year
- Example using package.xml to set gazebo model paths☆12Sep 30, 2018Updated 7 years ago
- NeurIPS Reproducibility Challenge 2019☆21Feb 25, 2020Updated 6 years ago
- 4-bit Shampoo for Memory-Efficient Network Training (NeurIPS 2024)☆13Feb 13, 2025Updated last year
- Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch☆2,324Jul 9, 2024Updated last year
- ☆13Apr 25, 2024Updated last year
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- ☆33Jun 14, 2018Updated 7 years ago
- ☆14Oct 11, 2022Updated 3 years ago
- The website of Matterport3D-Layout.☆17Sep 9, 2020Updated 5 years ago
- Gymnasium environment for reinforcement learning with multicopters☆32Jun 4, 2024Updated last year
- Graph convolutional memory☆16May 26, 2022Updated 3 years ago
- Adaptable Agent Populations via a Generative Model of Policies☆12Oct 14, 2021Updated 4 years ago
- Maximal Update Parametrization (μP) with Flax & Optax.☆16Dec 27, 2023Updated 2 years ago
- [TCSVT2025] AVLTrack: Dynamic Sparse Learning for Aerial Vision-Language Tracking☆17Mar 10, 2026Updated 2 weeks ago
- Paper list for vision-language tracking☆40Nov 10, 2025Updated 4 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Official implementation for the paper: "Shallow Updates for Deep Reinforcement Learning"☆18Nov 2, 2017Updated 8 years ago
- We optimize SIEP algorithm in multiple intelligent agents scenario and comparatively research A*, DFS, BFS, Dijkstra, PFP and PRM.☆15Jul 31, 2024Updated last year
- <개발자를 위한 필수 수학>(한빛미디어, 2024)의 코드 저장소☆18Jan 9, 2025Updated last year
- ICONIC-444: A 3.1-Million-Image Dataset for OOD Detection Research☆32Jan 19, 2026Updated 2 months ago
- Artifacts for the PLDI 2023 paper "Search-Based Regular Expression Inference on a GPU"☆17Feb 26, 2025Updated last year
- A URSim (Universal Robots Simulator) Docker Container with a Browser Accessible Interface☆16Nov 11, 2019Updated 6 years ago
- ☆25May 10, 2025Updated 10 months ago
- ☆21Sep 6, 2021Updated 4 years ago
- ☆18Apr 15, 2021Updated 4 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Academic Study of A Multi-Agent Quadrotors (Drones) Simulator with Obstacles and Goals Using the Artificial Potential Field Approach(APF)…☆18Feb 13, 2022Updated 4 years ago
- Use Gaussian processes to estimate CNN classification uncertainty☆12Mar 3, 2018Updated 8 years ago
- A python3 RC4 implementation that doesn't suck. (i.e. it's actually binary-safe...)☆19Sep 3, 2024Updated last year
- PGQ is an approach to combine Policy Gradient and Q-Learning. This repository will contain an implementation of PGQ.☆15Mar 9, 2017Updated 9 years ago
- Class project for COMP-781, Robotics. This is a CUDA-based collision detector for motion planning.☆13Apr 29, 2019Updated 6 years ago
- Broodwar replays dumper (using BWAPI)☆13Apr 19, 2012Updated 13 years ago
- Replicating Imagination-Augmented Agents for Deep Reinforcement Learning☆20Dec 17, 2017Updated 8 years ago