Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
☆21May 26, 2021Updated 4 years ago
Alternatives and similar repositories for Parallel-PPO-PyTorch
Users that are interested in Parallel-PPO-PyTorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Pytorch implementation of [Feudal Net](https://arxiv.org/abs/1703.01161). ([Tensorflow version](https://github.com/dmakian/feudal_networ…☆17Jun 25, 2019Updated 6 years ago
- A sample Slurm Cluster Installation Guide on Debian☆14Mar 2, 2018Updated 8 years ago
- FOR my learing☆12Feb 19, 2026Updated last month
- The demo and SDK of SLAMTEC Aurora, a cutting-edge, all-in-one localization and mapping sensor designed by SLAMTEC☆12Mar 11, 2026Updated last month
- PyTorch Implementation of Ape-X (Distributed prioritized experience replay) architecture with DQN learner☆28Sep 5, 2020Updated 5 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- MENTOR is a highly efficient visual RL algorithm that excels in both simulation and real-world complex robotic learning tasks.☆27Jul 9, 2025Updated 9 months ago
- The code of paper "Learning Heterogeneous Strategies via Graph-based Multi-agent Reinforcement Learning in Mixed Cooperative-Competitive …☆16Jul 17, 2021Updated 4 years ago
- ☆12Aug 15, 2020Updated 5 years ago
- [ICLR 2023] The official code for paper "Guarded Policy Optimization with Imperfect Online Demonstrations"☆14Apr 30, 2023Updated 2 years ago
- <Do it 강화학습 입문(Getting Started with Deep Reinforcement Learning)> 소스코드 저장소☆33Jul 5, 2021Updated 4 years ago
- Official code repository for "Video-Mined Task Graphs for Keystep Recognition in Instructional Videos" arXiv, 2023☆14Apr 1, 2024Updated 2 years ago
- Re-implementation of Neural Architecture Search using Reinforcement Learning☆12May 21, 2018Updated 7 years ago
- ☆12Jun 26, 2020Updated 5 years ago
- Basic PyTorch Implementation of 'Neural Architecture Search with Reinforcement Learning' (https://arxiv.org/abs/1611.01578)☆13Feb 24, 2018Updated 8 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- This is Pytorch implementation of our paper "LF-ViT: Reducing Spatial Redundancy in Vision Transformer for Efficient Image Recognition".☆11Sep 23, 2024Updated last year
- An online federated reinforcement learning algorithm published in INFOCOM2024☆17Dec 1, 2024Updated last year
- A tool for design pattern recognition on blockchain through static code analysis☆10Jun 8, 2024Updated last year
- ☆10Aug 17, 2021Updated 4 years ago
- Learning-based Grasp Synthesis for Dexterous Hand☆34Sep 21, 2025Updated 6 months ago
- R2Plus1D MXNet Implementation☆11Jul 11, 2018Updated 7 years ago
- Simple verification experiments codes for multi-agent RL using OpenAI MPE environment☆34Jun 22, 2022Updated 3 years ago
- KiCAD plugin written in Python for programatically placing clusters of components onto a PCB from a layout file.☆10Jun 30, 2021Updated 4 years ago
- This is the Pytorch implementation of paper--Training deep neural-networks using a noise adaptation layer.☆10Apr 18, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- The light codes for the paper published in JMS named 'Solving task scheduling problems in cloud manufacturing via attention mechanism and…☆20May 15, 2023Updated 2 years ago
- A real-time food recognition and nutrition estimating system on Spark Streaming☆10Aug 18, 2019Updated 6 years ago
- ☆15Dec 13, 2022Updated 3 years ago
- Genetic Algorithm for integer constrained optimization and its applications☆11Oct 5, 2023Updated 2 years ago
- VLSI placement and routing tool☆15Dec 20, 2025Updated 3 months ago
- Variational Autoencoder (VAE)-like neural network to solve ideal MHD equilibrium in a tokamak☆11May 20, 2022Updated 3 years ago
- Part 1 project for ME5406 in NUS☆10Jun 25, 2021Updated 4 years ago
- 用Paddle复现Recipes for building an open-domain chatbot论文☆11Nov 1, 2021Updated 4 years ago
- A PyTorch implementation of PTSA-MCTS from [Accelerating Monte Carlo Tree Search with Probability Tree State Abstraction].☆16Oct 21, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- The code enables to perform Bayesian inference in an efficient manner through the use of Hamiltonian Neural Networks (HNNs), Deep Neural …☆17Jan 14, 2023Updated 3 years ago
- A template for research projects in computer science/machine learning using python and julia☆101Feb 18, 2026Updated last month
- BPU Tools for LeRobot.☆30Mar 10, 2026Updated last month
- A Python multigrid solver implementation for education.☆20May 11, 2015Updated 10 years ago
- testing MLP, DQN, PPO, SAC, policy-gradient by snakeAI☆11May 6, 2025Updated 11 months ago
- Distributed DRL by Ray and TensorFlow Tutorial.☆10Dec 26, 2019Updated 6 years ago
- A simulation benchmark in MuJoCo for dexterous grasping☆44Sep 21, 2025Updated 6 months ago