Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
☆21May 26, 2021Updated 5 years ago
Alternatives and similar repositories for Parallel-PPO-PyTorch
Users that are interested in Parallel-PPO-PyTorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Pytorch implementation of [Feudal Net](https://arxiv.org/abs/1703.01161). ([Tensorflow version](https://github.com/dmakian/feudal_networ…☆18Jun 25, 2019Updated 7 years ago
- The demo and SDK of SLAMTEC Aurora, a cutting-edge, all-in-one localization and mapping sensor designed by SLAMTEC☆11Apr 25, 2026Updated 2 months ago
- pytorch implementation for "Mutual Information Neural Estimation"☆11Dec 13, 2019Updated 6 years ago
- PyTorch Implementation of Ape-X (Distributed prioritized experience replay) architecture with DQN learner☆28Sep 5, 2020Updated 5 years ago
- A well-documented A2C written in PyTorch☆53Jun 3, 2019Updated 7 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- The code of paper "Learning Heterogeneous Strategies via Graph-based Multi-agent Reinforcement Learning in Mixed Cooperative-Competitive …☆16Jul 17, 2021Updated 4 years ago
- ☆12Aug 15, 2020Updated 5 years ago
- ☆19Updated this week
- <Do it 강화학습 입문(Getting Started with Deep Reinforcement Learning)> 소스코드 저장소☆34Jul 5, 2021Updated 4 years ago
- This repository provides a summarization of recent empirical studies/human studies that measure human understanding with machine explanat…☆14Jul 24, 2024Updated last year
- Federated Reinforcement Learning☆12Jun 20, 2019Updated 7 years ago
- AdaRefiner: Refining Decisions of Language Models with Adaptive Feedback (NAACL 2024)☆19Aug 9, 2024Updated last year
- ☆79Jun 12, 2026Updated 3 weeks ago
- Re-implementation of Neural Architecture Search using Reinforcement Learning☆12May 21, 2018Updated 8 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆12Jun 26, 2020Updated 6 years ago
- Basic PyTorch Implementation of 'Neural Architecture Search with Reinforcement Learning' (https://arxiv.org/abs/1611.01578)☆13Feb 24, 2018Updated 8 years ago
- This is Pytorch implementation of our paper "LF-ViT: Reducing Spatial Redundancy in Vision Transformer for Efficient Image Recognition".☆10Sep 23, 2024Updated last year
- Code for the paper "Continual Model-Based Reinforcement Learning with Hypernetworks"☆15Jul 28, 2021Updated 4 years ago
- My thesis project☆10Jun 7, 2021Updated 5 years ago
- R2Plus1D MXNet Implementation☆11Jul 11, 2018Updated 7 years ago
- Simple verification experiments codes for multi-agent RL using OpenAI MPE environment☆35Jun 22, 2022Updated 4 years ago
- An project based on USB2CAN and RobStride motor.☆23May 17, 2026Updated last month
- A real-time food recognition and nutrition estimating system on Spark Streaming☆10Aug 18, 2019Updated 6 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Supporting material for Princeton ORF522☆14Aug 27, 2025Updated 10 months ago
- ☆15Dec 13, 2022Updated 3 years ago
- Variational Autoencoder (VAE)-like neural network to solve ideal MHD equilibrium in a tokamak☆11May 20, 2022Updated 4 years ago
- Part 1 project for ME5406 in NUS☆10Jun 25, 2021Updated 5 years ago
- A template for research projects in computer science/machine learning using python and julia☆100May 28, 2026Updated last month
- A PyTorch implementation of PTSA-MCTS from [Accelerating Monte Carlo Tree Search with Probability Tree State Abstraction].☆16Oct 21, 2023Updated 2 years ago
- Distributed Priortized Experience Replay☆10Aug 8, 2018Updated 7 years ago
- Flask for Computer-Vision Prototype☆13May 22, 2023Updated 3 years ago
- BPU Tools for LeRobot.☆35Jun 23, 2026Updated last week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- This is a project using neural-network reinforcement learning to solve the 8 puzzle problem (or even N puzzle)☆12Mar 24, 2018Updated 8 years ago
- Pallet loading problem solver with recursive partitioning approach for the packing of different rectangles in a rectangle.☆13Oct 1, 2012Updated 13 years ago
- A Python multigrid solver implementation for education.☆20May 11, 2015Updated 11 years ago
- Distributed DRL by Ray and TensorFlow Tutorial.☆10Dec 26, 2019Updated 6 years ago
- attention으로 시계열 예측은 할 수 없을까☆10Apr 30, 2021Updated 5 years ago
- A simulation benchmark in MuJoCo for dexterous grasping☆51Sep 21, 2025Updated 9 months ago
- Overlooked Factors in Concept-based Explanations: Dataset Choice, Concept Learnability, and Human Capability (CVPR 2023)☆10Mar 14, 2023Updated 3 years ago