vwxyzjn/PPO-Implementation-Deep-Dive

DEPRECATED - please visit https://github.com/vwxyzjn/ppo-implementation-details

☆46

Alternatives and similar repositories for PPO-Implementation-Deep-Dive

Users who are interested in PPO-Implementation-Deep-Dive are comparing it to the libraries listed below.

anishmadan23 / MAML_Pytorch_RL
☆10 • Aug 8, 2021 • Updated 4 years ago
vwxyzjn / gym-pysc2
Gym wrapper for pysc2
☆10 • Sep 16, 2022 • Updated 3 years ago
ywtseng / NTUPlacement
☆14 • Oct 23, 2018 • Updated 7 years ago
PKU-IDEA / MacroRank
Official implementation of MacroRank: Ranking Macro Placement Solutions Leveraging Translation Equivariancy (ASP-DAC 2023)
☆18 • Jun 3, 2023 • Updated 3 years ago
gebob19 / rl_with_jax
clear single-file JAX implementations of common RL algorithms
☆15 • Sep 5, 2021 • Updated 4 years ago
MarcoMeter / neroRL
Deep Reinforcement Learning Framework done with PyTorch
☆43 • Mar 12, 2025 • Updated last year
atlarge-research / opendc-simulator
Datacenter simulation toolkit for the OpenDC project
☆10 • Aug 24, 2020 • Updated 5 years ago
vwxyzjn / jupyter_disqus
Add Disqus to your Jupyter notebook.
☆14 • Feb 14, 2018 • Updated 8 years ago
vwxyzjn / a2c_is_a_special_case_of_ppo
A2C is a special case of PPO!
☆23 • May 20, 2022 • Updated 4 years ago
vwxyzjn / invalid-action-masking
Source Code for A Closer Look at Invalid Action Masking in Policy Gradient Algorithms
☆168 • May 9, 2023 • Updated 3 years ago
vwxyzjn / SC2AI
Integrated Tensorforce and OpenAI Gym to train SC II game agents.
☆13 • Oct 26, 2019 • Updated 6 years ago
IgnisDa / rust-libs
A set of packages for different rust needs
☆15 • Jan 11, 2023 • Updated 3 years ago
fanhanwei / FNN_MFRL_ArchDSE
[DAC2024] Explainable Fuzzy Neural Network with Multi-Fidelity Reinforcement Learning for Micro-Architecture Design Space Exploration
☆10 • Jul 21, 2026 • Updated last week
CatherineMeng / FGYM-user-demo
Demonstrating the usage of FGYM: A Toolkit for benchmarking FPGA-accelerated Reinforcement Learning
☆14 • Aug 12, 2021 • Updated 4 years ago
zt95 / infinite-horizon-off-policy-estimation
☆13 • Apr 3, 2019 • Updated 7 years ago
TianhongDai / metaworld-sac
☆12 • Aug 28, 2020 • Updated 5 years ago
IgnisDa / printr
The smarter echo alternative
☆12 • Jan 28, 2022 • Updated 4 years ago
zhenv5 / atp
ATP: Directed Graph Embedding with Asymmetric Transitivity Preservation
☆10 • Apr 18, 2019 • Updated 7 years ago
airjerry1216 / VLSI-Physical-Design-Automation
NTHU CS6135 VLSI Physical Design Automation
☆11 • Mar 12, 2022 • Updated 4 years ago
mishig25 / synthetic-gradients-keras
Keras implementation of `Decoupled Neural Interfaces using Synthetic Gradients`
☆13 • Oct 19, 2018 • Updated 7 years ago
davevad93 / pass-gen
A simple password generator web app
☆16 • Mar 30, 2026 • Updated 3 months ago
NotAnyMike / RL-Football
Testing different RL algorithms for multi-agent environments. From SARSA, QLearning to Independent Q-Learning, Joint Action Learning and …
☆12 • Mar 29, 2019 • Updated 7 years ago
hegde95 / Agents_that_Listen
Train an agent to play VizDoom with multi sensory inputs. Trained using sample factory
☆14 • Jul 9, 2021 • Updated 5 years ago
kayuksel / combinatorial-bandit
A method to search for a subset of best performing items wrt black-box reward function
☆15 • Jun 7, 2019 • Updated 7 years ago
junkwhinger / PPO_PyTorch
This repo contains PPO implementation in PyTorch for LunarLander-v2
☆11 • Jun 26, 2020 • Updated 6 years ago
RickyMexx / SAC-tf2
Implementation of Soft Actor-Critic (SAC) algorithm using TensorFlow 2.1.0
☆12 • May 13, 2020 • Updated 6 years ago
JiazhengZhang / NIRM
NIRM: Dismantling Complex Networks by a Neural Model Trained from Tiny Networks
☆15 • Aug 27, 2022 • Updated 3 years ago
indylab / nxdo
Deep RL Code for XDO: A Double Oracle Algorithm for Extensive-Form Games
☆40 • Aug 27, 2021 • Updated 4 years ago
vwxyzjn / cleanba
CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL
☆125 • Aug 22, 2024 • Updated last year
flowersteam / EAGER
☆10 • Oct 11, 2022 • Updated 3 years ago
davevad93 / rest-countries-django-app
Small project built with Django that retrieves data from the REST Countries API and the Wikipedia API.
☆22 • Mar 26, 2026 • Updated 4 months ago
MichalOp / MineRL2020
☆16 • Aug 7, 2021 • Updated 4 years ago
entity-neural-network / enn-trainer
Reinforcement learning training framework for entity-gym environments.
☆17 • Mar 18, 2024 • Updated 2 years ago
Farama-Foundation / gym-examples
Example code for the Gym documentation
☆73 • Jun 9, 2023 • Updated 3 years ago
UniquezCs / Narwhal-volumetric-video-streaming-system
☆12 • Feb 17, 2022 • Updated 4 years ago
britig / Hierarchical-Program-Triggered-RL
This folder contains the experiments and code for Hierarchical Program Triggered RL paper
☆14 • Jan 7, 2022 • Updated 4 years ago
Bam4d / Neural-Game-Engine
Code to reproduce Neural Game Engine experiments and pre-trained models
☆41 • Jun 22, 2022 • Updated 4 years ago
morning9393 / HAPPO-HATRPO
☆48 • Nov 29, 2021 • Updated 4 years ago
zhanghuanhuan1994 / arsenal
☆12 • Apr 12, 2022 • Updated 4 years ago