bnelo12/PPO-Implemnetation

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/bnelo12/PPO-Implemnetation)

bnelo12 / PPO-Implemnetation

Implementation of PPO for CartPole-v1

☆10

Alternatives and similar repositories for PPO-Implemnetation

Users that are interested in PPO-Implemnetation are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

jeffasante / grpo-maze-solver
View on GitHub
A reinforcement learning agent that learns to solve mazes using Group Relative Policy Optimization (GRPO).
☆12Feb 9, 2025Updated last year
CORE-Robotics-Lab / ICCT
View on GitHub
☆18Jun 26, 2026Updated 3 weeks ago
jacklishufan / MAIN2021
View on GitHub
A Multi-Stage Audiogram Interpretation Network
☆14Dec 20, 2021Updated 4 years ago
CORE-Robotics-Lab / Interpretable_DDTS_AISTATS2020
View on GitHub
Public code for implementation and experiments with differentiable decision trees.
☆32Oct 17, 2024Updated last year
duckzhao / air_campaign_rl
View on GitHub
基于强化学习的游戏空战推演
☆13May 8, 2021Updated 5 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
guytenn / Act2Vec
View on GitHub
☆13May 10, 2019Updated 7 years ago
americast / DRL_HVAC
View on GitHub
Optimising electricity expenditure in an HVAC system under dynamic electricity pricing scheme and weather conditions using a DDPG model.
☆26Feb 6, 2022Updated 4 years ago
torymac1 / Simulation-icc17-paper
View on GitHub
Simlulation code for paper "Cooperative caching for spectrum access in cognitive radio networks".
☆10Oct 24, 2017Updated 8 years ago
benellis3 / pymarl2
View on GitHub
Fine-tuned MARL algorithms on SMAC (100% win rates on most scenarios)
☆19Aug 20, 2023Updated 2 years ago
benellis3 / mappo
View on GitHub
☆18Aug 14, 2023Updated 2 years ago
DIG-Beihang / AMI
View on GitHub
☆16Aug 12, 2024Updated last year
nuwuxian / RL_adv_valuediff
View on GitHub
☆16Mar 24, 2023Updated 3 years ago
yandachen / In-context-Tuning
View on GitHub
Implementation code for the paper "Meta-learning via Language Model In-context Tuning" (ACL 2022)
☆25Jun 16, 2022Updated 4 years ago
quantumiracle / Cascading-Decision-Tree
View on GitHub
Open-source code for paper CDT: Cascading Decision Trees for Explainable Reinforcement Learning
☆41Oct 31, 2025Updated 8 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
Qi-Pang / MDPFuzz
View on GitHub
Official implementation of ISSTA 2022 paper: MDPFuzz: Testing Models Solving Markov Decision Processes.
☆25Dec 17, 2022Updated 3 years ago
Its-its / xpath-scraper
View on GitHub
Makes it simple to scrape websites with xpath structs.
☆13Mar 10, 2023Updated 3 years ago
DestructoSphere / android_kernel_huawei_msm8909
View on GitHub
Huawei scl-l02 kernel source
☆11Dec 8, 2016Updated 9 years ago
ZurichRain / HMCGR
View on GitHub
code for COLING paper "A Hybrid Model of Classification and Generation for Spatial Relation Extraction"
☆10Oct 20, 2022Updated 3 years ago
aadya-agrawal / SimpliHuMoN
View on GitHub
Simple Transformer-based 3D human motion prediction model
☆22Apr 4, 2026Updated 3 months ago
VoltaireNoir / markd
View on GitHub
Bookmark directories for easy directory-hopping in the terminal
☆14Sep 10, 2025Updated 10 months ago
AxiomaticUncertainty / Deep-Q-Learning-for-Tic-Tac-Toe
View on GitHub
Find more info @ youtube.com/axiomaticuncertainty
☆11Aug 20, 2018Updated 7 years ago
aureleoules / ecdsa
View on GitHub
ecdsa operations in go
☆10Oct 21, 2019Updated 6 years ago
turing-roche / turing-roche-documentation
View on GitHub
☆29Mar 5, 2025Updated last year
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
vmoro1 / multimat
View on GitHub
Official implementation of MultiMat (Newton 2025)
☆24Mar 10, 2025Updated last year
higra / Higra-Notebooks
View on GitHub
Demonstration and tutorial notebooks for the Higra library
☆13Sep 29, 2025Updated 9 months ago
tgangwani / GA3C-DeepNavigation
View on GitHub
Tensorflow implementation of DeepMind paper - "Learning to Navigate in Complex Environments"
☆63May 30, 2017Updated 9 years ago
evansalter / shairport-sync-docker-pi
View on GitHub
Docker image to run shairport-sync on a Raspberry Pi
☆12Apr 11, 2019Updated 7 years ago
ThomasRobertFr / thesis
View on GitHub
My PhD manuscript LaTeX code and the slides for the defense
☆11Feb 2, 2022Updated 4 years ago
ziyadsheeba / qfat
View on GitHub
[NeurIPS 2025, Spotlight] An official implementation of the paper Quantization-Free Autoregressive Action Transformer
☆11Mar 3, 2026Updated 4 months ago
jshtok / StarNet
View on GitHub
Pytorch implementation of the StarNet paper algorithm
☆10Jan 25, 2022Updated 4 years ago
prologin / sadm
View on GitHub
Documentation, configs, scripts and services used for the finals of the Prologin contest
☆12Oct 31, 2022Updated 3 years ago
IouJenLiu / CMAE
View on GitHub
☆50Jul 23, 2021Updated 4 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
Uriopass / Stacklang
View on GitHub
A home-made stack based language heavily inspired from PostScript
☆11Jan 24, 2020Updated 6 years ago
e-bug / fine-grained-evals
View on GitHub
[ACL 2023] Code and data for our paper "Measuring Progress in Fine-grained Vision-and-Language Understanding"
☆13Jun 11, 2023Updated 3 years ago
Danielhp95 / gym-connect4
View on GitHub
An OpenAI Gym implementation of the famous Connect 4 environment
☆12Jan 11, 2021Updated 5 years ago
FatemehShiri / Spatial-MM
View on GitHub
☆12Jan 10, 2025Updated last year
blueridge-data / bufferbin
View on GitHub
☆14Jul 9, 2023Updated 3 years ago
henkwymeersch / DeepRLVehicularLocalization
View on GitHub
Decentralized Scheduling for Cooperative Localization with Deep Reinforcement Learning
☆35Jun 1, 2019Updated 7 years ago
moveit / moveit_ros
View on GitHub
THIS REPO HAS MOVED TO https://github.com/ros-planning/moveit
☆71Nov 28, 2016Updated 9 years ago