wangyuhuix/TrulyPPO

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/wangyuhuix/TrulyPPO)

wangyuhuix / TrulyPPO

☆29

Alternatives and similar repositories for TrulyPPO

Users that are interested in TrulyPPO are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

wangyuhuix / TRGPPO
View on GitHub
☆34Nov 21, 2022Updated 3 years ago
wisnunugroho21 / reinforcement_learning_phasic_policy_gradient
View on GitHub
Deep Reinforcement Learning by using Phasic Policy Gradient in Pytorch & Tensorflow
☆20Oct 5, 2021Updated 4 years ago
wisnunugroho21 / reinforcement_learning_truly_ppo
View on GitHub
Deep Reinforcement Learning by using Truly Proximal Policy Optimization in Tensorflow 2 and Pytorch
☆22Nov 9, 2025Updated 8 months ago
jqueeney / geppo
View on GitHub
Generalized Proximal Policy Optimization with Sample Reuse (GePPO)
☆29Jul 24, 2023Updated 3 years ago
menglinjian / Deep-FTRL-ORW
View on GitHub
Code for the paper "Deep FTRL-ORW: An Efficient Deep Reinforcement Learning Algorithm for Solving Imperfect Information Extensive-Form Ga…
☆11Dec 1, 2022Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
illidanlab / rpg
View on GitHub
Ranking Policy Gradient
☆23Nov 27, 2019Updated 6 years ago
manantomar / Mirror-Descent-Policy-Optimization
View on GitHub
Mirror Descent Policy Optimization
☆43Oct 31, 2020Updated 5 years ago
DartML / PPO-Stein-Control-Variate
View on GitHub
Proximal Policy Optimization with Stein Control Variates:
☆33Feb 12, 2018Updated 8 years ago
deligentfool / GAIL_pytorch
View on GitHub
The implement of GAIL with pytorch
☆14Mar 11, 2020Updated 6 years ago
Chenan-W / Python-Trajectory-Tracking-Control-for-UAV
View on GitHub
单无人机对螺旋轨迹跟踪的实物实验
☆10May 22, 2023Updated 3 years ago
mingzhangPHD / Adversarial-Imitation-Learning
View on GitHub
Wasserstein Distance guided Adversarial Imitation Learning (WDAIL) with Reward Shape Exploration
☆19Feb 9, 2021Updated 5 years ago
Prasham-Patel / CARLA_Motion_Planning_Project
View on GitHub
☆12May 29, 2022Updated 4 years ago
xeniaqian94 / RLeToR
View on GitHub
A PyTorch implementation of REINFORCE Learning To Rank on OSHUMED, MQ, etc. dataset. Basic idea also appears in SIGIR'17 Reinforcement Le…
☆18Dec 8, 2017Updated 8 years ago
chloechsu / revisiting-ppo
View on GitHub
☆48Sep 23, 2020Updated 5 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
vub-ai-lab / bdpi
View on GitHub
Sample-Efficient Reinforcement Learning with Bootstrapped Dual Policy Iteration
☆25Sep 9, 2019Updated 6 years ago
djs55 / ocaml-spdy
View on GitHub
Implementation of the SPDY protocol in ocaml
☆14Oct 2, 2011Updated 14 years ago
jw1401 / PPO-Tensorflow-2.0
View on GitHub
Proximal Policy Optimization with Tensorflow 2.0
☆32Oct 14, 2019Updated 6 years ago
KAIST-AILab / gmmil
View on GitHub
Contains an implementation of "Imitation Learning via Kernel Mean Embedding (2018, AAAI)"
☆11Oct 2, 2018Updated 7 years ago
Plankson / CSIRL
View on GitHub
[IEEE Transactions on Intelligent Transportation Systems] Curricular Subgoal for Inverse Reinforcement Learning
☆18Jul 31, 2023Updated 2 years ago
RomainLaroche / SPIBB
View on GitHub
Safe Policy Improvement with Baseline Bootstrapping
☆26May 5, 2020Updated 6 years ago
N-H-Shimada / QCoin-CGF-2020
View on GitHub
☆10Jan 21, 2021Updated 5 years ago
lionelblonde / sam-pytorch
View on GitHub
PyTorch implementation of "Sample-efficient Imitation Learning via Generative Adversarial Nets"
☆10Nov 22, 2019Updated 6 years ago
hasauino / Implemented_RRTs
View on GitHub
☆10Aug 15, 2016Updated 9 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
gioramponi / sigma-girl-MIIRL
View on GitHub
Code of Truly Batch Model-Free Inverse Reinforcement Learning about Multiple Intentions
☆13May 22, 2023Updated 3 years ago
Dragon-Zhuang / BPPO
View on GitHub
Author's Pytorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO).
☆95Dec 13, 2023Updated 2 years ago
Miraclemarvel55 / LLaMA-MOSS-RLHF-LoRA
View on GitHub
用RLHF可选LoRA对LLaMA和MOSS进行训练|Training LLaMA or MOSS with RLHF [LoRA]
☆21May 16, 2023Updated 3 years ago
spawnaga / FlexTrader
View on GitHub
A multi-task deep reinforcement learning model for trading futures contracts using the Interactive Brokers API and TensorFlow
☆15Feb 8, 2023Updated 3 years ago
Aleum / AlphaGo
View on GitHub
9x9 AlphaGo
☆13Jul 27, 2016Updated 10 years ago
janestreet / configurator
View on GitHub
Helper library for gathering system configuration
☆20Sep 5, 2019Updated 6 years ago
manxing-du / cmdp-rtb
View on GitHub
☆10Apr 18, 2017Updated 9 years ago
mkschleg / GVFN
View on GitHub
☆10Apr 24, 2021Updated 5 years ago
karanchawla / motion-planning-playground
View on GitHub
Playground for motion planning and controls algorithms.
☆15Aug 15, 2018Updated 7 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
yaoliucs / PQL
View on GitHub
Author's PyTorch implementation of paper "Provably Good Batch Reinforcement Learning Without Great Exploration"
☆11Oct 22, 2020Updated 5 years ago
yifengtao / CADRE
View on GitHub
CADRE: Contextual Attention-based Drug REsponse
☆12Nov 23, 2020Updated 5 years ago
vbmithr / ocaml-fix
View on GitHub
FIX protocol for OCaml
☆15Feb 15, 2020Updated 6 years ago
ramp-kits / rl_simulator
View on GitHub
Model-based reinforcement learning (generative simulator models and planning agents)
☆16Mar 13, 2026Updated 4 months ago
kasper9n / redlux
View on GitHub
AAC decoder for MPEG-4 and AAC files, with rodio support
☆18Updated this week
lqtrung1998 / mwp_cot_design
View on GitHub
☆14Oct 11, 2023Updated 2 years ago
suyoung-lee / Episodic-Backward-Update
View on GitHub
Implementation of "Sample-Efficient Deep Reinforcement Learning via Episodic Backward Update", NeurIPS 2019.
☆16Sep 24, 2019Updated 6 years ago