wangyuhuix/TRGPPO

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/wangyuhuix/TRGPPO)

wangyuhuix / TRGPPO

☆34

Alternatives and similar repositories for TRGPPO

Users that are interested in TRGPPO are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

wangyuhuix / TrulyPPO
View on GitHub
☆29Nov 21, 2022Updated 3 years ago
christosbampis / NARX_QoE_release
View on GitHub
☆13May 29, 2018Updated 8 years ago
wulfebw / muzero
View on GitHub
A python implemenation of tabular MuZero for educational purposes
☆21Dec 11, 2019Updated 6 years ago
mansimov / acktr
View on GitHub
☆17Sep 15, 2017Updated 8 years ago
holarissun / PCHID_code
View on GitHub
Code for [NeurIPS'2019 Spotlight] Policy Continuation with Hindsight Inverse Dynamics
☆15Jan 7, 2020Updated 6 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
rllab-snu / Safe-Distributional-Actor-Critic
View on GitHub
Official Github Repository for "Trust Region-Based Safe Distributional Reinforcement Learning for Multiple Constraints". (NeurIPS 2023)
☆21Nov 30, 2025Updated 7 months ago
KornbergFresnel / ModelRepo
View on GitHub
reproduce some RL or Multi-Agent models
☆35May 22, 2019Updated 7 years ago
tesslerc / GAC
View on GitHub
Code accompanying NeurIPS 2019 paper: "Distributional Policy Optimization - An Alternative Approach for Continuous Control"
☆22Dec 17, 2019Updated 6 years ago
illidanlab / rpg
View on GitHub
Ranking Policy Gradient
☆23Nov 27, 2019Updated 6 years ago
KeWang0622 / CS294_HW
View on GitHub
My solutions toward CS294 homework: Deep Reinforcement Learning
☆11Nov 14, 2018Updated 7 years ago
crowdAI / marlo-single-agent-starter-kit
View on GitHub
Round 1 Starter Kit for the MarLo challenge
☆21Sep 27, 2018Updated 7 years ago
PKU-RL / FEN
View on GitHub
FEN Code
☆41Nov 4, 2019Updated 6 years ago
hhexiy / opponent
View on GitHub
Implementation for ICML 16 paper "Deep reinforcement learning with opponent modeling"
☆71Apr 15, 2026Updated 3 months ago
ntsliyang / adaptive-bitrate-streaming
View on GitHub
DeeCamp 2019 Team 19
☆26Dec 30, 2022Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
thu-media / Tiyuntsong
View on GitHub
A romantic self-play reinforcement learning approach for ABR video streaming
☆26Nov 29, 2019Updated 6 years ago
ludobouan / Q-learning-gridworld
View on GitHub
Reinforcement learning on gridworld with Q-learning
☆10Jan 28, 2017Updated 9 years ago
NNU-GISA / FoldingNet
View on GitHub
Organized code for the paper "FoldingNet: Point Cloud Auto-encoder via Deep Grid Deformation" (CVPR 2018).
☆10May 2, 2019Updated 7 years ago
apexrl / CoDAIL
View on GitHub
Implementation of CoDAIL in the ICLR 2020 paper <Multi-Agent Interactions Modeling with Correlated Policies>
☆19Jun 17, 2021Updated 5 years ago
henryslzhao / RL4Recsys
View on GitHub
paper list in the area of reinforcenment learning for recommendation systems
☆25Aug 4, 2020Updated 5 years ago
songquanpeng / hexo-theme-lightx
View on GitHub
Hexo theme lightx.
☆10Oct 2, 2020Updated 5 years ago
ermongroup / multiagent-gail
View on GitHub
☆84Dec 4, 2018Updated 7 years ago
andreamad8 / QDREN
View on GitHub
Question Dependent Recurrent Entity Network
☆13Sep 21, 2017Updated 8 years ago
tgangwani / SelfImitationDiverse
View on GitHub
Tensorflow code for "Learning Self-Imitating Diverse Policies" (ICLR 2019)
☆20Nov 26, 2020Updated 5 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
florensacc / rllab-curriculum
View on GitHub
☆143Feb 26, 2019Updated 7 years ago
brett-daley / dqn-lambda
View on GitHub
NeurIPS 2019: DQN(λ) = Deep Q-Network + λ-returns.
☆25May 20, 2024Updated 2 years ago
jvking / reddit-RL-simulator
View on GitHub
This repository provides simulator codes for predicting and tracking popular discussion threads on Reddit
☆21Sep 10, 2016Updated 9 years ago
AdamStelmaszczyk / dqn
View on GitHub
TensorFlow & Keras implementation of DQN with HER (Hindsight Experience Replay)
☆40Jul 31, 2020Updated 5 years ago
newtrip-project / pitree
View on GitHub
Practical Implementation of ABR Algorithms Using Decision Trees (ACM MM 2019)
☆37Apr 21, 2024Updated 2 years ago
philipjball / OffCon3
View on GitHub
📴 OffCon^3: SOTA PyTorch SAC and TD3 Implementations (arxiv: 2101.11331)
☆25Jun 20, 2021Updated 5 years ago
williamsentosa95 / cellreplay
View on GitHub
Record and replay for cellular network emulation
☆32May 6, 2025Updated last year
tdavchev / option-critic
View on GitHub
A Tensorflow implementation of the Option-Critic Architecture
☆75Jun 1, 2017Updated 9 years ago
hongzimao / input_driven_rl_example
View on GitHub
Variance Reduction for Reinforcement Learning in Input-Driven Environments (ICLR '19)
☆31May 6, 2019Updated 7 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
quovadim / RL-Cache
View on GitHub
☆30Apr 13, 2019Updated 7 years ago
sii-yingwen / rommeo
View on GitHub
IJCAI 2019 - Regularized Opponent Model with Maximum Entropy Objective (ROMMEO)
☆23Dec 8, 2022Updated 3 years ago
kaist-ina / neuroscaler-public
View on GitHub
This is an official GitHub repository for the paper, "Engorgio: Neural Enhancement at Scale"
☆35Mar 12, 2023Updated 3 years ago
lanyavik / BAIL
View on GitHub
☆18Jul 13, 2022Updated 4 years ago
Underflow / reinforcement-2048
View on GitHub
A reinforcement learning algorithm for the 2048 game
☆20Mar 25, 2014Updated 12 years ago
namilus / nn
View on GitHub
Build and Train Neural Networks in Emacs Lisp
☆15May 22, 2024Updated 2 years ago
StanfordSNR / indigo
View on GitHub
Empirically learned congestion control by imitation learning with RNNs
☆47Jan 31, 2019Updated 7 years ago