wangyuhuix / TRGPPOView external linksLinks
☆33Nov 21, 2022Updated 3 years ago
Alternatives and similar repositories for TRGPPO
Users that are interested in TRGPPO are comparing it to the libraries listed below
Sorting:
- Ranking Policy Gradient☆23Nov 27, 2019Updated 6 years ago
- A python implemenation of tabular MuZero for educational purposes☆21Dec 11, 2019Updated 6 years ago
- ☆13May 29, 2018Updated 7 years ago
- discrete gate sizing☆14Nov 23, 2020Updated 5 years ago
- Code for [NeurIPS'2019 Spotlight] Policy Continuation with Hindsight Inverse Dynamics☆15Jan 7, 2020Updated 6 years ago
- ☆17Sep 15, 2017Updated 8 years ago
- Dockerfile for RL research. Including MuJoCo / DMC / PyTorch / Tensoflow / Atari support.☆16Jan 5, 2022Updated 4 years ago
- A reinforcement learning algorithm for the 2048 game☆20Mar 25, 2014Updated 11 years ago
- reproduce some RL or Multi-Agent models☆35May 22, 2019Updated 6 years ago
- FEN Code☆40Nov 4, 2019Updated 6 years ago
- Metis: Learning to Schedule Long-Running Applications in Shared Container Clusters with at Scale☆19May 27, 2020Updated 5 years ago
- Implementation of CoDAIL in the ICLR 2020 paper <Multi-Agent Interactions Modeling with Correlated Policies>☆19Jun 17, 2021Updated 4 years ago
- This repository provides simulator codes for predicting and tracking popular discussion threads on Reddit☆20Sep 10, 2016Updated 9 years ago
- Round 1 Starter Kit for the MarLo challenge☆21Sep 27, 2018Updated 7 years ago
- 📴 OffCon^3: SOTA PyTorch SAC and TD3 Implementations (arxiv: 2101.11331)☆25Jun 20, 2021Updated 4 years ago
- ☆18Jul 13, 2022Updated 3 years ago
- Implementation for ICML 16 paper "Deep reinforcement learning with opponent modeling"☆72Aug 18, 2016Updated 9 years ago
- ☆85Dec 4, 2018Updated 7 years ago
- IJCAI 2019 - Regularized Opponent Model with Maximum Entropy Objective (ROMMEO)☆23Dec 8, 2022Updated 3 years ago
- Tensorflow code for "Learning Self-Imitating Diverse Policies" (ICLR 2019)☆20Nov 26, 2020Updated 5 years ago
- paper list in the area of reinforcenment learning for recommendation systems☆25Aug 4, 2020Updated 5 years ago
- Record and replay for cellular network emulation☆29May 6, 2025Updated 9 months ago
- Safe Policy Improvement with Baseline Bootstrapping☆26May 5, 2020Updated 5 years ago
- Objective Quality-of-Experience Model Benchmark☆26Feb 26, 2020Updated 5 years ago
- Code accompanying NeurIPS 2019 paper: "Distributional Policy Optimization - An Alternative Approach for Continuous Control"☆22Dec 17, 2019Updated 6 years ago
- FLUIDS is a lightweight driving simulator for benchmarking Deep Reinforcement and Imitation learning algorithms.☆24May 3, 2019Updated 6 years ago
- DeeCamp 2019 Team 19☆26Dec 30, 2022Updated 3 years ago
- A Continual Multi-agent RL testbed based on Hanabi☆32Aug 1, 2021Updated 4 years ago
- A romantic self-play reinforcement learning approach for ABR video streaming☆26Nov 29, 2019Updated 6 years ago
- Unofficial Implementation of Oboe (SIGCOMM'18).☆30Mar 24, 2022Updated 3 years ago
- ☆12Sep 5, 2018Updated 7 years ago
- Revisiting Rainbow☆75Jun 9, 2021Updated 4 years ago
- Implementation of CURIOUS: Intrinsically Motivated Modular Multi-Goal Reinforcement Learning☆27May 15, 2020Updated 5 years ago
- ☆29Apr 13, 2019Updated 6 years ago
- ☆135Jul 25, 2024Updated last year
- Implementation of PCA algorithm using Gram-Scmidt modification on NIPALS☆10Jun 13, 2015Updated 10 years ago
- ☆13Nov 5, 2024Updated last year
- Upside-Down Reinforcement Learning (⅂ꓤ) implementation in PyTorch. Based on the paper published by Jürgen Schmidhuber.☆78Aug 13, 2020Updated 5 years ago
- The code used to power DeepRole☆37Nov 21, 2022Updated 3 years ago