illidanlab/rpg

Ranking Policy Gradient

☆23

Alternatives and similar repositories for rpg

Users who are interested in rpg are comparing it to the libraries listed below.

voot-t / guide-actor-critic
Keras implementation of guide actor-critic for continuous control
☆11 · Mar 12, 2018 · Updated 8 years ago
mcgillmrl / robot_learning
ROS package for robot learning
☆17 · Oct 16, 2019 · Updated 6 years ago
HarrieO / RankingComplexLayouts
Repository for SIGIR'18 paper: "Ranking for Relevance and Display Preferences in Complex Presentation Layouts"
☆16 · Aug 28, 2018 · Updated 7 years ago
schroederdewitt / mackrl
Code for Multi-Agent Common Knowledge Reinforcement Learning (NeurIPS 2019)
☆34 · Dec 1, 2019 · Updated 6 years ago
YyzHarry / SV-RL
[ICLR 2020, Oral] Harnessing Structures for Value-Based Planning and Reinforcement Learning
☆34 · Feb 1, 2020 · Updated 6 years ago
boschresearch / DD_OPG
Implementation prototype of the Deep Deterministic Off-Policy Gradient (DD-OPG) method.
☆11 · Jun 12, 2019 · Updated 7 years ago
behaviorguidedRL / BGRL
Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization
☆24 · Jun 24, 2020 · Updated 6 years ago
quanvuong / Supervised_Policy_Update
Code to reproduce Supervised Policy Update (ICLR 2019)
☆17 · Dec 8, 2022 · Updated 3 years ago
ryanxhr / BEAR
PyTorch implementation of BEAR in "Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction"
☆11 · Oct 29, 2019 · Updated 6 years ago
vub-ai-lab / bdpi
Sample-Efficient Reinforcement Learning with Bootstrapped Dual Policy Iteration
☆25 · Sep 9, 2019 · Updated 6 years ago
ehknight / natural-gradient-deep-q-learning
☆23 · Oct 7, 2018 · Updated 7 years ago
anonymous-author1 / DDRL
☆22 · Dec 7, 2022 · Updated 3 years ago
uber-research / Evolvability-ES
☆14 · Jun 26, 2019 · Updated 7 years ago
zhougroup / IDAC
Implicit Distributional Actor Critic
☆11 · Dec 8, 2021 · Updated 4 years ago
ShibiHe / Q-Optimality-Tightening
This is my implementation of the Optimality Tightening
☆37 · Apr 26, 2017 · Updated 9 years ago
sparisi / td-reg
TD-Regularized Actor-Critic Methods
☆37 · Dec 26, 2019 · Updated 6 years ago
RLAgent / state-marginal-matching
Efficient Exploration via State Marginal Matching (2019)
☆70 · Jun 30, 2019 · Updated 7 years ago
rlbayes / rllabplusplus
☆162 · Jul 21, 2017 · Updated 9 years ago
uncharted-technologies / robust-domain-randomization
Code associated with our paper "Robust Domain Randomization for Reinforcement Learning"
☆12 · Nov 22, 2022 · Updated 3 years ago
Miffyli / policy-supervectors
Creating fixed-length vectors to describe RL/GA policies
☆20 · Oct 23, 2021 · Updated 4 years ago
wangyuhuix / TrulyPPO
☆29 · Nov 21, 2022 · Updated 3 years ago
bhairavmehta95 / ant-env
Ant Gather and Ant Maze envs, separated from RLLab
☆11 · Aug 2, 2018 · Updated 7 years ago
google-research / policy-learning-landscape
Explore the optimization landscape for direct policy learning reinforcement learning.
☆51 · Jan 16, 2019 · Updated 7 years ago
ermongroup / CalibratedModelBasedRL
Code for "Calibrated Model-Based Deep Reinforcement Learning", ICML 2019.
☆54 · May 15, 2019 · Updated 7 years ago
xeniaqian94 / RLeToR
A PyTorch implementation of REINFORCE Learning To Rank on OHSUMED, MQ, etc. datasets. Basic idea also appears in SIGIR'17 Reinforcement Le…
☆18 · Dec 8, 2017 · Updated 8 years ago
pfnet-research / capg
Implementation of clipped action policy gradient (CAPG) with PPO and TRPO
☆31 · Jun 24, 2018 · Updated 8 years ago
jkulhanek / gym-deepmindlab-env
Gym implementation of a connector to DeepMind Lab
☆12 · Mar 26, 2019 · Updated 7 years ago
aviralkumar2907 / BEAR
Code for Stabilizing Off-Policy RL via Bootstrapping Error Reduction
☆164 · Jul 17, 2020 · Updated 6 years ago
henry-prior / multimodal-rl
Solving reinforcement learning tasks which require language and vision
☆33 · Apr 4, 2023 · Updated 3 years ago
valeoai / rainbow-iqn-apex
Distributed Rainbow-IQN for Atari
☆80 · Dec 17, 2019 · Updated 6 years ago
jesbu1 / carl
GitHub repo for CARL: Cautious Adaptation for RL in Safety Critical Settings
☆14 · Nov 22, 2022 · Updated 3 years ago
qingyun-wu / NonstationaryBanditLib
☆15 · Jan 20, 2020 · Updated 6 years ago
matsuolab / BREMEN
Codebase of Deployment-Efficient Reinforcement Learning via Model-Based Offline Optimization (ICLR 2021)
☆54 · Jul 7, 2021 · Updated 5 years ago
floringogianu / categorical-dqn
A working implementation of the Categorical DQN (Distributional RL).
☆95 · Apr 7, 2018 · Updated 8 years ago
chainer / chainerrl-visualizer
☆54 · Jan 13, 2023 · Updated 3 years ago
StatsDLMathsRecomSys / Generative-Graph-Convolutional-Network-for-Growing-Graphs
☆12 · Feb 29, 2020 · Updated 6 years ago
jvmncs / ParamNoise
A comparison of parameter space noise methods for exploration in deep reinforcement learning
☆30 · Mar 14, 2019 · Updated 7 years ago
uber-research / D3G
Estimating Q(s,s') with Deep Deterministic Dynamics Gradients
☆32 · Feb 21, 2020 · Updated 6 years ago
openai / phasic-policy-gradient
Code for the paper "Phasic Policy Gradient"
☆266 · Apr 2, 2023 · Updated 3 years ago