widmi/rudder-a-practical-tutorial

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/widmi/rudder-a-practical-tutorial)

widmi / rudder-a-practical-tutorial

A practical step-by-step guide to applying RUDDER

☆36

Alternatives and similar repositories for rudder-a-practical-tutorial

Users that are interested in rudder-a-practical-tutorial are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ml-jku / rudder
View on GitHub
RUDDER: Return Decomposition for Delayed Rewards
☆49Sep 17, 2020Updated 5 years ago
ml-jku / rudder-demonstration-code
View on GitHub
Code for demonstration example-task in RUDDER blog
☆24May 19, 2020Updated 6 years ago
ml-jku / align-rudder
View on GitHub
Code to reproduce results on toy tasks and companion blog for the paper.
☆23Jun 8, 2022Updated 4 years ago
rystrauss / dopamax
View on GitHub
Reinforcement learning in pure JAX.
☆13Jun 24, 2026Updated 3 weeks ago
pkumusic / E-DRL
View on GitHub
Exploration Strategies for Deep Reinforcement Learning
☆39Oct 31, 2018Updated 7 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
neale / avoiding-side-effects
View on GitHub
Code for reproducing the results from the paper Avoiding Side Effects in Complex Environments
☆12Jun 3, 2021Updated 5 years ago
flowersteam / geppg
View on GitHub
☆36Aug 10, 2018Updated 7 years ago
nnaisense / MAX
View on GitHub
Code for reproducing experiments in Model-Based Active Exploration, ICML 2019
☆81Jul 23, 2019Updated 7 years ago
bprabhakar / upside-down-reinforcement-learning
View on GitHub
Pytorch based implementation of Upside Down Reinforcement Learning (UDRL) by J. Schmidhuber et al.
☆12May 1, 2020Updated 6 years ago
ajgupta93 / d4pg-pytorch
View on GitHub
In Progress : State of the art Distributed Distributional Deep Deterministic Policy Gradient algorithm implementation in pytorch.
☆19Jun 15, 2018Updated 8 years ago
mcmachado / count_based_exploration_sr
View on GitHub
☆31Jul 1, 2019Updated 7 years ago
brett-daley / dqn-lambda
View on GitHub
NeurIPS 2019: DQN(λ) = Deep Q-Network + λ-returns.
☆25May 20, 2024Updated 2 years ago
facebookresearch / svg
View on GitHub
On the model-based stochastic value gradient for continuous reinforcement learning
☆58Mar 6, 2026Updated 4 months ago
shi27feng / transformers.satisfy
View on GitHub
propositional satisfiability problem (SAT) goes neural and deep
☆12Aug 17, 2021Updated 4 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
222464 / MiniNeoRL
View on GitHub
Simple, small, fully-connected Python version of NeoRL
☆11Jan 29, 2016Updated 10 years ago
behas / ransomware-dataset
View on GitHub
Economics of Ransomware | Dataset
☆15May 2, 2018Updated 8 years ago
DwangoMediaVillage / marltas_core
View on GitHub
Distributed & asynchronous DQN implementation using gRPC and PyTorch.
☆10Feb 15, 2021Updated 5 years ago
Gy-Hu / AIG2INV
View on GitHub
DeepIC3: Guiding IC3 Algorithms by Graph Neural Network Clause Prediction (ASP-DAC 2024)
☆13Nov 2, 2023Updated 2 years ago
abbyvansoest / maxent
View on GitHub
☆14May 30, 2019Updated 7 years ago
VowpalWabbit / estimators
View on GitHub
Estimators to perform off-policy evaluation
☆13Sep 3, 2024Updated last year
atapour / ransomware-classification
View on GitHub
Training and testing pipeline for ransomware classification based on screenshots of the splash screens or ransom notes (https://arxiv.org…
☆11Jul 19, 2020Updated 6 years ago
icaros-usc / dqd-rl
View on GitHub
Official implementation of "Approximating Gradients for Differentiable Quality Diversity in Reinforcement Learning"
☆22Oct 3, 2022Updated 3 years ago
whyjay / curiosity-bottleneck
View on GitHub
Repository for our ICML 2019 paper: Curiosity-Bottleneck
☆34Nov 21, 2022Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
taodav / nsrs
View on GitHub
Code for the paper Novelty Search in Representational Space for Sample Efficient Exploration presented at NeurIPS 2020.
☆14Jul 16, 2024Updated 2 years ago
LiXirong / pytorch-r2d2
View on GitHub
PyTorch implementation of R2D2 (Recurrent Reply Distributed DQN)
☆13Nov 14, 2019Updated 6 years ago
Kajiyu / kanerva_machine
View on GitHub
The implementation of "The Kanerva Machine" with Pytorch and Pyro
☆12Jun 14, 2018Updated 8 years ago
toshikwa / sac-discrete.pytorch
View on GitHub
PyTorch implementation of SAC-Discrete.
☆316Jul 25, 2024Updated last year
marcbrittain / Prioritized-Sequence-Experience-Replay
View on GitHub
Prioritized Sequence Experience Replay
☆10Aug 16, 2021Updated 4 years ago
dyth / causal-entropic-forces
View on GitHub
Python3 reimplementation of Wissner-Gross & Freer, 2013
☆15Dec 18, 2025Updated 7 months ago
ml-jku / baselines-rudder
View on GitHub
RUDDER for ATARI games with delayed rewards in OpenAI Baselines package
☆268Oct 24, 2019Updated 6 years ago
mcgillmrl / prob_mbrl
View on GitHub
A library of probabilistic model based RL algorithms in pytorch
☆107Apr 14, 2021Updated 5 years ago
jinnaiyuu / Optimal-Options-ICML-2019
View on GitHub
Code for generating options for planning and reinforcement learning
☆12Feb 18, 2021Updated 5 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
hr0nix / dejax
View on GitHub
Accelerated replay buffers in JAX
☆46Sep 17, 2022Updated 3 years ago
arayabrain / GenerativeControl
View on GitHub
Code for a generative controller for the AI Gym cartpole task
☆15Feb 22, 2017Updated 9 years ago
orbtl-ai / DebrisScan
View on GitHub
A tool to automatically label, classify, and count marine debris in your aerial imagery. Designed to automate the tedious parts of standi…
☆11Nov 22, 2023Updated 2 years ago
benfulton / Algorithmic-Alley
View on GitHub
Code from posts at AlgorthmicAlley.com
☆15Nov 27, 2019Updated 6 years ago
CatherineMeng / FGYM-user-demo
View on GitHub
Demonstrating the usage of FGYM: A Toolkit for benchmarking FPGA-accelerated Reinforcement Learning
☆14Aug 12, 2021Updated 4 years ago
facebookresearch / hanabi_SAD
View on GitHub
Simplified Action Decoder for Deep Multi-Agent Reinforcement Learning
☆103Jun 22, 2022Updated 4 years ago
Chen-Wang-CUHK / Training-Free-and-Ref-Free-Summ-Evaluation
View on GitHub
The source code of our ACL paper "A Training-free and Reference-free Summarization Evaluation Metric via Centrality-weighted Relevance an…
☆14May 6, 2023Updated 3 years ago