Code for demonstration example-task in RUDDER blog
☆24May 19, 2020Updated 5 years ago
Alternatives and similar repositories for rudder-demonstration-code
Users that are interested in rudder-demonstration-code are comparing it to the libraries listed below
Sorting:
- A practical step-by-step guide to applying RUDDER☆35Nov 12, 2019Updated 6 years ago
- RUDDER: Return Decomposition for Delayed Rewards☆48Sep 17, 2020Updated 5 years ago
- Code for Invariant Policy Optimization☆15Jul 22, 2020Updated 5 years ago
- ☆14Oct 10, 2025Updated 4 months ago
- ☆31Jan 16, 2023Updated 3 years ago
- ☆13Dec 6, 2018Updated 7 years ago
- Code for the benchmark containing dataset, models and metrics for productive concept learning -- a kind of compositional reasoning task t…☆17Jul 22, 2021Updated 4 years ago
- Invariant Causal Prediction for Block MDPs☆44Jun 11, 2020Updated 5 years ago
- TensorFlow implementation for our paper "Learning Long-Term Reward Redistribution via Randomized Return Decomposition"☆19Mar 17, 2022Updated 3 years ago
- Benchmark data for d3rlpy☆21Nov 28, 2023Updated 2 years ago
- PyTorch Implementation of "Language as an Abstraction for Hierarchical Deep Reinforcement Learning" paper☆26Feb 14, 2022Updated 4 years ago
- ☆11Apr 22, 2020Updated 5 years ago
- Datasets for data-driven deep reinforcement learning with Atari (wrapper for datasets released by Google)☆126Aug 30, 2024Updated last year
- Official PyTorch implementation of "ACE:Off-Policy Actor-Critic with Causality-Aware Entropy Regularization"☆35May 13, 2024Updated last year
- This is the official implementation of NeurIPS 2022 paper "Generalizing Goal-Conditioned Reinforcement Learning with Variational Causal R…☆35Jan 25, 2023Updated 3 years ago
- ☆33Aug 30, 2024Updated last year
- [SIGKDD' 24] PyTorch implementation of Temporal Prototype-Aware Learning for Active Voltage Control on Power Distribution Networks☆13Jul 28, 2024Updated last year
- Implementation of TWOSOME☆82Jan 11, 2025Updated last year
- CausalWorld: A Robotic Manipulation Benchmark for Causal Structure and Transfer Learning☆240Nov 1, 2022Updated 3 years ago
- Change-Based Exploration Transfer☆35Apr 24, 2022Updated 3 years ago
- Trajectory-wise Multiple Choice Learning for Dynamics Generalization in Reinforcement Learning (NeurIPS 2020)☆39Oct 27, 2020Updated 5 years ago
- Use mason's rule to simplify signal flow graphs in MATLAB☆11Mar 5, 2020Updated 5 years ago
- Code for optimal execution☆12Oct 29, 2020Updated 5 years ago
- Data-Driven NetHack Tools: Datasets (30+) and recurrent-baselines (AWAC, BC, CQL, IQL, REM)☆43Aug 22, 2023Updated 2 years ago
- Code for the paper "SizeShiftReg: a Regularization Method for Improving Size-Generalization in Graph Neural Networks"☆12Jan 17, 2023Updated 3 years ago
- Benchmark environments for reward modelling and imitation learning algorithms.☆46Sep 19, 2023Updated 2 years ago
- Mobility Load Management in Cellular Networks☆10Jul 14, 2023Updated 2 years ago
- AGAC: Adversarially Guided Actor-Critic☆47Sep 16, 2021Updated 4 years ago
- Learning from Indirect Observations☆11Jul 16, 2021Updated 4 years ago
- Halite 3 reloader☆12Dec 15, 2018Updated 7 years ago
- Code accompanying paper, Forward Prediction for Physical Reasoning☆11Oct 12, 2021Updated 4 years ago
- minimalist vector ad☆11Feb 11, 2024Updated 2 years ago
- ☆10Aug 8, 2021Updated 4 years ago
- A project designed to build and render a full Minecraft crafting tree.☆10Aug 10, 2021Updated 4 years ago
- Genetic algorithm for reducing the power loss in an electrical network consisting out of 119 nodes.☆12May 5, 2017Updated 8 years ago
- Learning Algebraic Representation for Systematic Generalization in Abstract Reasoning☆11Jul 20, 2022Updated 3 years ago
- A Python library for working with and training Hidden Markov Models with Poisson emissions.☆10Aug 14, 2017Updated 8 years ago
- ♊ Minimal PyTorch Twin Delayed DDPG (TD3) implementation☆10Jun 20, 2021Updated 4 years ago
- VertMetric: An abstractive summarization evaluation package. VERT stands for Versatile Evaluation of Reduced Texts.☆11Dec 20, 2018Updated 7 years ago