Code for demonstration example-task in RUDDER blog
☆24May 19, 2020Updated 5 years ago
Alternatives and similar repositories for rudder-demonstration-code
Users that are interested in rudder-demonstration-code are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A practical step-by-step guide to applying RUDDER☆35Nov 12, 2019Updated 6 years ago
- RUDDER: Return Decomposition for Delayed Rewards☆48Sep 17, 2020Updated 5 years ago
- ☆14Oct 10, 2025Updated 6 months ago
- Codebase for "Causal Induction from Visual Observations for Goal-Directed Tasks"☆14Feb 25, 2020Updated 6 years ago
- Code for Invariant Policy Optimization☆15Jul 22, 2020Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Learning from Indirect Observations☆11Jul 16, 2021Updated 4 years ago
- ☆13Dec 6, 2018Updated 7 years ago
- Code for the benchmark containing dataset, models and metrics for productive concept learning -- a kind of compositional reasoning task t…☆17Jul 22, 2021Updated 4 years ago
- [SIGKDD' 24] PyTorch implementation of Temporal Prototype-Aware Learning for Active Voltage Control on Power Distribution Networks☆14Jul 28, 2024Updated last year
- Code to reproduce results on toy tasks and companion blog for the paper.☆22Jun 8, 2022Updated 3 years ago
- ☆33Aug 30, 2024Updated last year
- Change-Based Exploration Transfer☆35Apr 24, 2022Updated 4 years ago
- TensorFlow implementation for our paper "Learning Long-Term Reward Redistribution via Randomized Return Decomposition"☆19Mar 17, 2022Updated 4 years ago
- Data-Driven NetHack Tools: Datasets (30+) and recurrent-baselines (AWAC, BC, CQL, IQL, REM)☆43Aug 22, 2023Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Distributed & asynchronous DQN implementation using gRPC and PyTorch.☆10Feb 15, 2021Updated 5 years ago
- ♊ Minimal PyTorch Twin Delayed DDPG (TD3) implementation☆10Jun 20, 2021Updated 4 years ago
- ☆14Mar 5, 2024Updated 2 years ago
- ☆15May 20, 2023Updated 2 years ago
- Genetic algorithm for reducing the power loss in an electrical network consisting out of 119 nodes.☆12May 5, 2017Updated 8 years ago
- Official PyTorch implementation of "ACE:Off-Policy Actor-Critic with Causality-Aware Entropy Regularization"☆35May 13, 2024Updated last year
- Benchmark data for d3rlpy☆21Nov 28, 2023Updated 2 years ago
- AGAC: Adversarially Guided Actor-Critic☆47Sep 16, 2021Updated 4 years ago
- Code for the paper "SizeShiftReg: a Regularization Method for Improving Size-Generalization in Graph Neural Networks"☆12Jan 17, 2023Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Code for Optimistic Exploration even with a Pessimistic Initialisation☆14Aug 4, 2020Updated 5 years ago
- CausalWorld: A Robotic Manipulation Benchmark for Causal Structure and Transfer Learning☆242Nov 1, 2022Updated 3 years ago
- Model-based Offline Policy Optimization re-implement all by pytorch☆41Sep 13, 2023Updated 2 years ago
- Code for the paper "Importance Weighted Transfer of Samples in Reinforcement Learning" (ICML 2018).☆16May 29, 2018Updated 7 years ago
- ☆10Aug 8, 2021Updated 4 years ago
- ☆21Mar 19, 2024Updated 2 years ago
- Fully Customized Side Menu☆11Jul 27, 2020Updated 5 years ago
- Continual Reinforcement Learning in 3D Non-stationary Environments☆39Jun 16, 2019Updated 6 years ago
- Pytorch code for "Learning Guidance Rewards with Trajectory-space Smoothing" (NeurIPS 2020)☆12Jul 7, 2021Updated 4 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- ☆12Mar 21, 2024Updated 2 years ago
- ☆10Oct 15, 2020Updated 5 years ago
- ☆15Oct 26, 2020Updated 5 years ago
- A tool to automatically label, classify, and count marine debris in your aerial imagery. Designed to automate the tedious parts of standi…☆11Nov 22, 2023Updated 2 years ago
- ☆34Aug 22, 2025Updated 8 months ago
- PyTorch implementation of R2D2 (Recurrent Reply Distributed DQN)☆13Nov 14, 2019Updated 6 years ago
- Datasets for data-driven deep reinforcement learning with Atari (wrapper for datasets released by Google)☆127Aug 30, 2024Updated last year