Code for demonstration example-task in RUDDER blog
☆24May 19, 2020Updated 6 years ago
Alternatives and similar repositories for rudder-demonstration-code
Users that are interested in rudder-demonstration-code are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A practical step-by-step guide to applying RUDDER☆35Nov 12, 2019Updated 6 years ago
- ☆14Oct 10, 2025Updated 7 months ago
- Codebase for "Causal Induction from Visual Observations for Goal-Directed Tasks"☆14Feb 25, 2020Updated 6 years ago
- Code for Invariant Policy Optimization☆15Jul 22, 2020Updated 5 years ago
- ☆12Oct 24, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆13Dec 6, 2018Updated 7 years ago
- Code for the benchmark containing dataset, models and metrics for productive concept learning -- a kind of compositional reasoning task t…☆17Jul 22, 2021Updated 4 years ago
- ☆33Aug 30, 2024Updated last year
- Change-Based Exploration Transfer☆35Apr 24, 2022Updated 4 years ago
- Invariant Causal Prediction for Block MDPs☆44Jun 11, 2020Updated 5 years ago
- TensorFlow implementation for our paper "Learning Long-Term Reward Redistribution via Randomized Return Decomposition"☆19Mar 17, 2022Updated 4 years ago
- Data-Driven NetHack Tools: Datasets (30+) and recurrent-baselines (AWAC, BC, CQL, IQL, REM)☆43Aug 22, 2023Updated 2 years ago
- Distributed & asynchronous DQN implementation using gRPC and PyTorch.☆10Feb 15, 2021Updated 5 years ago
- ♊ Minimal PyTorch Twin Delayed DDPG (TD3) implementation☆10Jun 20, 2021Updated 4 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- OpenaAI Gym Franka Emika Panda robot environment based on PyBullet.☆11Sep 8, 2023Updated 2 years ago
- ☆14Mar 5, 2024Updated 2 years ago
- Trajectory-wise Multiple Choice Learning for Dynamics Generalization in Reinforcement Learning (NeurIPS 2020)☆39Oct 27, 2020Updated 5 years ago
- Genetic algorithm for reducing the power loss in an electrical network consisting out of 119 nodes.☆12May 5, 2017Updated 9 years ago
- Official PyTorch implementation of "ACE:Off-Policy Actor-Critic with Causality-Aware Entropy Regularization"☆35May 13, 2024Updated 2 years ago
- Benchmark data for d3rlpy☆21Nov 28, 2023Updated 2 years ago
- AGAC: Adversarially Guided Actor-Critic☆47Sep 16, 2021Updated 4 years ago
- Code for the paper "SizeShiftReg: a Regularization Method for Improving Size-Generalization in Graph Neural Networks"☆12Jan 17, 2023Updated 3 years ago
- Code for Optimistic Exploration even with a Pessimistic Initialisation☆14Aug 4, 2020Updated 5 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Code for implemeting a conditional DDPM trained on CIFAR10☆14Jan 15, 2024Updated 2 years ago
- This is the official implementation of NeurIPS 2022 paper "Generalizing Goal-Conditioned Reinforcement Learning with Variational Causal R…☆35Jan 25, 2023Updated 3 years ago
- ☆10Aug 8, 2021Updated 4 years ago
- ☆21Mar 19, 2024Updated 2 years ago
- Continual Reinforcement Learning in 3D Non-stationary Environments☆39Jun 16, 2019Updated 6 years ago
- Pytorch code for "Learning Guidance Rewards with Trajectory-space Smoothing" (NeurIPS 2020)☆12Jul 7, 2021Updated 4 years ago
- Repository for Skill Set Optimization☆14Jul 26, 2024Updated last year
- ☆10Oct 15, 2020Updated 5 years ago
- ☆15Oct 26, 2020Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Re-implementation of Progressive Neural Networks with PyTorch☆15Jul 25, 2024Updated last year
- PyTorch implementation of R2D2 (Recurrent Reply Distributed DQN)☆13Nov 14, 2019Updated 6 years ago
- Datasets for data-driven deep reinforcement learning with Atari (wrapper for datasets released by Google)☆127Aug 30, 2024Updated last year
- MATLAB code and data for the paper “Optimal energy management of offshore wind farms considering the combination of overplanting and dyna…☆18Jun 23, 2024Updated last year
- Code associated with the NeurIPS19 paper "Weighted Linear Bandits in Non-Stationary Environments"☆17Nov 14, 2019Updated 6 years ago
- Code for optimal execution☆12Oct 29, 2020Updated 5 years ago
- Implementation of TWOSOME☆82Jan 11, 2025Updated last year