Code for demonstration example-task in RUDDER blog
☆24May 19, 2020Updated 6 years ago
Alternatives and similar repositories for rudder-demonstration-code
Users that are interested in rudder-demonstration-code are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A practical step-by-step guide to applying RUDDER☆35Nov 12, 2019Updated 6 years ago
- RUDDER: Return Decomposition for Delayed Rewards☆48Sep 17, 2020Updated 5 years ago
- ☆14Oct 10, 2025Updated 8 months ago
- Codebase for "Causal Induction from Visual Observations for Goal-Directed Tasks"☆14Feb 25, 2020Updated 6 years ago
- Code for Invariant Policy Optimization☆15Jul 22, 2020Updated 5 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆31Jan 16, 2023Updated 3 years ago
- ☆13Dec 6, 2018Updated 7 years ago
- Code for the benchmark containing dataset, models and metrics for productive concept learning -- a kind of compositional reasoning task t…☆17Jul 22, 2021Updated 4 years ago
- [SIGKDD' 24] PyTorch implementation of Temporal Prototype-Aware Learning for Active Voltage Control on Power Distribution Networks☆14Jul 28, 2024Updated last year
- Code to reproduce results on toy tasks and companion blog for the paper.☆23Jun 8, 2022Updated 4 years ago
- Change-Based Exploration Transfer☆35Apr 24, 2022Updated 4 years ago
- Invariant Causal Prediction for Block MDPs☆44Jun 11, 2020Updated 6 years ago
- TensorFlow implementation for our paper "Learning Long-Term Reward Redistribution via Randomized Return Decomposition"☆19Mar 17, 2022Updated 4 years ago
- Data-Driven NetHack Tools: Datasets (30+) and recurrent-baselines (AWAC, BC, CQL, IQL, REM)☆43Aug 22, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Distributed & asynchronous DQN implementation using gRPC and PyTorch.☆10Feb 15, 2021Updated 5 years ago
- ♊ Minimal PyTorch Twin Delayed DDPG (TD3) implementation☆10Jun 20, 2021Updated 4 years ago
- OpenaAI Gym Franka Emika Panda robot environment based on PyBullet.☆12Sep 8, 2023Updated 2 years ago
- ☆14Mar 5, 2024Updated 2 years ago
- Genetic algorithm for reducing the power loss in an electrical network consisting out of 119 nodes.☆12May 5, 2017Updated 9 years ago
- ☆15May 20, 2023Updated 3 years ago
- Official PyTorch implementation of "ACE:Off-Policy Actor-Critic with Causality-Aware Entropy Regularization"☆35May 13, 2024Updated 2 years ago
- Benchmark data for d3rlpy☆21Nov 28, 2023Updated 2 years ago
- AGAC: Adversarially Guided Actor-Critic☆47Sep 16, 2021Updated 4 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Code for the paper "SizeShiftReg: a Regularization Method for Improving Size-Generalization in Graph Neural Networks"☆12Jan 17, 2023Updated 3 years ago
- CausalWorld: A Robotic Manipulation Benchmark for Causal Structure and Transfer Learning☆245Nov 1, 2022Updated 3 years ago
- Model-based Offline Policy Optimization re-implement all by pytorch☆42Sep 13, 2023Updated 2 years ago
- Code for implemeting a conditional DDPM trained on CIFAR10☆14Jan 15, 2024Updated 2 years ago
- Code for the paper "Importance Weighted Transfer of Samples in Reinforcement Learning" (ICML 2018).☆16May 29, 2018Updated 8 years ago
- ☆10Aug 8, 2021Updated 4 years ago
- ☆21Mar 19, 2024Updated 2 years ago
- Continual Reinforcement Learning in 3D Non-stationary Environments☆39Jun 16, 2019Updated 6 years ago
- Steam Inventory Lister and Inventory Worth Calculator with Next.js (React) + Tailwind + DaisyUI and ChakraUI☆18Oct 2, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Pytorch code for "Learning Guidance Rewards with Trajectory-space Smoothing" (NeurIPS 2020)☆12Jul 7, 2021Updated 4 years ago
- A curated list of resources (surveys, papers, benchmarks, and opensource projects) on Rubrics☆86Updated this week
- Repository for Skill Set Optimization☆14Jul 26, 2024Updated last year
- ☆15Oct 26, 2020Updated 5 years ago
- Re-implementation of Progressive Neural Networks with PyTorch☆15Jul 25, 2024Updated last year
- The source code of our ACL paper "A Training-free and Reference-free Summarization Evaluation Metric via Centrality-weighted Relevance an…☆14May 6, 2023Updated 3 years ago
- PyTorch implementation of R2D2 (Recurrent Reply Distributed DQN)☆13Nov 14, 2019Updated 6 years ago