Code for demonstration example-task in RUDDER blog
☆24 · May 19, 2020 · Updated 5 years ago
Alternatives and similar repositories for rudder-demonstration-code
Users that are interested in rudder-demonstration-code are comparing it to the libraries listed below.
- RUDDER: Return Decomposition for Delayed Rewards ☆48 · Sep 17, 2020 · Updated 5 years ago
- ☆14 · Oct 10, 2025 · Updated 5 months ago
- Codebase for "Causal Induction from Visual Observations for Goal-Directed Tasks" ☆14 · Feb 25, 2020 · Updated 6 years ago
- Code for Invariant Policy Optimization ☆15 · Jul 22, 2020 · Updated 5 years ago
- ☆31 · Jan 16, 2023 · Updated 3 years ago
- ☆12 · Oct 24, 2024 · Updated last year
- ☆13 · Dec 6, 2018 · Updated 7 years ago
- Code for the benchmark containing dataset, models and metrics for productive concept learning -- a kind of compositional reasoning task t… ☆17 · Jul 22, 2021 · Updated 4 years ago
- Deep direct reinforcement learning for financial signal representation and trading ☆32 · Oct 7, 2020 · Updated 5 years ago
- [SIGKDD '24] PyTorch implementation of Temporal Prototype-Aware Learning for Active Voltage Control on Power Distribution Networks ☆13 · Jul 28, 2024 · Updated last year
- ☆33 · Aug 30, 2024 · Updated last year
- Change-Based Exploration Transfer ☆35 · Apr 24, 2022 · Updated 3 years ago
- Invariant Causal Prediction for Block MDPs ☆44 · Jun 11, 2020 · Updated 5 years ago
- TensorFlow implementation for our paper "Learning Long-Term Reward Redistribution via Randomized Return Decomposition" ☆19 · Mar 17, 2022 · Updated 4 years ago
- Data-Driven NetHack Tools: Datasets (30+) and recurrent-baselines (AWAC, BC, CQL, IQL, REM) ☆43 · Aug 22, 2023 · Updated 2 years ago
- OpenAI Gym Franka Emika Panda robot environment based on PyBullet. ☆11 · Sep 8, 2023 · Updated 2 years ago
- ♊ Minimal PyTorch Twin Delayed DDPG (TD3) implementation ☆10 · Jun 20, 2021 · Updated 4 years ago
- ☆14 · Mar 5, 2024 · Updated 2 years ago
- ☆14 · May 20, 2023 · Updated 2 years ago
- Trajectory-wise Multiple Choice Learning for Dynamics Generalization in Reinforcement Learning (NeurIPS 2020) ☆39 · Oct 27, 2020 · Updated 5 years ago
- Official PyTorch implementation of "ACE: Off-Policy Actor-Critic with Causality-Aware Entropy Regularization" ☆36 · May 13, 2024 · Updated last year
- AGAC: Adversarially Guided Actor-Critic ☆47 · Sep 16, 2021 · Updated 4 years ago
- Code for the paper "SizeShiftReg: a Regularization Method for Improving Size-Generalization in Graph Neural Networks" ☆12 · Jan 17, 2023 · Updated 3 years ago
- Code for Optimistic Exploration even with a Pessimistic Initialisation ☆14 · Aug 4, 2020 · Updated 5 years ago
- Model-based Offline Policy Optimization, re-implemented entirely in PyTorch ☆40 · Sep 13, 2023 · Updated 2 years ago
- Code for the paper "Importance Weighted Transfer of Samples in Reinforcement Learning" (ICML 2018). ☆16 · May 29, 2018 · Updated 7 years ago
- ☆21 · Mar 19, 2024 · Updated 2 years ago
- ☆10 · Aug 8, 2021 · Updated 4 years ago
- Fully Customized Side Menu ☆11 · Jul 27, 2020 · Updated 5 years ago
- Steam Inventory Lister and Inventory Worth Calculator with Next.js (React) + Tailwind + DaisyUI and ChakraUI ☆18 · Oct 2, 2023 · Updated 2 years ago
- Continual Reinforcement Learning in 3D Non-stationary Environments ☆39 · Jun 16, 2019 · Updated 6 years ago
- PyTorch code for "Learning Guidance Rewards with Trajectory-space Smoothing" (NeurIPS 2020) ☆12 · Jul 7, 2021 · Updated 4 years ago
- ☆12 · Mar 21, 2024 · Updated 2 years ago
- Repository for Skill Set Optimization ☆14 · Jul 26, 2024 · Updated last year
- ☆10 · Oct 15, 2020 · Updated 5 years ago
- ☆15 · Oct 26, 2020 · Updated 5 years ago
- Re-implementation of Progressive Neural Networks with PyTorch ☆15 · Jul 25, 2024 · Updated last year
- The source code of our ACL paper "A Training-free and Reference-free Summarization Evaluation Metric via Centrality-weighted Relevance an… ☆14 · May 6, 2023 · Updated 2 years ago
- PyTorch implementation of R2D2 (Recurrent Replay Distributed DQN) ☆12 · Nov 14, 2019 · Updated 6 years ago