A practical step-by-step guide to applying RUDDER
☆35 · Nov 12, 2019 · Updated 6 years ago
Alternatives and similar repositories for rudder-a-practical-tutorial
Users who are interested in rudder-a-practical-tutorial are comparing it to the repositories listed below
- RUDDER: Return Decomposition for Delayed Rewards ☆48 · Sep 17, 2020 · Updated 5 years ago
- Code for demonstration example-task in RUDDER blog ☆24 · May 19, 2020 · Updated 5 years ago
- Code to reproduce results on toy tasks and companion blog for the paper. ☆22 · Jun 8, 2022 · Updated 3 years ago
- Code for reproducing the results from the paper Avoiding Side Effects in Complex Environments ☆12 · Jun 3, 2021 · Updated 4 years ago
- Pytorch based implementation of Upside Down Reinforcement Learning (UDRL) by J. Schmidhuber et al. ☆11 · May 1, 2020 · Updated 5 years ago
- Reinforcement learning in pure JAX. ☆13 · Dec 24, 2025 · Updated 2 months ago
- Code for Optimistic Exploration even with a Pessimistic Initialisation ☆14 · Aug 4, 2020 · Updated 5 years ago
- Code for the paper Novelty Search in Representational Space for Sample Efficient Exploration presented at NeurIPS 2020. ☆14 · Jul 16, 2024 · Updated last year
- ☆36 · Aug 10, 2018 · Updated 7 years ago
- Object-aware Contrastive Learning for Debiased Scene Representation (NeurIPS 2021) ☆45 · Oct 25, 2021 · Updated 4 years ago
- Official implementation of "Approximating Gradients for Differentiable Quality Diversity in Reinforcement Learning" ☆22 · Oct 3, 2022 · Updated 3 years ago
- In Progress: State of the art Distributed Distributional Deep Deterministic Policy Gradient algorithm implementation in pytorch. ☆19 · Jun 15, 2018 · Updated 7 years ago
- Code for reproducing experiments in Model-Based Active Exploration, ICML 2019 ☆81 · Jul 23, 2019 · Updated 6 years ago
- Reinforcement Learning Seminar at the Chinese University of Hong Kong, Shenzhen, China. ☆21 · Nov 17, 2023 · Updated 2 years ago
- Code for EmBERT, a transformer model for embodied, language-guided visual task completion. ☆60 · Apr 10, 2024 · Updated last year
- On the model-based stochastic value gradient for continuous reinforcement learning ☆57 · Jan 7, 2026 · Updated last month
- ☆30 · Jan 17, 2022 · Updated 4 years ago
- NeurIPS 2019: DQN(λ) = Deep Q-Network + λ-returns. ☆25 · May 20, 2024 · Updated last year
- A library of probabilistic model based RL algorithms in pytorch ☆107 · Apr 14, 2021 · Updated 4 years ago
- Learning Action-Value Gradients in Model-based Policy Optimization ☆32 · Sep 7, 2021 · Updated 4 years ago
- Code for NAACL 2022 paper "Reframing Human-AI Collaboration for Generating Free-Text Explanations" ☆31 · Apr 28, 2023 · Updated 2 years ago
- ☆31 · Jan 16, 2023 · Updated 3 years ago
- ☆31 · Jul 1, 2019 · Updated 6 years ago
- Repository for our ICML 2019 paper: Curiosity-Bottleneck ☆34 · Nov 21, 2022 · Updated 3 years ago
- PyTorch implementation of SAC-Discrete. ☆314 · Jul 25, 2024 · Updated last year
- Generalised UDRL ☆37 · May 12, 2022 · Updated 3 years ago
- Sudoku solver in Golang ☆10 · Sep 6, 2020 · Updated 5 years ago
- ☆11 · Jun 4, 2023 · Updated 2 years ago
- Topic modelling and co-occurrence analysis of the bio-economy ☆10 · Jul 17, 2017 · Updated 8 years ago
- Man in the middle attack demo ☆11 · Jan 14, 2018 · Updated 8 years ago
- Contextual Bandit Spectral Representation Learner ☆12 · Oct 25, 2022 · Updated 3 years ago
- Supporting code for "Learning to Solve Combinatorial Graph Partitioning Problems via Efficient Exploration". ☆13 · Jun 18, 2022 · Updated 3 years ago
- ☆13 · Jul 20, 2023 · Updated 2 years ago
- ☆44 · Dec 4, 2018 · Updated 7 years ago
- Accelerated replay buffers in JAX ☆46 · Sep 17, 2022 · Updated 3 years ago
- JAX code for the paper "Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation" ☆44 · Jun 14, 2021 · Updated 4 years ago
- ☆12 · Jul 11, 2022 · Updated 3 years ago
- ☆13 · May 30, 2019 · Updated 6 years ago
- Python bindings for OptFrame C++ Functional Core ☆13 · May 18, 2025 · Updated 9 months ago