Upside-Down Reinforcement Learning (⅂ꓤ) implementation in PyTorch. Based on the paper published by Jürgen Schmidhuber.
☆78Aug 13, 2020Updated 5 years ago
Alternatives and similar repositories for Upside-Down-Reinforcement-Learning
Users that are interested in Upside-Down-Reinforcement-Learning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Landing a Spaceship using Upside-Down Reinforcement Learning (a.k.a ⅂ꓤ)☆13Oct 25, 2023Updated 2 years ago
- The implementation of "The Kanerva Machine" with Pytorch and Pyro☆12Jun 14, 2018Updated 7 years ago
- on-policy optimization baselines for deep reinforcement learning☆32Apr 3, 2020Updated 5 years ago
- Reimplementation code for the paper "Generative Temporal Models with Spatial Memory for Partially Observed Environments"☆32Jul 20, 2022Updated 3 years ago
- Separating value functions across time-scales.☆17May 13, 2019Updated 6 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Variational Reinforcement Learning☆17Jul 25, 2024Updated last year
- Implementation of Schmidhuber's Upside Down Reinforcement Learning paper in PyTorch☆27Jan 16, 2020Updated 6 years ago
- Code for the paper Alpha Zero in Continuous Action Space (A0C) (https://arxiv.org/pdf/1805.09613.pdf)☆15Jan 19, 2021Updated 5 years ago
- Rainbow DQN implementation accompanying the paper "Fast and Data-Efficient Training of Rainbow" which reaches 205.7 median HNS after 10M …☆44Dec 11, 2021Updated 4 years ago
- Implementation of "Training Agents using Upside-Down Reinforcement Learning (https://arxiv.org/pdf/1912.02877.pdf)"☆17Dec 17, 2019Updated 6 years ago
- Codes accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322)☆51Dec 8, 2022Updated 3 years ago
- Collection of Deep Reinforcement Learning Algorithms implemented in PyTorch.☆81Oct 25, 2020Updated 5 years ago
- PhD Publications and Thesis on LASSO Model Predictive Control☆20Jun 2, 2019Updated 6 years ago
- HaVSA (Have-Saa) is a Haskell implementation of the Version Space Algebra Machine Learning technique described by Tessa Lau.☆12Jul 8, 2017Updated 8 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- PyTorch implementation of the Munchausen Reinforcement Learning Algorithms M-DQN and M-IQN☆45Oct 4, 2020Updated 5 years ago
- Change-Based Exploration Transfer☆35Apr 24, 2022Updated 3 years ago
- PyTorch implementation of Soft-Actor-Critic and Prioritized Experience Replay (PER) + Emphasizing Recent Experience (ERE) + Munchausen RL…☆295Feb 24, 2021Updated 5 years ago
- Jax implementation of VIT-VQGAN☆10Jan 25, 2024Updated 2 years ago
- Pytorch based implementation of Upside Down Reinforcement Learning (UDRL) by J. Schmidhuber et al.☆11May 1, 2020Updated 5 years ago
- Open source code combining implementations of Upside Down Reinforcement Learning and Reward Conditioned Policies☆19Mar 10, 2021Updated 5 years ago
- oracle-structured minimization method☆13Sep 1, 2021Updated 4 years ago
- DQN-Atari-Agents: Modularized & Parallel PyTorch implementation of several DQN Agents, i.a. DDQN, Dueling DQN, Noisy DQN, C51, Rainbow,…☆122Dec 18, 2020Updated 5 years ago
- ☆25Jan 2, 2019Updated 7 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Auxiliary variable Markov chain Monte Carlo methods☆10Oct 24, 2017Updated 8 years ago
- 3D learning environment with rigid body simulation for Linux/MacOSX☆14Dec 24, 2021Updated 4 years ago
- JAX implementations of core Deep RL algorithms☆84May 2, 2022Updated 3 years ago
- MuJoCo models for Unitree Robots☆12Nov 24, 2021Updated 4 years ago
- Generalised UDRL☆37May 12, 2022Updated 3 years ago
- Maximum Entropy-Regularized Multi-Goal Reinforcement Learning (ICML 2019)☆24May 30, 2019Updated 6 years ago
- Code for ICLR 2019 paper Learning Dynamics Model by Incorporating the Long Term Future☆50Jun 6, 2019Updated 6 years ago
- A2C training of Relational Deep Reinforcement Learning Architecture☆13Jun 22, 2022Updated 3 years ago
- Official repository for the paper "Going Beyond Linear Transformers with Recurrent Fast Weight Programmers" (NeurIPS 2021)☆51Jun 11, 2025Updated 9 months ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Code for the publication Learning to Reason with Third-Order Tensor Products.☆41Jan 14, 2019Updated 7 years ago
- Repository for the paper "Planning to Explore via Self-Supervised World Models"☆235Feb 10, 2023Updated 3 years ago
- Deep Q-Learning Auto Market Maker☆12Jun 12, 2021Updated 4 years ago
- A python implemenation of tabular MuZero for educational purposes☆21Dec 11, 2019Updated 6 years ago
- SCoRe: Training Language Models to Self-Correct via Reinforcement Learning☆16Jan 24, 2025Updated last year
- Ranking Policy Gradient☆23Nov 27, 2019Updated 6 years ago
- A2C for GVG-AI☆23Nov 7, 2018Updated 7 years ago