Implementation of "Training Agents using Upside-Down Reinforcement Learning" (https://arxiv.org/pdf/1912.02877.pdf)
☆17 · Dec 17, 2019 · Updated 6 years ago
Alternatives and similar repositories for upsideDownRL
Users who are interested in upsideDownRL are comparing it to the libraries listed below.
- Round 1 Starter Kit for the MarLo challenge ☆21 · Sep 27, 2018 · Updated 7 years ago
- Learning RNN Hierarchies ☆45 · Jun 22, 2016 · Updated 9 years ago
- Surprise-based intrinsic motivation for deep reinforcement learning ☆21 · Mar 6, 2017 · Updated 9 years ago
- Accompanying code for "A Simple Loss Function for Improving the Convergence and Accuracy of Visual Question Answering Models" CVPR 2017 V… ☆15 · Aug 2, 2017 · Updated 8 years ago
- Code for Optimistic Exploration even with a Pessimistic Initialisation ☆14 · Aug 4, 2020 · Updated 5 years ago
- PyTorch implementation of SAC-Q Reinforcement Learning Algorithm (tested on OpenAI Gym environments) ☆38 · Feb 13, 2021 · Updated 5 years ago
- Conditional Random Fields implemented as Lasagne layer ☆10 · Jul 22, 2016 · Updated 9 years ago
- Upside-Down Reinforcement Learning (⅂ꓤ) implementation in PyTorch. Based on the paper published by Jürgen Schmidhuber. ☆78 · Aug 13, 2020 · Updated 5 years ago
- ☆23 · Nov 9, 2021 · Updated 4 years ago
- Code for Attentive Recurrent Comparators ☆56 · Mar 3, 2017 · Updated 9 years ago
- Code for "Meta Learning Backpropagation And Improving It" @ NeurIPS 2021 https://arxiv.org/abs/2012.14905 ☆33 · Jan 9, 2022 · Updated 4 years ago
- ☆13 · Dec 28, 2023 · Updated 2 years ago
- ☆17 · Jun 1, 2021 · Updated 4 years ago
- A working implementation of the Categorical DQN (Distributional RL). ☆95 · Apr 7, 2018 · Updated 8 years ago
- Implementation of Schmidhuber's Upside Down Reinforcement Learning paper in PyTorch ☆27 · Jan 16, 2020 · Updated 6 years ago
- Code for the paper "Continual Model-Based Reinforcement Learning with Hypernetworks" ☆15 · Jul 28, 2021 · Updated 4 years ago
- ☆11 · Oct 5, 2020 · Updated 5 years ago
- Limitations of the Empirical Fisher Approximation ☆49 · Mar 3, 2025 · Updated last year
- Code for the work in: https://www.nature.com/articles/s41534-020-00305-x Basically a generative neural network to tackle the classical ca… ☆15 · Apr 17, 2025 · Updated last year
- Converts Quora's new NLU dataset to SNLI txt/jsonl format, plus test/dev split, tokenization. ☆14 · Jan 27, 2017 · Updated 9 years ago
- ☆15 · Sep 5, 2016 · Updated 9 years ago
- ☆54 · Oct 28, 2021 · Updated 4 years ago
- ☆13 · Aug 10, 2021 · Updated 4 years ago
- Code for 'Inference Suboptimality in Variational Autoencoders' ☆10 · May 22, 2020 · Updated 5 years ago
- ☆17 · May 30, 2018 · Updated 7 years ago
- Solving Competition Geometry Problems in Lean ☆37 · Aug 26, 2025 · Updated 8 months ago
- Experiments from "The Description Length of Deep Learning Models" ☆10 · Aug 1, 2018 · Updated 7 years ago
- Personal summaries of deep learning and AI papers ☆30 · Jan 10, 2021 · Updated 5 years ago
- Deep learning implementations (Asynchronous Deep Q-Learning) of multiple Game Theory algorithms for adversarial learning (WoLF-PHC, GIGA-… ☆15 · Sep 19, 2017 · Updated 8 years ago
- ☆13 · May 18, 2021 · Updated 4 years ago
- ☆41 · Apr 27, 2022 · Updated 4 years ago
- Researching the forward-backward algorithm ☆11 · Aug 3, 2018 · Updated 7 years ago
- Code for the "Binding via Reconstruction Clustering" paper ☆21 · Jan 19, 2016 · Updated 10 years ago
- 🤖 Implementation of Self Normalizing Networks (SNN) in PyTorch. ☆13 · Jun 19, 2017 · Updated 8 years ago
- Implementation of Tactical Optimistic and Pessimistic value estimation ☆25 · Jul 18, 2023 · Updated 2 years ago
- Logarithmic Reinforcement Learning ☆28 · Apr 7, 2023 · Updated 3 years ago
- A gMLP (gated MLP) implementation in Tensorflow 1.x, as described in the paper "Pay Attention to MLPs" (2105.08050). ☆16 · Aug 31, 2021 · Updated 4 years ago
- FNV hash collision generator ☆12 · Mar 2, 2017 · Updated 9 years ago
- Source Codes of graphSEAT (CIKM'20) ☆16 · Jan 19, 2021 · Updated 5 years ago