Implementation of "Training Agents using Upside-Down Reinforcement Learning (https://arxiv.org/pdf/1912.02877.pdf)"
☆17Dec 17, 2019Updated 6 years ago
Alternatives and similar repositories for upsideDownRL
Users that are interested in upsideDownRL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Round 1 Starter Kit for the MarLo challenge☆21Sep 27, 2018Updated 7 years ago
- Learning RNN Hierarchies☆45Jun 22, 2016Updated 9 years ago
- Surprise-based intrinsic motivation for deep reinforcement learning☆21Mar 6, 2017Updated 9 years ago
- Accompanying code for "A Simple Loss Function for Improving the Convergence and Accuracy of Visual Question Answering Models" CVPR 2017 V…☆15Aug 2, 2017Updated 8 years ago
- Code for Optimistic Exploration even with a Pessimistic Initialisation☆14Aug 4, 2020Updated 5 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- PyTorch implementation of SAC-Q Reinforcement Learning Algorithm (tested on OpenAI Gym environments)☆38Feb 13, 2021Updated 5 years ago
- Used torch.optim.lr_scheduler.CosineAnnealingLR()☆32Jul 22, 2019Updated 6 years ago
- Conditional Random Fields implemented as Lasagne layer☆10Jul 22, 2016Updated 9 years ago
- Upside-Down Reinforcement Learning (⅂ꓤ) implementation in PyTorch. Based on the paper published by Jürgen Schmidhuber.☆78Aug 13, 2020Updated 5 years ago
- ☆23Nov 9, 2021Updated 4 years ago
- Code for Attentive Recurrent Comparators☆58Mar 3, 2017Updated 9 years ago
- Code for "Meta Learning Backpropagation And Improving It" @ NeurIPS 2021 https://arxiv.org/abs/2012.14905☆33Jan 9, 2022Updated 4 years ago
- ☆17Jun 1, 2021Updated 4 years ago
- A working implementation of the Categorical DQN (Distributional RL).☆95Apr 7, 2018Updated 7 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- ☆10Jan 19, 2026Updated 2 months ago
- Implementation of Schmidhuber's Upside Down Reinforcement Learning paper in PyTorch☆27Jan 16, 2020Updated 6 years ago
- ☆12Oct 5, 2020Updated 5 years ago
- Limitations of the Empirical Fisher Approximation☆49Mar 3, 2025Updated last year
- Converts Quora's new NLU dataset to SNLI txt/jsonl format, plus test/dev split, tokenization.☆14Jan 27, 2017Updated 9 years ago
- ☆15Sep 5, 2016Updated 9 years ago
- ☆54Oct 28, 2021Updated 4 years ago
- Code for 'Inference Suboptimality in Variational Autoencoders'☆10May 22, 2020Updated 5 years ago
- ☆17May 30, 2018Updated 7 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Solving Competition Geometry Problems in Lean☆33Aug 26, 2025Updated 7 months ago
- Experiments from "The Description Length of Deep Learning Models"☆10Aug 1, 2018Updated 7 years ago
- Personal summaries of deep learning and AI papers☆31Jan 10, 2021Updated 5 years ago
- Deep learning implementations (Asynchronous Deep Q-Learning) of multiple Game Theory algorithms for adversarial learning (WoLF-PHC, GIGA-…☆15Sep 19, 2017Updated 8 years ago
- ☆13May 18, 2021Updated 4 years ago
- Researching the forward-backward algorithm☆11Aug 3, 2018Updated 7 years ago
- Code for the "Binding via Reconstruction Clustering" paper☆21Jan 19, 2016Updated 10 years ago
- Functional ANOVA☆28Nov 17, 2014Updated 11 years ago
- 🤖 Implementation of Self Normalizing Networks (SNN) in PyTorch.☆13Jun 19, 2017Updated 8 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Implementation of Tactical Optimistic and Pessimistic value estimation☆25Jul 18, 2023Updated 2 years ago
- Logarithmic Reinforcement Learning☆28Apr 7, 2023Updated 2 years ago
- Benchmarking TD3 and DDPG on PyBullet☆54Jun 19, 2019Updated 6 years ago
- FNV hash collision generator☆12Mar 2, 2017Updated 9 years ago
- Source Codes of graphSEAT (CIKM'20)☆16Jan 19, 2021Updated 5 years ago
- Performances of Reinforcement Learning Agents☆53Dec 19, 2019Updated 6 years ago
- A Docker image to run the OpenAI Gym environment in Jupyter notebooks. No host system X11 support needed, graphical parts of the Gym are …☆10Nov 4, 2016Updated 9 years ago