bay3s / rl-squaredView external linksLinks
RL^2: Fast Reinforcement Learning via Slow Reinforcement Learning
☆15May 24, 2023Updated 2 years ago
Alternatives and similar repositories for rl-squared
Users that are interested in rl-squared are comparing it to the libraries listed below
Sorting:
- 뇌를 자극하는 시스템 프로그래밍☆13Mar 2, 2023Updated 2 years ago
- Implementation of 'RL^2: Fast Reinforcement Learning via Slow Reinforcement Learning'☆72Jan 1, 2022Updated 4 years ago
- ☆12Sep 1, 2024Updated last year
- 记事本,提醒☆12Jan 16, 2025Updated last year
- Demo of fine-tuning QA models for answering FAQ of cloud providers documentation☆11Mar 7, 2023Updated 2 years ago
- QGFN: Controllable Greediness with Action Values - Code☆11May 17, 2024Updated last year
- Official implementation for "How Should We Meta-Learn Reinforcement Learning Algorithms?"☆23Sep 7, 2025Updated 5 months ago
- ☆13Mar 21, 2023Updated 2 years ago
- Learning bisimulation metrics for control, particularly suited to sparse reward settings☆10Feb 28, 2023Updated 2 years ago
- The Compositionality article class.☆13Jun 12, 2025Updated 8 months ago
- attention으로 시계열 예측은 할 수 없을까☆10Apr 30, 2021Updated 4 years ago
- Package to Train LANs (Likelihood approximation networks)☆13Jan 22, 2026Updated 3 weeks ago
- Code for simulations in "Computational mechanisms of curiosity and goal-directed exploration"☆10May 22, 2020Updated 5 years ago
- A package to convert range data from ROS range topics to pointclouds☆10Jun 30, 2017Updated 8 years ago
- ☆10Apr 5, 2022Updated 3 years ago
- Machine Learning Reading Group☆11Sep 15, 2023Updated 2 years ago
- ESfP: Event-based Shape from Polarization (CVPR 2023)☆18May 9, 2023Updated 2 years ago
- Summarization with Pointer-Generator Networks☆15Sep 1, 2020Updated 5 years ago
- A list of papers regarding generalization in (deep) reinforcement learning☆11Aug 13, 2023Updated 2 years ago
- Deploy remote development server of IntelliJ on Oracle's Ampere processors☆14Nov 25, 2022Updated 3 years ago
- A survey on machine learning for combinatorial optimization.☆13Dec 27, 2021Updated 4 years ago
- Spike sorter for electrophysiology data☆11Apr 27, 2018Updated 7 years ago
- ☆12Aug 28, 2020Updated 5 years ago
- Implementations of solutions to the continuous mountain car problem. Using OpenAI Gym and Tensorflow 1.1.☆11Jan 29, 2018Updated 8 years ago
- ☆12Oct 23, 2025Updated 3 months ago
- Distributional Successor Features Enable Zero-Shot Policy Optimization☆13Apr 11, 2025Updated 10 months ago
- The Pytorch implementation of Time aware and Feature similarity Self Attention in Vessel Fuel Consumption Prediction☆12Dec 15, 2021Updated 4 years ago
- Pallet loading problem solver with recursive partitioning approach for the packing of different rectangles in a rectangle.☆13Oct 1, 2012Updated 13 years ago
- Official Repository for 'Promptable Behaviors: Personalizing Multi-Objective Rewards from Human Preferences' (CVPR 2024)☆16Mar 29, 2024Updated last year
- Distributed Priortized Experience Replay☆10Aug 8, 2018Updated 7 years ago
- A collection of Meta-Reinforcement Learning algorithms in PyTorch☆51Jul 16, 2024Updated last year
- ☆15Apr 30, 2023Updated 2 years ago
- Hangman Game implementation using n-gram language model in NLP, achieved an accuracy of more than 50%☆13Jul 18, 2023Updated 2 years ago
- Evaluation of Neural Networks for Classifying Electroencephalography (EEG) Data☆11Mar 15, 2022Updated 3 years ago
- Implementation of the paper <Model-based Reinforcement Learning for Predictions and Control for Limit Order Books (Wei et al., J.P. Morga…☆11Aug 22, 2023Updated 2 years ago
- Tools for computational psychiatry research.☆11Dec 8, 2024Updated last year
- Research attempting to beat a naive dca with volatility forecasts and range position based weightings☆12Jul 9, 2021Updated 4 years ago
- ☆13May 21, 2023Updated 2 years ago
- 元强化学习MAML实现, 修改了部分老旧而不能运行的代码, 并可以通过render直接查看训练的结果☆11Dec 2, 2025Updated 2 months ago