E-MAML, and RL-MAML baseline implemented in Tensorflow v1
☆17Dec 7, 2019Updated 6 years ago
Alternatives and similar repositories for e-maml
Users that are interested in e-maml are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- krazy grid world☆25Mar 2, 2020Updated 6 years ago
- Clean, extensible implementation of MACAW [ICML 2021]☆12Dec 7, 2021Updated 4 years ago
- openAI gym env for reversi/othello game☆20Nov 6, 2023Updated 2 years ago
- Meta-learning Gaussian process (GP) priors via PAC-Bayes bounds☆26Jan 25, 2024Updated 2 years ago
- ☆26Mar 16, 2023Updated 3 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- video prediction and world model research☆14Jun 10, 2022Updated 3 years ago
- Official implementation for the paper "Offline Meta RL - Identifiability Challenges and Effective Data Collection Strategies", NeurIPS 20…☆31Nov 23, 2021Updated 4 years ago
- Taming MAML: efficient unbiased meta-reinforcement learning☆30Sep 30, 2022Updated 3 years ago
- Revisiting Peng's Q(lambda) for Modern Reinforcement Learning☆15Jul 23, 2021Updated 4 years ago
- ☆23Apr 2, 2024Updated 2 years ago
- Code for FOCAL Paper Published at ICLR 2021☆55Dec 4, 2023Updated 2 years ago
- Implementation of our paper "Meta Reinforcement Learning with Task Embedding and Shared Policy"☆35May 17, 2019Updated 6 years ago
- A System for Morphology-Task Generalization via Unified Representation and Behavior Distillation (ICLR2023)☆14Feb 3, 2023Updated 3 years ago
- ☆44Oct 27, 2018Updated 7 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Code for paper "Successor Uncertainties: Exploration and Uncertainty in Temporal Difference Learning" by David Janz*, Jiri Hron*, Przemys…☆21Feb 24, 2023Updated 3 years ago
- Implementation of Model-Agnostic Meta-Learning (MAML) applied on Reinforcement Learning problems in TensorFlow 2.☆27May 11, 2021Updated 4 years ago
- ☆15Nov 22, 2019Updated 6 years ago
- A Framework for Safe and Accelerated Reinforcement Learning-based Radio Resource Management☆20Oct 1, 2022Updated 3 years ago
- Proposed solution to the Flatland challenge (https://www.aicrowd.com/challenges/flatland-challenge), solving the Vehicle Rescheduling Pro…☆14Jan 22, 2020Updated 6 years ago
- ☆16Aug 7, 2021Updated 4 years ago
- [ICML 2023] Meta-SAGE: Scale Meta-Learning Scheduled Adaptation with Guided Exploration for Mitigating Scale Shift on Combinatorial Optim…☆10Dec 19, 2023Updated 2 years ago
- MetaGenRL, a novel meta reinforcement learning algorithm. Unlike prior work, MetaGenRL can generalize to new environments that are entire…☆69Jun 5, 2020Updated 5 years ago
- HSML Dynamic version for ICML 2019☆12Jul 11, 2019Updated 6 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- (Personal project) Pruning algorithm for DNNs using "lottery ticket" pruning☆10Dec 8, 2022Updated 3 years ago
- A standard bare-bone ROS Gazebo simulator for the Franka Emika Panda robot built using inbuilt Gazebo ROS controllers and RobotHW interfa…☆11May 3, 2021Updated 4 years ago
- Latent World Models For Intrinsically Motivated Exploration | Official repository☆22Apr 28, 2021Updated 4 years ago
- Behavioural cloning solution to MineRL2020 competition☆18Mar 6, 2021Updated 5 years ago
- ☆11Apr 12, 2020Updated 6 years ago
- Python library that prints a dict as PlantUML code.☆12Dec 8, 2022Updated 3 years ago
- ☆19Nov 25, 2022Updated 3 years ago
- Inverse Kinematics demystify☆13Jun 16, 2020Updated 5 years ago
- Implementation of bug1and bug2 algorithms☆18Apr 30, 2020Updated 5 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- The Intermediate Goal of the project is to train a GPT like architecture to learn to summarise reddit posts from human preferences, as th…☆12Jul 14, 2021Updated 4 years ago
- Highly scalable 2D JAX physics engine.☆64Feb 20, 2026Updated last month
- Pytorch implementations of RL algorithms, focusing on model-based, lifelong, reset-free, and offline algorithms. Official codebase for Re…☆110Jan 23, 2022Updated 4 years ago
- Decoupled Reward-free ExplorAtion and Execution for Meta-reinforcement learning☆90Feb 13, 2023Updated 3 years ago
- Meta RL codebase for Unstable Baselines☆22Dec 6, 2022Updated 3 years ago
- Implementation of: Kristiadi, Agustinus, and Asja Fischer. "Predictive Uncertainty Quantification with Compound Density Networks." (2019)…☆16May 26, 2022Updated 3 years ago
- ☆11Feb 23, 2016Updated 10 years ago