Simple implementation of V-MPO proposed in https://arxiv.org/abs/1909.12238
☆48Nov 10, 2020Updated 5 years ago
Alternatives and similar repositories for V-MPO_Lunarlander
Users that are interested in V-MPO_Lunarlander are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- V-MPO torch version with DMLab30 and GTrXL☆13Mar 1, 2021Updated 5 years ago
- Deep Reinforcement Learning by using an on-policy adaptation of Maximum a Posteriori Policy Optimization (MPO)☆16Oct 23, 2021Updated 4 years ago
- PyTorch Implementation of the Maximum a Posteriori Policy Optimisation☆84Nov 19, 2022Updated 3 years ago
- Minimal Decision Transformer Implementation written in Jax (Flax).☆17Aug 8, 2022Updated 3 years ago
- Docker containers of baseline agents for the Crafter environment☆30Dec 14, 2021Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Pytorch implementation of "Maximum a Posteriori Policy Optimization" with Retrace for Discrete gym environments☆29Sep 10, 2020Updated 5 years ago
- Adaptive Attention Span for Reinforcement Learning☆136May 11, 2020Updated 5 years ago
- Gated Transformer Model for Computer Vision☆25Jul 11, 2021Updated 4 years ago
- [NeurIPS 2022] ASPiRe: Adaptive Skill Priors for Reinforcement Learning☆13Oct 19, 2022Updated 3 years ago
- The implementation of Discriminator Soft Actor Critic☆15Jan 25, 2020Updated 6 years ago
- Template for building 2D grid worlds with OpenAI Gym and Pycolab☆14Jun 12, 2019Updated 6 years ago
- ☆17Dec 21, 2020Updated 5 years ago
- Code for the papers Hypernetworks in Meta-Reinforcement Learning (Beck et al., 2022) and Recurrent Hypernetworks are Surprisingly Strong …☆17Jul 31, 2024Updated last year
- This repo contains the ToMnet+ model for preference inference. Developed by Yun-Shiuan, Edwinn, Hsin-Yi, and Elaine.☆10Feb 24, 2023Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- An easy PyTorch implementation of "Stabilizing Transformers for Reinforcement Learning"☆183Feb 21, 2023Updated 3 years ago
- Pytorch implementation of BEAR in "Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction"☆11Oct 29, 2019Updated 6 years ago
- Solving Complex Dexterous Manipulation Tasks with Trajectory Optimisation and Reinforcement Learning☆23May 16, 2021Updated 4 years ago
- Novelty MiniGrid--NovGrid--is an extension of MiniGrid environment that allows for the world properties and dynamics to change according …☆35May 21, 2024Updated last year
- Assignments for CS294-112.☆30Sep 11, 2019Updated 6 years ago
- This repository contains the implementation of the PTR algorithm described in the paper: Pre-Training for Robots: Leveraging Diverse Mult…☆32Oct 26, 2022Updated 3 years ago
- Official repository for "Temporal Disentanglement of Representations for Improved Generalisation in Reinforcement Learning".☆13Jan 25, 2023Updated 3 years ago
- code for "Decoupled Preference-based Reinforcement Learning for Personalized Human-Robot Interaction"☆11Jul 9, 2022Updated 3 years ago
- Evaluating long-term memory of reinforcement learning algorithms☆167Jun 23, 2023Updated 2 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Bayesian active RL (BARL) and trajectory information planning (TIP)☆26Oct 11, 2022Updated 3 years ago
- Intrinsic Reward Matching (IRM) implementation (from Adeniji and Xie et al 2022)☆42Jan 13, 2024Updated 2 years ago
- HomebrewNLP in JAX flavour for maintable TPU-Training☆51Jan 20, 2024Updated 2 years ago
- Transformers (GTrXL & CoBERL) applied to RL tasks☆30Aug 18, 2022Updated 3 years ago
- Latent Dynamics Mixture, NeurIPS 2021☆18Oct 25, 2022Updated 3 years ago
- Temporally Correlated Episodic Reinforcement Learning, ICLR 24☆12Apr 8, 2024Updated last year
- Revisiting Peng's Q(lambda) for Modern Reinforcement Learning☆15Jul 23, 2021Updated 4 years ago
- On the Feasibility of Cross-Task Transfer with Model-Based Reinforcement Learning☆16Apr 30, 2023Updated 2 years ago
- OpenAI Gym wrapper for the DeepMind Control Suite☆229May 19, 2024Updated last year
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Implementation of the Box-World environment from the paper "Relational Deep Reinforcement Learning"☆47Oct 3, 2023Updated 2 years ago
- Toolkit of Causal Model-based Reinforcement Learning.☆33Jun 5, 2023Updated 2 years ago
- Reinforcement Learning inside a 3D soccer simulation☆37Sep 15, 2024Updated last year
- AlphaZero for continuous control tasks☆23Dec 7, 2022Updated 3 years ago
- Ranking Policy Gradient☆23Nov 27, 2019Updated 6 years ago
- Change-Based Exploration Transfer☆35Apr 24, 2022Updated 3 years ago
- ☆13Apr 25, 2024Updated last year