YYCAAA/V-MPO_Lunarlander

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/YYCAAA/V-MPO_Lunarlander)

YYCAAA / V-MPO_Lunarlander

Simple implementation of V-MPO proposed in https://arxiv.org/abs/1909.12238

☆48

Alternatives and similar repositories for V-MPO_Lunarlander

Users that are interested in V-MPO_Lunarlander are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

jsikyoon / V-MPO_torch
View on GitHub
V-MPO torch version with DMLab30 and GTrXL
☆13Mar 1, 2021Updated 5 years ago
wisnunugroho21 / reinforcement_learning_v_mpo
View on GitHub
Deep Reinforcement Learning by using an on-policy adaptation of Maximum a Posteriori Policy Optimization (MPO)
☆16Oct 23, 2021Updated 4 years ago
daisatojp / mpo
View on GitHub
PyTorch Implementation of the Maximum a Posteriori Policy Optimisation
☆84Nov 19, 2022Updated 3 years ago
frt03 / jax_dt
View on GitHub
Minimal Decision Transformer Implementation written in Jax (Flax).
☆18Aug 8, 2022Updated 3 years ago
danijar / crafter-baselines
View on GitHub
Docker containers of baseline agents for the Crafter environment
☆30Dec 14, 2021Updated 4 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
acyclics / MPO
View on GitHub
Pytorch implementation of "Maximum a Posteriori Policy Optimization" with Retrace for Discrete gym environments
☆29Sep 10, 2020Updated 5 years ago
jerrodparker20 / adaptive-transformers-in-rl
View on GitHub
Adaptive Attention Span for Reinforcement Learning
☆136May 11, 2020Updated 6 years ago
alantess / gtrxl-torch
View on GitHub
Gated Transformer Model for Computer Vision
☆25Jul 11, 2021Updated 5 years ago
real-stanford / ASPiRe
View on GitHub
[NeurIPS 2022] ASPiRe: Adaptive Skill Priors for Reinforcement Learning
☆13Oct 19, 2022Updated 3 years ago
dnishio / DSAC
View on GitHub
The implementation of Discriminator Soft Actor Critic
☆15Jan 25, 2020Updated 6 years ago
kenjyoung / dreamerv2_JAX
View on GitHub
An implementation of DreamerV2 written in JAX, with support for running multiple random seeds of an experiment on a single GPU.
☆18Jan 16, 2023Updated 3 years ago
tkipf / gym-gridworld
View on GitHub
Template for building 2D grid worlds with OpenAI Gym and Pycolab
☆14Jun 12, 2019Updated 7 years ago
prasoongoyal / PixL2R
View on GitHub
☆17Dec 21, 2020Updated 5 years ago
yunshiuan / tomnet-project
View on GitHub
This repo contains the ToMnet+ model for preference inference. Developed by Yun-Shiuan, Edwinn, Hsin-Yi, and Elaine.
☆10Feb 24, 2023Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
dhruvramani / Transformers-RL
View on GitHub
An easy PyTorch implementation of "Stabilizing Transformers for Reinforcement Learning"
☆183Feb 21, 2023Updated 3 years ago
ryanxhr / BEAR
View on GitHub
Pytorch implementation of BEAR in "Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction"
☆11Oct 29, 2019Updated 6 years ago
Asap7772 / PTR
View on GitHub
This repository contains the implementation of the PTR algorithm described in the paper: Pre-Training for Robots: Leveraging Diverse Mult…
☆32Oct 26, 2022Updated 3 years ago
feidieufo / homework
View on GitHub
Assignments for CS294-112.
☆30Sep 11, 2019Updated 6 years ago
eilab-gt / NovGrid
View on GitHub
Novelty MiniGrid--NovGrid--is an extension of MiniGrid environment that allows for the world properties and dynamics to change according …
☆34May 21, 2024Updated 2 years ago
uoe-agents / TED
View on GitHub
Official repository for "Temporal Disentanglement of Representations for Improved Generalisation in Reinforcement Learning".
☆13Jan 25, 2023Updated 3 years ago
jacooba / hyper
View on GitHub
Code for the papers Hypernetworks in Meta-Reinforcement Learning (Beck et al., 2022) and Recurrent Hypernetworks are Surprisingly Strong …
☆21Jul 31, 2024Updated last year
Wenminggong / PbRL_for_PHRI
View on GitHub
code for "Decoupled Preference-based Reinforcement Learning for Personalized Human-Robot Interaction"
☆11Jul 9, 2022Updated 4 years ago
jurgisp / memory-maze
View on GitHub
Evaluating long-term memory of reinforcement learning algorithms
☆180Jun 23, 2023Updated 3 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
ademiadeniji / irm
View on GitHub
Intrinsic Reward Matching (IRM) implementation (from Adeniji and Xie et al 2022)
☆42Jan 13, 2024Updated 2 years ago
suyoung-lee / LDM
View on GitHub
Latent Dynamics Mixture, NeurIPS 2021
☆18Oct 25, 2022Updated 3 years ago
HomebrewML / Olmax
View on GitHub
HomebrewNLP in JAX flavour for maintable TPU-Training
☆50Jan 20, 2024Updated 2 years ago
RodkinIvan / Transformer-RL
View on GitHub
Transformers (GTrXL & CoBERL) applied to RL tasks
☆29Aug 18, 2022Updated 3 years ago
robintyh1 / icml2021-pengqlambda
View on GitHub
Revisiting Peng's Q(lambda) for Modern Reinforcement Learning
☆15Jul 23, 2021Updated 5 years ago
Michael-Beukman / RobocupGym
View on GitHub
Reinforcement Learning inside a 3D soccer simulation
☆37Sep 15, 2024Updated last year
BruceGeLi / TCE_RL
View on GitHub
Temporally Correlated Episodic Reinforcement Learning, ICLR 24
☆12Apr 8, 2024Updated 2 years ago
polixir / causal-mbrl
View on GitHub
Toolkit of Causal Model-based Reinforcement Learning.
☆33Jun 5, 2023Updated 3 years ago
FLAIROx / cultural-accumulation
View on GitHub
☆16Jul 16, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
mlpc-ucsd / XTRA
View on GitHub
On the Feasibility of Cross-Task Transfer with Model-Based Reinforcement Learning
☆16Apr 30, 2023Updated 3 years ago
nathangrinsztajn / Box-World
View on GitHub
Implementation of the Box-World environment from the paper "Relational Deep Reinforcement Learning"
☆47Oct 3, 2023Updated 2 years ago
denisyarats / dmc2gym
View on GitHub
OpenAI Gym wrapper for the DeepMind Control Suite
☆229May 19, 2024Updated 2 years ago
illidanlab / rpg
View on GitHub
Ranking Policy Gradient
☆23Nov 27, 2019Updated 6 years ago
timoklein / alphazero-gym
View on GitHub
AlphaZero for continuous control tasks
☆23Dec 7, 2022Updated 3 years ago
sparisi / cbet
View on GitHub
Change-Based Exploration Transfer
☆35Apr 24, 2022Updated 4 years ago
MaxDu17 / BehaviorRetrieval
View on GitHub
Code for the Behavior Retrieval Paper
☆35Jul 24, 2023Updated 3 years ago