jlin816 / homegridLinks

A minimal home grid world environment to evaluate language understanding in interactive agents.

☆22

Alternatives and similar repositories for homegrid

Users that are interested in homegrid are comparing it to the libraries listed below

Sorting:

UT-Austin-RPL / amago
a simple and scalable agent for training adaptive policies with sequence-based RL
☆131Updated last week
d5rlbenchmark / d5rl
☆28Updated last year
SonyResearch / simba
☆99Updated 4 months ago
seohongpark / METRA
METRA: Scalable Unsupervised RL with Metric-Aware Abstraction (ICLR 2024)
☆70Updated last year
FangchenLiu / MaskDP_public
Code for "Masked Autoencoding for Scalable and Generalizable Decision Making". NeurIPS 2022
☆44Updated last year
ml-jku / L2M
Learning to Modulate pre-trained Models in RL (Decision Transformer, LoRA, Fine-tuning)
☆59Updated 9 months ago
dibyaghosh / icvf_release
Public code for "Reinforcement Learning from Passive Data via Latent Intentions"
☆89Updated last year
naumix / BiggerRegularizedOptimistic
Official implementation of the BRO algorithm
☆46Updated 5 months ago
seohongpark / HIQL
HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023)
☆86Updated 7 months ago
mazpie / mastering-urlb
[ICML 2023] Pre-train world model-based agents with different unsupervised strategies, fine-tune the agent's components selectively, and …
☆40Updated last year
schmidtdominik / LAPO
Code for the ICLR 2024 spotlight paper: "Learning to Act without Actions" (introducing Latent Action Policies)
☆116Updated 11 months ago
imgeorgiev / PWM
PWM: Policy Learning with Large World Models
☆53Updated 4 months ago
conglu1997 / v-d4rl
Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations
☆103Updated last year
jrobine / twm
Transformer-based World Models
☆83Updated 2 years ago
XuGW-Kevin / DrM
DrM, a visual RL algorithm, minimizes the dormant ratio to guide exploration-exploitation trade-offs, achieving significant improvements …
☆76Updated last year
seohongpark / HILP
Foundation Policies with Hilbert Representations (ICML 2024)
☆89Updated last year
jeffacce / play-to-policy
From Play to Policy: Conditional Behavior Generation from Uncurated Robot Data
☆54Updated 2 years ago
chandar-lab / Recall2Imagine
Recall to Imagine, a model-based RL algorithm with superhuman memory. Oral (1.2%) @ ICLR 2024
☆70Updated last year
mazpie / genrl
[NeurIPS 2024] GenRL: Multimodal-foundation world models enable grounding language and video prompts into embodied domains, by turning th…
☆78Updated 3 months ago
quasimetric-learning / quasimetric-rl
Open source code for paper "Optimal Goal-Reaching Reinforcement Learning via Quasimetric Learning" ICML 2023
☆46Updated last month
kvfrans / fre
Code for "Unsupervised Zero-Shot RL via Functional Reward Representations"
☆57Updated last year
Alescontrela / viper_rl
Using advances in generative modeling to learn reward functions from unlabeled videos.
☆132Updated last year
suraj-nair-1 / lorel
☆39Updated 3 years ago
facebookresearch / gen_dgrl
Official codebase for "The Generalization Gap in Offline Reinforcement Learning" accepted to ICLR 2024
☆28Updated 11 months ago
google-deepmind / dmc_vision_benchmark
☆26Updated last year
seohongpark / horizon-reduction
The official implementation of "Horizon Reduction Makes RL Scalable"
☆117Updated last month
seohongpark / PMA
Predictable MDP Abstraction for Unsupervised Model-Based RL (ICML 2023)
☆32Updated 2 years ago
FLAIROx / jafar
JAX reimplementation of the DeepMind paper "Genie: Generative Interactive Environments"
☆67Updated 5 months ago
frankroeder / lanro-gym
OpenAI gym environments for goal-conditioned and language-conditioned reinforcement learning
☆14Updated 4 months ago
RajGhugare19 / stitching-is-combinatorial-generalisation
[ICLR 2024] Closing the Gap between TD Learning and Supervised Learning - A Generalisation Point of View.
☆23Updated last year