lifelong-learning-systems / meta-arcade

MetaArcade is a configurable environment suite for meta-learning

☆14

Alternatives and similar repositories for meta-arcade:

Users that are interested in meta-arcade are comparing it to the libraries listed below

qlan3 / Jaxplorer
Jaxplorer is a Jax reinforcement learning (RL) framework for exploring new ideas.
☆12Updated 9 months ago
FLAIROx / jaxirl
Contains JAX implementation of algorithms for inverse reinforcement learning
☆72Updated 8 months ago
danijar / crafter-baselines
Docker containers of baseline agents for the Crafter environment
☆28Updated 3 years ago
hamishs / JAX-RL
JAX implementations of various deep reinforcement learning algorithms.
☆21Updated 2 months ago
FLAIROx / popjym
POPGym Library in JAX
☆11Updated last year
tseyde / decqn
☆36Updated 2 years ago
MyNameIsArko / RL-Flax
Various reinforcement learning algorithms written in Jax + Flax
☆24Updated last year
ethanluoyc / corax
Corax: Core RL in JAX
☆37Updated last year
uoe-agents / reading-group
Propose & vote on reading group papers in the "Discussions" tab.
☆12Updated last year
MarcoMeter / endless-memory-gym
Challenging Memory-based Deep Reinforcement Learning Agents
☆97Updated 5 months ago
cassidylaidlaw / effective-horizon
Code and data for the paper "Bridging RL Theory and Practice with the Effective Horizon"
☆47Updated 9 months ago
JesseFarebro / distributional-sr
Official implementation of the δ-model presented in the ICML 2024 paper "A Distributional Analogue to the Successor Representation".
☆20Updated 5 months ago
jsikyoon / V-MPO_torch
V-MPO torch version with DMLab30 and GTrXL
☆13Updated 4 years ago
tinker495 / jax-baseline
Jax-Baseline is a Reinforcement Learning implementation using JAX and Flax/Haiku libraries, mirroring the functionality of Stable-Baselin…
☆48Updated this week
twitter-research / hyperbolic-rl
☆56Updated 2 years ago
info-structures / ais
This repository contains the code for RL for POMDPs through learning an Approximate Information State.
☆20Updated 3 years ago
CognitiveModeling / THICK
☆16Updated last year
pairlab / vagram
[ICLR 22] Value Gradient weighted Model-Based Reinforcement Learning.
☆24Updated 2 years ago
ikostrikov / dmcgym
☆23Updated 2 years ago
ikostrikov / jaxrl2
☆47Updated 2 years ago
brownirl / lambda_discrepancy
Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy
☆17Updated 5 months ago
ahmed-touati / controllable_agent
☆44Updated last year
boschresearch / ube-mbrl
Model-Based Uncertainty in Value Functions (AISTATS2023)
☆18Updated 2 years ago
Howuhh / sac-n-jax
Single-file SAC-N implementation on jax with flax and equinox. 10x faster than pytorch
☆52Updated last year
RajGhugare19 / stitching-is-combinatorial-generalisation
[ICLR 2024] Closing the Gap between TD Learning and Supervised Learning - A Generalisation Point of View.
☆23Updated last year
YYCAAA / V-MPO_Lunarlander
Simple implementation of V-MPO proposed in https://arxiv.org/abs/1909.12238
☆45Updated 4 years ago
luchris429 / popjaxrl
Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)]
☆100Updated last year
kenjyoung / dreamerv2_JAX
An implementation of DreamerV2 written in JAX, with support for running multiple random seeds of an experiment on a single GPU.
☆15Updated 2 years ago
aliang8 / varibad_jax
☆10Updated 9 months ago
jurgisp / memory-maze
Evaluating long-term memory of reinforcement learning algorithms
☆141Updated last year