Itomigna2/Muesli-lunarlander

Muesli RL algorithm implementation (PyTorch) (LunarLander-v2)

☆20

Alternatives and similar repositories for Muesli-lunarlander

Users who are interested in Muesli-lunarlander are comparing it to the repositories listed below.

YuriCat / MuesliJupyterExample
☆18 · Nov 4, 2021 · Updated 4 years ago
aielawady / relic
☆12 · Sep 7, 2024 · Updated last year
L16H7 / lux-3-comets
Multi-agent Reinforcement Learning, 14th in 701 teams - NeurIPS 2024 Competition
☆15 · Mar 13, 2025 · Updated last year
GGJJack / TCP-Socket
A TCP Socket library for Deno
☆10 · Sep 19, 2023 · Updated 2 years ago
kigawas / flask-scaffold
A scaffold to speed up launching a flask project.
☆15 · Jun 3, 2026 · Updated last month
takuwwwo / LuxAI
☆10 · Dec 3, 2022 · Updated 3 years ago
metekemertas / RobustBisimulation
Learning bisimulation metrics for control, particularly suited to sparse reward settings
☆11 · Feb 28, 2023 · Updated 3 years ago
brytsknguyen / oblam_pgo
An assignment on Loop Closure and Pose Graph Optimization for OBLAM CourseY
☆17 · Mar 18, 2023 · Updated 3 years ago
twni2016 / self-predictive-rl
Bridging State and History Representations: Understanding Self-Predictive RL, ICLR 2024
☆27 · Apr 26, 2026 · Updated 2 months ago
Mozilla-Ocho / formulaic-python
The official Python library for Formulaic
☆18 · Apr 25, 2024 · Updated 2 years ago
camall3n / markov-state-abstractions
Image-based gridworld experiment for learning Markov state abstractions
☆20 · Sep 16, 2024 · Updated last year
quasimetric-learning / torch-quasimetric
PyTorch Package For Quasimetric Learning
☆51 · Oct 31, 2024 · Updated last year
TianhongDai / metaworld-sac
☆12 · Aug 28, 2020 · Updated 5 years ago
andreyd41 / lux3-bot
3rd Place Solution in Lux AI Season 3 (NeurIPS 2024) Competition
☆14 · Mar 16, 2025 · Updated last year
simonw / scrape-fediverse
Git scrapers for scraping the fediverse
☆24 · Updated this week
yunglau / QGFN
QGFN: Controllable Greediness with Action Values - Code
☆11 · May 17, 2024 · Updated 2 years ago
WEIRDLabUW / dispo
Distributional Successor Features Enable Zero-Shot Policy Optimization
☆15 · Apr 11, 2025 · Updated last year
boson-ai / rpgbench-public
Evaluation of LLMs as RPG Game Engines
☆17 · May 15, 2025 · Updated last year
orrivlin / Hindsight-Experience-Replay---Bit-Flipping
Simple bit flipping with sparse rewards using HER, similarly to the original paper
☆39 · Feb 25, 2019 · Updated 7 years ago
vballoli / vit-flax
Implementation of Vision Transformers in Flax
☆18 · Oct 12, 2020 · Updated 5 years ago
ec2604 / ContraBAR
☆13 · May 21, 2023 · Updated 3 years ago
zoenguyenramirez / arc-prize-2024
☆21 · Feb 22, 2025 · Updated last year
ido90 / RobustMetaRL
A variant of Varibad that is robust to difficult tasks
☆11 · Aug 30, 2023 · Updated 2 years ago
gabe00122 / mapox-trainer
Partially Observable Multi-Agent RL with Transformers
☆17 · Updated this week
daisatojp / mpo
PyTorch Implementation of the Maximum a Posteriori Policy Optimisation
☆84 · Nov 19, 2022 · Updated 3 years ago
xihuai18 / awesome-RL-generalization
A list of papers regarding generalization in (deep) reinforcement learning
☆11 · Aug 13, 2023 · Updated 2 years ago
EgOrlukha / MuJoCo-PyTorch
PyTorch implementation of Vanilla PG, TNPG, TRPO, PPO on Mujoco environment
☆12 · Feb 22, 2019 · Updated 7 years ago
awwang10 / sphinx
☆14 · Oct 23, 2025 · Updated 8 months ago
rlglab / minizero
[IEEE ToG] MiniZero: An AlphaZero and MuZero Training Framework
☆136 · Updated this week
facebookresearch / online-dt
Online Decision Transformer
☆275 · Jan 22, 2024 · Updated 2 years ago
mwufi / meta-rl-bandits
A simple RNN meta-learner
☆10 · Dec 17, 2018 · Updated 7 years ago
lucadellalib / actorch
Deep reinforcement learning framework for fast prototyping based on PyTorch
☆14 · Mar 12, 2023 · Updated 3 years ago
LDlabs / seqMultiTaskRNN
sequential learning in orthogonal subspaces
☆14 · Nov 20, 2020 · Updated 5 years ago
WJ2003B / mqe-release
Official Release of Multistep Quasimetric Estimation (MQE)
☆18 · Mar 13, 2026 · Updated 4 months ago
epignatelli / discovering-reinforcement-learning-algorithms
A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a…
☆23 · Dec 22, 2020 · Updated 5 years ago
solidiquis / novavim_go
Vim made in Go that you shouldn't use.
☆17 · Mar 31, 2021 · Updated 5 years ago
IsaiahPressman / kaggle-lux-2024
☆25 · Mar 17, 2025 · Updated last year
ykubo82 / bioCHL
Neurons learn by predicting future activity
☆30 · Nov 4, 2021 · Updated 4 years ago
raphael-sch / map2seq_vln
Code for ORAR Agent for Vision and Language Navigation on Touchdown and map2seq
☆20 · Nov 3, 2023 · Updated 2 years ago