roger-creus / ale-nlLinks

A framework for evaluating LLMs in Atari games

☆15

Alternatives and similar repositories for ale-nl

Users that are interested in ale-nl are comparing it to the libraries listed below

Sorting:

seohongpark / ogbench
A benchmark for offline goal-conditioned RL and offline RL
☆196Updated 2 weeks ago
MichalBortkiewicz / JaxGCRL
Online Goal-Conditioned Reinforcement Learning in JAX. ICLR 2025 Spotlight.
☆173Updated 2 months ago
conglu1997 / v-d4rl
Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations
☆103Updated last year
adityab / CrossQ
Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity"
☆76Updated last year
SonyResearch / simba
☆99Updated 4 months ago
Improbable-AI / random-latent-exploration
☆25Updated 10 months ago
EmptyJackson / unifloral
Unified Implementations of Offline Reinforcement Learning Algorithms
☆85Updated 2 months ago
jypark0 / bmil
☆11Updated 2 years ago
nakamotoo / Cal-QL
official implementation for our paper Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning
☆102Updated 11 months ago
Probabilistic-and-Interactive-ML / awesome-plasticity-loss
Collection of resources on plasticity loss in deep reinforcement learning
☆19Updated 8 months ago
FrankZheng2022 / TACO
Code for "TACO: Temporal Latent Action-Driven Contrastive Loss for Visual Reinforcement Learning"
☆26Updated last year
nicklashansen / dmcontrol-generalization-benchmark
DMControl Generalization Benchmark
☆173Updated last year
OffDynamicsRL / off-dynamics-rl
☆49Updated 7 months ago
sfujim / TD7
Author's PyTorch implementation of TD7 for online and offline RL
☆144Updated last year
automl / CARL
Benchmarking RL generalization in an interpretable way.
☆157Updated last month
ikostrikov / rlpd
☆301Updated 2 years ago
lilucse / SparseNetwork4DRL
[ICML 2025 oral] Network Sparsity Unlocks the Scaling Potential of Deep Reinforcement Learning
☆19Updated last month
timoklein / redo
ReDo: The Dormant Neuron Phenomenon in Deep Reinforcement Learning (pytorch)
☆28Updated 8 months ago
jrobine / twm
Transformer-based World Models
☆83Updated 2 years ago
ShaneFlandermeyer / tdmpc2-jax
Jax/Flax Implementation of TD-MPC2
☆65Updated 3 weeks ago
RajGhugare19 / stitching-is-combinatorial-generalisation
[ICLR 2024] Closing the Gap between TD Learning and Supervised Learning - A Generalisation Point of View.
☆23Updated last year
young-geng / CQL
Conservative Q Learning on top of SAC
☆132Updated 2 years ago
naumix / BiggerRegularizedOptimistic
Official implementation of the BRO algorithm
☆46Updated 5 months ago
weipu-zhang / STORM
☆93Updated last year
geon-hyeong / imitation-dice
☆7Updated 2 years ago
jsikyoon / dreamer-torch
Pytorch version of Dreamer, which follows the original TF v2 codes.
☆129Updated 3 years ago
jhejna / inverse-preference-learning
☆41Updated 2 years ago
XuGW-Kevin / DrM
DrM, a visual RL algorithm, minimizes the dormant ratio to guide exploration-exploitation trade-offs, achieving significant improvements …
☆76Updated last year
mttga / purejaxql
Simple single-file baselines for Q-Learning in pure-GPU setting
☆173Updated 3 months ago
aalmuzairee / dmcgb2
Official release of the DMControl Generalization Benchmark 2 (DMC-GB2)
☆20Updated last year