roger-creus / ale-nlLinks
A framework for evaluating LLMs in Atari games
☆15Updated 7 months ago
Alternatives and similar repositories for ale-nl
Users that are interested in ale-nl are comparing it to the libraries listed below
Sorting:
- Online Goal-Conditioned Reinforcement Learning in JAX. ICLR 2025 Spotlight.☆193Updated this week
- Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations☆112Updated last year
- A benchmark for offline goal-conditioned RL and offline RL☆285Updated last month
- ☆112Updated 9 months ago
- Official implementation of the BRO algorithm☆51Updated 10 months ago
- official implementation for our paper Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning (NeurIPS 2023)☆114Updated last year
- Accompanying Code for "Flipping Coins to Estimate Pseudocounts for Exploration in Reinforcement Learning", ICML 2023☆22Updated last year
- Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity"☆80Updated last year
- ReDo: The Dormant Neuron Phenomenon in Deep Reinforcement Learning (pytorch)☆31Updated last year
- Code for TRANSDREAMER: REINFORCEMENT LEARNING WITH TRANSFORMER WORLD MODELS☆27Updated 2 years ago
- Unified Implementations of Offline Reinforcement Learning Algorithms☆125Updated last month
- Foundation Policies with Hilbert Representations (ICML 2024)☆102Updated 2 months ago
- Transformer-based World Models☆86Updated 2 years ago
- [ICLR 2024] Closing the Gap between TD Learning and Supervised Learning - A Generalisation Point of View.☆23Updated last year
- Official release of the DMControl Generalization Benchmark 2 (DMC-GB2)☆22Updated 4 months ago
- Skeleton for scalable and flexible Jax RL implementations☆92Updated 2 years ago
- [ICML 2025 oral] Network Sparsity Unlocks the Scaling Potential of Deep Reinforcement Learning☆39Updated 5 months ago
- ☆116Updated last year
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023)☆91Updated 11 months ago
- ☆56Updated 3 weeks ago
- Implementation of Neurips 2023 Paper "Multi Time Scale World Models"☆13Updated last year
- Public code for "Reinforcement Learning from Passive Data via Latent Intentions"☆89Updated 2 years ago
- ☆28Updated last year
- Jax/Flax Implementation of TD-MPC2☆66Updated this week
- Code release for "HarmonyDream: Task Harmonization Inside World Models" (ICML 2024), https://arxiv.org/abs/2310.00344☆48Updated last year
- Prioritized Generative Replay (ICLR 2025 Oral)☆21Updated 8 months ago
- Conservative Q learning in Jax☆56Updated 2 years ago
- ☆42Updated 2 years ago
- off-policy RL on long sequences☆149Updated 3 months ago
- ☆37Updated 2 months ago