roger-creus / ale-nlLinks
A framework for evaluating LLMs in Atari games
☆15Updated 2 months ago
Alternatives and similar repositories for ale-nl
Users that are interested in ale-nl are comparing it to the libraries listed below
Sorting:
- [ICML 2025 oral] Network Sparsity Unlocks the Scaling Potential of Deep Reinforcement Learning☆13Updated 3 weeks ago
- [ICLR 2024] Closing the Gap between TD Learning and Supervised Learning - A Generalisation Point of View.☆23Updated last year
- Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations☆103Updated last year
- Goal-Conditioned Reinforcement Learning with JAX☆171Updated last month
- Unified Implementations of Offline Reinforcement Learning Algorithms☆80Updated 2 months ago
- Conservative Q learning in Jax☆54Updated 2 years ago
- Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity"☆75Updated last year
- Official release of the DMControl Generalization Benchmark 2 (DMC-GB2)☆19Updated last year
- Jax/Flax Implementation of TD-MPC2☆62Updated last week
- ☆48Updated 7 months ago
- official implementation for our paper Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning☆99Updated 10 months ago
- [ICLR 2023 Oral] The official implementation of SQL and EQL in "Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Reg…☆44Updated last year
- Official implementation of the BRO algorithm☆45Updated 4 months ago
- Code for the ICLR 2024 spotlight paper: "Learning to Act without Actions" (introducing Latent Action Policies)☆112Updated 10 months ago
- ☆93Updated 4 months ago
- Implementation of Neurips 2023 Paper "Multi Time Scale World Models"☆13Updated 7 months ago
- Accompanying Code for "Flipping Coins to Estimate Pseudocounts for Exploration in Reinforcement Learning", ICML 2023☆19Updated last year
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023)☆85Updated 6 months ago
- ☆47Updated 2 years ago
- ☆25Updated 10 months ago
- A benchmark for offline goal-conditioned RL and offline RL☆191Updated last week
- Code for "TACO: Temporal Latent Action-Driven Contrastive Loss for Visual Reinforcement Learning"☆26Updated last year
- ☆7Updated 2 years ago
- Transformer-based World Models☆82Updated 2 years ago
- ☆19Updated last month
- Code for TRANSDREAMER: REINFORCEMENT LEARNING WITH TRANSFORMER WORLD MODELS☆25Updated last year
- Skeleton for scalable and flexible Jax RL implementations☆82Updated last year
- ☆92Updated last year
- Collection of resources on plasticity loss in deep reinforcement learning☆18Updated 7 months ago
- ☆28Updated last year