roger-creus / ale-nl
A framework for evaluating LLMs in Atari games
☆15Updated 2 weeks ago
Alternatives and similar repositories for ale-nl:
Users who are interested in ale-nl are comparing it to the libraries listed below.
- [ICLR 2024] Closing the Gap between TD Learning and Supervised Learning - A Generalisation Point of View.☆23Updated last year
- ☆44Updated 5 months ago
- Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations☆100Updated 11 months ago
- Goal-Conditioned Reinforcement Learning with JAX☆153Updated last week
- A benchmark for offline goal-conditioned RL and offline RL☆162Updated last month
- Official release of the DMControl Generalization Benchmark 2 (DMC-GB2)☆19Updated 11 months ago
- ☆47Updated 2 years ago
- Accompanying Code for "Flipping Coins to Estimate Pseudocounts for Exploration in Reinforcement Learning", ICML 2023☆19Updated last year
- ☆82Updated 2 months ago
- Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity"☆70Updated 11 months ago
- Implementation of NeurIPS 2023 Paper "Multi Time Scale World Models"☆13Updated 6 months ago
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023)☆82Updated 5 months ago
- ReDo: The Dormant Neuron Phenomenon in Deep Reinforcement Learning (PyTorch)☆26Updated 6 months ago
- Code for the ICLR 2024 spotlight paper: "Learning to Act without Actions" (introducing Latent Action Policies)☆99Updated 9 months ago
- Official implementation for our paper Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning☆92Updated 9 months ago
- Code for TransDreamer: Reinforcement Learning with Transformer World Models☆25Updated last year
- Transformer-based World Models☆81Updated 2 years ago
- Unified Implementations of Offline Reinforcement Learning Algorithms☆67Updated last week
- ☆24Updated 8 months ago
- JAX/Flax Implementation of TD-MPC2☆61Updated last week
- Simple maze environments using mujoco-py☆54Updated last year
- ☆10Updated last year
- Official implementation of the BRO algorithm☆42Updated 3 months ago
- ☆18Updated 3 months ago
- Conservative Q-learning in JAX☆53Updated 2 years ago
- ☆11Updated 2 years ago
- Foundation Policies with Hilbert Representations (ICML 2024)☆85Updated last year
- [AAMAS 2024] Code for the paper "MaDi: Learning to Mask Distractions for Generalization in Visual Deep RL"☆25Updated last month
- Code release for "HarmonyDream: Task Harmonization Inside World Models" (ICML 2024), https://arxiv.org/abs/2310.00344☆38Updated 10 months ago
- Skeleton for scalable and flexible JAX RL implementations☆80Updated last year