☆24Aug 9, 2024Updated last year
Alternatives and similar repositories for new-actions-rl
Users that are interested in new-actions-rl are comparing it to the libraries listed below
Sorting:
- CREATE Environment for long-horizon physics-puzzle tasks with diverse tools☆18Nov 22, 2022Updated 3 years ago
- Official implementation of "Know Your Action Set: Learning Action Relations for Reinforcement Learning", Jain et al., ICLR 2022.☆18Mar 16, 2022Updated 3 years ago
- [ICLR 2024] Closing the Gap between TD Learning and Supervised Learning - A Generalisation Point of View.☆23Apr 19, 2024Updated last year
- ☆10Jun 27, 2024Updated last year
- Implements the Messenger environment and EMMA model.☆25Jun 14, 2023Updated 2 years ago
- ☆12Aug 15, 2020Updated 5 years ago
- ☆11Jan 21, 2020Updated 6 years ago
- Official Implementation of `An Optimisation Framework for Unsupervised Environment Design` from RLC 2025☆17Nov 24, 2025Updated 3 months ago
- ☆16Jul 16, 2024Updated last year
- Official Implementation for "In-Context Reinforcement Learning from Noise Distillation"☆34Sep 18, 2024Updated last year
- Pointax: PointMaze Environment for JAX☆26Oct 22, 2025Updated 4 months ago
- Reimplementation of ToMNet with some extensions for RL as well☆14Apr 28, 2018Updated 7 years ago
- Measuring the Signal to Noise Ratio in Language Model Evaluation☆28Aug 19, 2025Updated 6 months ago
- Deep learning models for contextual multi-armed bandit setting☆13May 16, 2021Updated 4 years ago
- VC-FB and MC-FB algorithms from "Zero-Shot Reinforcement Learning from Low Quality Data" (NeurIPS 2024)☆26Jan 14, 2025Updated last year
- ☆19May 20, 2025Updated 9 months ago
- Official codebase for "Sampling For Learnability", published at NeurIPS 2024☆20Oct 21, 2025Updated 4 months ago
- ☆18Aug 20, 2025Updated 6 months ago
- A PyTorch Implementation of PlaNet: A Deep Planning Network for Reinforcement Learning☆12Aug 31, 2020Updated 5 years ago
- Single Episode Policy Transfer in Reinforcement Learning☆17Jun 13, 2022Updated 3 years ago
- Data-Driven NetHack Tools: Datasets (30+) and recurrent-baselines (AWAC, BC, CQL, IQL, REM)☆43Aug 22, 2023Updated 2 years ago
- High-performance JAX-powered simulator for robotic navigation in 2D mazes, optimized for Quality-Diversity algorithm research and benchma…☆20Jun 19, 2025Updated 8 months ago
- Official pytorch implementation for our ICLR 2023 paper "Latent State Marginalization as a Low-cost Approach for Improving Exploration".☆24Feb 9, 2023Updated 3 years ago
- ☆22Jan 14, 2021Updated 5 years ago
- Code of the Paper "Time-Efficient Reinforcement Learning with Stochastic Stateful Policies"☆25May 5, 2024Updated last year
- A dataloader, but for JAX☆20May 17, 2024Updated last year
- Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy☆23Oct 28, 2024Updated last year
- ☆22May 12, 2025Updated 9 months ago
- Author's PyTorch implementation of SR-DICE for marginalized importance sampling☆28Dec 7, 2021Updated 4 years ago
- A collection of Reinforcement Learning implementations with PyTorch☆22Mar 22, 2022Updated 3 years ago
- Accompanying Code for "Flipping Coins to Estimate Pseudocounts for Exploration in Reinforcement Learning", ICML 2023☆22Dec 29, 2023Updated 2 years ago
- ☆19Nov 25, 2022Updated 3 years ago
- HPO and Architecture Benchmarking for RL: Dynamically, Reactive and Efficient☆27Jan 14, 2026Updated last month
- Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents"☆31Oct 12, 2023Updated 2 years ago
- minimal Energy-based transformer☆43Dec 11, 2025Updated 2 months ago
- Multi-agent simulator in Jax for research and teaching in AI & ALife☆31Mar 2, 2026Updated last week
- Fast and memory efficient PyTorch implementation of the Perceiver with FlashAttention.☆31Nov 4, 2024Updated last year
- Multi Task RL Baselines☆262Dec 31, 2021Updated 4 years ago
- A simple, continuous-control environment for OpenAI Gym☆23Jan 1, 2023Updated 3 years ago