Optim4RL is a JAX framework for learning to optimize for reinforcement learning.
☆28 Nov 27, 2024 Updated last year
Alternatives and similar repositories for optim4rl
Users who are interested in optim4rl are comparing it to the libraries listed below.
- Jaxplorer is a Jax reinforcement learning (RL) framework for exploring new ideas. ☆13 Jul 19, 2024 Updated last year
- Code for "Efficient Offline Policy Optimization with a Learned Model", ICLR 2023 ☆30 Jul 18, 2023 Updated 2 years ago
- An environment based on XLA for deep learning compiler optimization research. ☆24 Mar 7, 2023 Updated 2 years ago
- Co-Adaptation of Algorithmic and Implementational Innovations in Inference-based Deep Reinforcement Learning (NeurIPS2021) ☆20 Oct 25, 2021 Updated 4 years ago
- Average-Reward Reinforcement Learning with Trust Region Methods ☆11 Oct 17, 2022 Updated 3 years ago
- Code for "Planning Goals for Exploration", ICLR2023 Spotlight. An unsupervised RL agent for hard exploration tasks. ☆82 May 13, 2024 Updated last year
- Sandbox environment for generalizable agent research ☆27 Aug 19, 2022 Updated 3 years ago
- Reinforcement learning library in JAX. ☆101 Oct 22, 2023 Updated 2 years ago
- Curated list of JAX Resources and Packages ☆29 Updated this week
- Code of the paper "Universal Morphology Control via Contextual Modulation" at ICML 2023 ☆13 Aug 3, 2023 Updated 2 years ago
- Learning from preferences is a common paradigm for fine-tuning language models. Yet, many algorithmic design decisions come into play. Ou… ☆32 Apr 20, 2024 Updated last year
- A benchmark library for Dynamic Algorithm Configuration. ☆34 Feb 23, 2026 Updated last week
- A System for Morphology-Task Generalization via Unified Representation and Behavior Distillation (ICLR2023) ☆14 Feb 3, 2023 Updated 3 years ago
- V-MPO torch version with DMLab30 and GTrXL ☆13 Mar 1, 2021 Updated 5 years ago
- ☆37 Apr 27, 2023 Updated 2 years ago
- Memory-Based Meta-Learning on Non-Stationary Distributions ☆17 Mar 14, 2024 Updated last year
- SkillHack: A Benchmark for Skill Transfer in Open-Ended Reinforcement Learning ☆17 Oct 23, 2022 Updated 3 years ago
- A reinforcement learning algorithm for the 2048 game ☆20 Mar 25, 2014 Updated 11 years ago
- Minimal Decision Transformer Implementation written in Jax (Flax). ☆17 Aug 8, 2022 Updated 3 years ago
- Intrinsic Reward Matching (IRM) implementation (from Adeniji and Xie et al 2022) ☆42 Jan 13, 2024 Updated 2 years ago
- Code to accompany the paper "The Information Geometry of Unsupervised Reinforcement Learning" ☆20 Oct 6, 2021 Updated 4 years ago
- Bridging State and History Representations: Understanding Self-Predictive RL, ICLR 2024 ☆24 Apr 7, 2024 Updated last year
- Author's PyTorch Implementation of Deep Homomorphic Policy Gradient (DHPG) - NeurIPS 2022 and JMLR 2024 ☆24 Apr 8, 2024 Updated last year
- ☆17 Dec 4, 2019 Updated 6 years ago
- Vectorization techniques for fast population-based training. ☆57 Aug 12, 2022 Updated 3 years ago
- ☆59 Sep 22, 2022 Updated 3 years ago
- RL Environments in JAX 🌍 ☆864 May 30, 2025 Updated 9 months ago
- ☆27 Jun 6, 2024 Updated last year
- ☆19 Nov 25, 2022 Updated 3 years ago
- ☆21 Dec 22, 2020 Updated 5 years ago
- 🪐 The Sebulba architecture to scale reinforcement learning on Cloud TPUs in JAX ☆61 Oct 23, 2023 Updated 2 years ago
- (NeurIPS '22) LISA: Learning Interpretable Skill Abstractions - A framework for unsupervised skill learning using Imitation ☆29 Feb 22, 2023 Updated 3 years ago
- (ICLR 2025) Mitigating Information Loss in Tree-Based Reinforcement Learning via Direct Optimization ☆27 Sep 5, 2024 Updated last year
- Image-based gridworld experiment for learning Markov state abstractions ☆21 Sep 16, 2024 Updated last year
- An implementation of MuZero in JAX. ☆57 Nov 8, 2022 Updated 3 years ago
- A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a… ☆23 Dec 22, 2020 Updated 5 years ago
- Rewarded soups official implementation ☆62 Sep 27, 2023 Updated 2 years ago
- qmix ☆23 May 28, 2020 Updated 5 years ago
- Official code repo for paper: Hybrid RL: Using both offline and online data can make RL efficient. ☆25 Feb 16, 2023 Updated 3 years ago