yanxue7 / RL-LLM-Prior
☆19 Updated last month
Alternatives and similar repositories for RL-LLM-Prior
Users who are interested in RL-LLM-Prior are comparing it to the libraries listed below.
- SocialJax: sequential social dilemma environments ☆41 Updated last month
- Unified Implementations of Offline Reinforcement Learning Algorithms ☆85 Updated 2 months ago
- MR.Q is a general-purpose model-free reinforcement learning algorithm. ☆105 Updated 3 weeks ago
- Partially Observable Process Gym ☆194 Updated last month
- Multi-Agent Reinforcement Learning with JAX ☆606 Updated last week
- Deep reinforcement learning without experience replay, target networks, or batch updates. ☆256 Updated 4 months ago
- Benchmarking RL generalization in an interpretable way. ☆157 Updated last month
- Hardware-Accelerated Reinforcement Learning Algorithms in pure Jax! ☆230 Updated last month
- Simple (but often Strong) Baselines for POMDPs in PyTorch, ICML 2022 ☆324 Updated 10 months ago
- BenchMARL is a library for benchmarking Multi-Agent Reinforcement Learning (MARL). BenchMARL allows to quickly compare different MARL alg… ☆431 Updated 3 weeks ago
- Online Goal-Conditioned Reinforcement Learning in JAX. ICLR 2025 Spotlight. ☆175 Updated 2 months ago
- ☆234 Updated 8 months ago
- 🏛️ A research-friendly codebase for fast experimentation of single-agent reinforcement learning in JAX • End-to-End JAX RL ☆343 Updated last week
- Object Centric Atari games ☆85 Updated last month
- ☆73 Updated last year
- Datasets with baselines for offline multi-agent reinforcement learning. ☆174 Updated 2 months ago
- (Crafter + NetHack) in JAX. ICML 2024 Spotlight. ☆326 Updated last week
- A tool for aggregating and plotting MARL experiment data. ☆77 Updated 6 months ago
- Awesome In-Context RL: A curated list of In-Context Reinforcement Learning ☆202 Updated last month
- A Survey on Explainable Reinforcement Learning: Concepts, Algorithms, Challenges ☆241 Updated 5 months ago
- Clean single-file implementation of offline RL algorithms in JAX ☆150 Updated 6 months ago
- ☆99 Updated 4 months ago
- Author's PyTorch implementation of TD7 for online and offline RL ☆144 Updated last year
- Simple single-file baselines for Q-Learning in pure-GPU setting ☆173 Updated 4 months ago
- ⚡ Flashbax: Accelerated Replay Buffers in JAX ☆239 Updated 3 months ago
- [NeurIPS 2023] Implementation of Elastic Decision Transformer ☆35 Updated last year
- Fast and flexible multi-agent gridworld reinforcement learning environments. ☆43 Updated 3 months ago
- Synthetic Experience Replay ☆94 Updated last year
- Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)] ☆107 Updated last year
- ☆93 Updated last year