yanxue7 / RL-LLM-PriorLinks
☆21Updated 2 months ago
Alternatives and similar repositories for RL-LLM-Prior
Users that are interested in RL-LLM-Prior are comparing it to the libraries listed below
Sorting:
- SocialJax: sequential social dilemma environments☆44Updated last month
- Benchmarking RL generalization in an interpretable way.☆161Updated 2 months ago
- Implemention of the Decision-Pretrained Transformer (DPT) from the paper Supervised Pretraining Can Learn In-Context Reinforcement Learni…☆71Updated last year
- Object Centric Atari games☆88Updated last month
- Unified Implementations of Offline Reinforcement Learning Algorithms☆92Updated 4 months ago
- MR.Q is a general-purpose model-free reinforcement learning algorithm.☆109Updated 2 months ago
- Online Goal-Conditioned Reinforcement Learning in JAX. ICLR 2025 Spotlight.☆180Updated 3 months ago
- ☆104Updated 6 months ago
- Simple single-file baselines for Q-Learning in pure-GPU setting☆182Updated 5 months ago
- ☆43Updated 2 years ago
- Partially Observable Process Gym☆198Updated 2 months ago
- LAMBDA is a model-based reinforcement learning agent that uses Bayesian world models for safe policy optimization☆35Updated 2 years ago
- [ICML 2025 oral] Network Sparsity Unlocks the Scaling Potential of Deep Reinforcement Learning☆32Updated 2 months ago
- Code and data for the paper "Bridging RL Theory and Practice with the Effective Horizon"☆48Updated last year
- Official Implementation of "Can Learned Optimization Make Reinforcement Learning Less Difficult"☆27Updated 4 months ago
- Evaluating long-term memory of reinforcement learning algorithms☆146Updated 2 years ago
- Challenging Memory-based Deep Reinforcement Learning Agents☆103Updated 10 months ago
- ☆235Updated 9 months ago
- Extreme Q-Learning: Max Entropy RL without Entropy☆87Updated 2 years ago
- Learning diverse options through the Laplacian representation.☆23Updated last year
- A tool for aggregating and plotting MARL experiment data.☆77Updated 7 months ago
- Fast and flexible multi-agent gridworld reinforcement learning environments.☆44Updated 5 months ago
- official implementation for our paper Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning (NeurIPS 2023)☆107Updated last year
- Author's PyTorch implementation of TD7 for online and offline RL☆147Updated last year
- This is a minimal example to demonstrate how multi-agent reinforcement learning with differentiable communication channels and centralize…☆43Updated last year
- off-policy RL on long sequences☆135Updated 2 weeks ago
- Synthetic Experience Replay☆100Updated last year
- Continual RL with wold models (Collas 2023)☆19Updated last year
- Datasets with baselines for Offline MARL.☆177Updated last week
- Collection of resources on plasticity loss in deep reinforcement learning☆19Updated 9 months ago