yanxue7 / RL-LLM-Prior
☆24Updated 5 months ago
Alternatives and similar repositories for RL-LLM-Prior
Users who are interested in RL-LLM-Prior are comparing it to the libraries listed below.
- Implementation of the Decision-Pretrained Transformer (DPT) from the paper Supervised Pretraining Can Learn In-Context Reinforcement Learni…☆79Updated last year
- ☆242Updated last year
- Benchmarking RL generalization in an interpretable way.☆170Updated last week
- LAMBDA is a model-based reinforcement learning agent that uses Bayesian world models for safe policy optimization☆38Updated 2 years ago
- Partially Observable Process Gym☆207Updated 5 months ago
- SocialJax: sequential social dilemma environments☆53Updated last week
- MR.Q is a general-purpose model-free reinforcement learning algorithm.☆123Updated 5 months ago
- Unified Implementations of Offline Reinforcement Learning Algorithms☆125Updated last month
- Object Centric Atari games☆94Updated last month
- This is a minimal example to demonstrate how multi-agent reinforcement learning with differentiable communication channels and centralize…☆43Updated 2 years ago
- Official repository of the paper "FightLadder: A Benchmark for Competitive Multi-Agent Reinforcement Learning"☆34Updated last year
- Author's PyTorch implementation of TD7 for online and offline RL☆154Updated 2 years ago
- 🤖 Elegant implementations of offline safe RL algorithms in PyTorch☆220Updated last year
- ☆42Updated 2 years ago
- Datasets with baselines for Offline MARL.☆188Updated 3 weeks ago
- Simple (but often Strong) Baselines for POMDPs in PyTorch, ICML 2022☆338Updated last year
- Representation Learning for RL☆128Updated 2 years ago
- [ICML 2025 oral] Network Sparsity Unlocks the Scaling Potential of Deep Reinforcement Learning☆39Updated 5 months ago
- A tool for aggregating and plotting MARL experiment data.☆80Updated 10 months ago
- Simple single-file baselines for Q-Learning in pure-GPU setting☆222Updated this week
- Implementations of Multi-Task and Meta-Learning baselines for the Metaworld benchmark☆26Updated 3 months ago
- [NeurIPS 2023] Implementation of Elastic Decision Transformer☆36Updated 2 years ago
- Fast and flexible multi-agent gridworld reinforcement learning environments.☆46Updated 8 months ago
- ReDo: The Dormant Neuron Phenomenon in Deep Reinforcement Learning (PyTorch)☆31Updated last year
- ☆112Updated 9 months ago
- Code and data for the paper "Bridging RL Theory and Practice with the Effective Horizon"☆50Updated last year
- Synthetic Experience Replay☆106Updated last year
- ☆31Updated 2 years ago
- PyTorch implementation of First Order Constrained Optimization in Policy Space (FOCOPS).☆29Updated 3 years ago
- A list of papers regarding generalization in (deep) reinforcement learning☆153Updated 2 years ago