openpsi-projects / srl
SRL: Scaling Distributed Reinforcement Learning to Over Ten Thousand Cores
☆13 Updated 4 months ago
Related projects:
- Implementation of Multi-Game Decision Transformers in PyTorch ☆42 Updated last year
- A minimal and stable PPO. ☆96 Updated 7 months ago
- ☆86 Updated 2 years ago
- A Really Scalable RL Framework to 10k+ CPUs ☆15 Updated 6 months ago
- Transformer-based World Models ☆66 Updated last year
- Learning to Modulate pre-trained Models in RL (Decision Transformer, LoRA, Fine-tuning) ☆49 Updated 8 months ago
- Official implementation for our paper Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning ☆68 Updated last month
- Official code repository for Prompt-DT. ☆93 Updated 2 years ago
- Paper Collection for Batch RL with brief introductions. ☆84 Updated 2 years ago
- A simple and scalable agent for training adaptive policies with sequence-based RL ☆79 Updated this week
- Extreme Q-Learning: Max Entropy RL without Entropy ☆78 Updated last year
- Code release for Efficient Planning in a Compact Latent Action Space (ICLR 2023) https://arxiv.org/abs/2208.10291. ☆90 Updated last year
- Repo for Implicit Diffusion Q-Learning ☆85 Updated 9 months ago
- 🔥 Datasets and env wrappers for offline safe reinforcement learning ☆65 Updated last week
- Author's PyTorch implementation of our ICLR 2024 paper "Uni-O4" ☆29 Updated 4 months ago
- When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral) ☆49 Updated 8 months ago
- [NeurIPS 2022 Oral] The official implementation of POR in "A Policy-Guided Imitation Approach for Offline Reinforcement Learning" ☆54 Updated last year
- Official code for "Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning". ☆37 Updated 5 months ago
- ☆43 Updated 3 months ago
- Benchmarked implementations of Offline RL Algorithms. ☆62 Updated 4 months ago
- ☆23 Updated last year
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023); a football game agent ☆47 Updated last year
- ☆20 Updated 11 months ago
- [ICML'2023] "AdaptDiffuser: Diffusion Models as Adaptive Self-evolving Planners" ☆45 Updated 11 months ago
- Official codebase for Redeeming Intrinsic Rewards via Constrained Policy Optimization ☆77 Updated last year
- Author's PyTorch implementation of TD7 for online and offline RL ☆108 Updated last year
- Code for "Efficient Offline Policy Optimization with a Learned Model", ICLR 2023 ☆28 Updated last year
- ExORL: Exploratory Data for Offline Reinforcement Learning ☆100 Updated 2 years ago
- 🤖 Elegant implementations of offline safe RL algorithms in PyTorch ☆161 Updated last week
- This repo relates to the survey paper <Goal-Conditioned Reinforcement Learning: Problems and Solutions>. We collect widely used benchmar… ☆104 Updated last year