Shanghai-Digital-Brain-Laboratory / BDM-DB1
A large-scale multi-modal pre-trained model
☆128Updated last year
Related projects
Alternatives and complementary repositories for BDM-DB1
- Code accompanying the paper "Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning" (NeurIPS…☆71Updated 2 years ago
- Official Implementation of 'UPDeT: Universal Multi-agent Reinforcement Learning via Policy Decoupling with Transformers' ICLR 2021(spotli…☆130Updated 3 years ago
- ☆86Updated 2 years ago
- Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted)☆151Updated last year
- ☆69Updated 9 months ago
- A collection of offline reinforcement learning algorithms.☆157Updated 5 months ago
- The official implementation of "Transformer in Transformer as Backbone for Deep Reinforcement Learning"☆54Updated 10 months ago
- ☆16Updated 2 years ago
- ☆158Updated last year
- ILSwiss is an Easy-to-run Imitation Learning (IL, or Learning from Demonstration, LfD) and also Reinforcement Learning (RL) framework (te…☆161Updated last year
- [ECCV2022] [T-PAMI] StARformer: Transformer with State-Action-Reward Representations.☆90Updated last year
- ☆41Updated 2 years ago
- Official code repository for Prompt-DT.☆98Updated 2 years ago
- ☆22Updated 10 months ago
- ☆106Updated last year
- ☆28Updated last year
- Overcooked human-AI experiment platform☆30Updated 11 months ago
- Code for "Multi-task Reinforcement Learning with Soft Modularization"☆112Updated 3 years ago
- Policy Expansion for Bridging Offline-to-Online Reinforcement Learning (ICLR23)☆48Updated last year
- ☆39Updated 2 years ago
- A collection of LLM with RL papers☆230Updated 6 months ago
- A Simple, Distributed and Asynchronous Multi-Agent Reinforcement Learning Framework for Google Research Football AI.☆95Updated 10 months ago
- [NeurIPS 2021] CDS achieves remarkable success in challenging benchmarks SMAC and GRF by balancing sharing and diversity.☆84Updated last year
- Official codebase for "B-Pref: Benchmarking Preference-Based Reinforcement Learning"; contains scripts to reproduce experiments.☆114Updated 3 years ago
- Author's PyTorch implementation of the ICLR 2023 paper Behavior Proximal Policy Optimization (BPPO).☆73Updated 11 months ago
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023). Football game agent.☆49Updated last year
- 🚀 A fast safe reinforcement learning library in PyTorch☆165Updated last month
- We extend pymarl2 to pymarl3, equipping the MARL algorithms with permutation invariance and permutation equivariance properties. The enh…☆129Updated 10 months ago
- Reinforcement learning and planning for Minecraft.☆158Updated 8 months ago
- This repo relates to the survey paper <Goal-Conditioned Reinforcement Learning: Problems and Solutions>. We collect widely used benchmar…☆113Updated last year