TARTRL / TLaunch
Launch programs on multiple hosts. (多机启动程序)
☆14Updated last year
Related projects ⓘ
Alternatives and complementary repositories for TLaunch
- ☆86Updated 2 years ago
- A Really Scalable RL Framework to 10k+ CPUs☆18Updated 8 months ago
- A large-scale multi-modal pre-trained model☆128Updated last year
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023) 足球游戏智能体☆49Updated last year
- Official implementation for "PEAC: Unsupervised Pre-training for Cross-Embodiment Reinforcement Learning" (NeurIPS 2024)☆11Updated last month
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆105Updated 3 months ago
- An RL-Friendly Vision-Language Model for Minecraft☆26Updated last month
- This is the source code of Agar.io environment.☆23Updated 3 years ago
- Official code repository for Prompt-DT.☆98Updated 2 years ago
- This is the source code of RPG (Reward-Randomized Policy Gradient)☆43Updated 2 years ago
- Source code for the paper "Divergence-Augmented Policy Optimization"☆37Updated 4 years ago
- ☆80Updated 5 months ago
- Codebase for "Uni[MASK]: Unified Inference in Sequential Decision Problems"☆54Updated 4 months ago
- Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024…☆30Updated this week
- Learning to Modulate pre-trained Models in RL (Decision Transformer, LoRA, Fine-tuning)☆52Updated last month
- a simple and scalable agent for training adaptive policies with sequence-based RL☆92Updated this week
- SRL: Scaling Distributed Reinforcement Learning to Over Ten Thousand Cores☆13Updated 6 months ago
- Reinforcement learning and planning for Minecraft.☆158Updated 8 months ago
- Implementation of Multi-Game Decision Transformers in PyTorch☆43Updated last year
- Codes accompanying the paper "Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning" (NeurIPS…☆71Updated 2 years ago
- [ICML 2024, Spotlight] EfficientZero V2: Mastering Discrete and Continuous Control with Limited Data☆44Updated 3 months ago
- Super-Efficient RLHF Training of LLMs with Parameter Reallocation☆123Updated this week
- ☆45Updated 2 years ago
- A novel parallel UCT algorithm with linear speedup and negligible performance loss.☆112Updated 3 years ago
- A collection of LLM with RL papers☆230Updated 6 months ago
- ☆201Updated this week
- RLA is a tool for managing your RL experiments automatically☆70Updated last year
- ☆21Updated 4 years ago
- Transformer-based World Models☆71Updated last year