TARTRL / TLaunch
Launch programs on multiple hosts.
☆14 Updated last year
Alternatives and similar repositories for TLaunch
Users who are interested in TLaunch compare it with the repositories listed below.
- A Really Scalable RL Framework to 10k+ CPUs ☆33 Updated last year
- ☆89 Updated 2 years ago
- Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024… ☆36 Updated 7 months ago
- Source code for the paper "Divergence-Augmented Policy Optimization" ☆37 Updated 5 years ago
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023) (football game agent) ☆58 Updated last year
- This is the source code of RPG (Reward-Randomized Policy Gradient) ☆42 Updated 2 years ago
- Minimal RLHF implementation built on top of minGPT. ☆29 Updated 11 months ago
- SRL: Scaling Distributed Reinforcement Learning to Over Ten Thousand Cores ☆15 Updated last year
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL ☆113 Updated 10 months ago
- A large-scale multi-modal pre-trained model ☆132 Updated 2 years ago
- A distributed GPU-centric experience replay system for large AI models. ☆18 Updated last year
- Reproduction of Dreamerv1 and v2 in pytorch for deepmind control suite ☆39 Updated 2 years ago
- Codebase for "Uni[MASK]: Unified Inference in Sequential Decision Problems" ☆55 Updated 11 months ago
- Learning to Modulate pre-trained Models in RL (Decision Transformer, LoRA, Fine-tuning) ☆58 Updated 8 months ago
- Learning-based agent for Google Research Football (football game agent) ☆120 Updated 2 years ago
- The Controllable Agent project trains RL Agents able to optimize any reward function specified in real time, without any further learning… ☆64 Updated last year
- A novel parallel UCT algorithm with linear speedup and negligible performance loss. ☆120 Updated 4 years ago
- ☆18 Updated 6 years ago
- ☆30 Updated last year
- Official code repository for Prompt-DT. ☆112 Updated 2 years ago
- Extreme Q-Learning: Max Entropy RL without Entropy ☆88 Updated 2 years ago
- Code release for Efficient Planning in a Compact Latent Action Space (ICLR2023) https://arxiv.org/abs/2208.10291. ☆108 Updated 2 years ago
- CivRealm is an interactive environment for the open-source strategy game Freeciv-web based on Freeciv, a Civilization-inspired game. ☆114 Updated 9 months ago
- ☆80 Updated 7 months ago
- A minimal and stable PPO. ☆138 Updated last year
- Accelerating Research in Plasticity-Motivated Deep Reinforcement Learning. ☆27 Updated last week
- RLA is a tool for managing your RL experiments automatically ☆71 Updated 2 years ago
- ☆35 Updated 2 years ago
- Generalized Decision Transformer for Offline Hindsight Information Matching (ICLR2022) ☆67 Updated 2 years ago
- Benchmarked implementations of Offline RL Algorithms. ☆73 Updated 3 months ago