TARTRL / TLaunch
Launch programs on multiple hosts. (多机启动程序)
☆14Updated last year
Alternatives and similar repositories for TLaunch
Users that are interested in TLaunch are comparing it to the libraries listed below
Sorting:
- A Really Scalable RL Framework to 10k+ CPUs☆33Updated last year
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023) 足球游戏智能体☆54Updated last year
- Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024…☆36Updated 5 months ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆112Updated 8 months ago
- This is the source code of RPG (Reward-Randomized Policy Gradient)☆42Updated 2 years ago
- ☆88Updated 2 years ago
- Source code for the paper "Divergence-Augmented Policy Optimization"☆37Updated 5 years ago
- Learning to Modulate pre-trained Models in RL (Decision Transformer, LoRA, Fine-tuning)☆56Updated 7 months ago
- Codebase for "Uni[MASK]: Unified Inference in Sequential Decision Problems"☆55Updated 10 months ago
- ☆29Updated last year
- Minimal RLHF implementation built on top of minGPT.☆28Updated 10 months ago
- A large-scale multi-modal pre-trained model☆131Updated 2 years ago
- This is the source code of Agar.io environment.☆23Updated 3 years ago
- An implementation of MuZero in JAX.☆56Updated 2 years ago
- Evaluating long-term memory of reinforcement learning algorithms☆142Updated last year
- ☆25Updated 2 years ago
- Code for "Masked Autoencoding for Scalable and Generalizable Decision Making". NeurIPS 2022☆44Updated last year
- Official code repository for Prompt-DT.☆109Updated 2 years ago
- Extreme Q-Learning: Max Entropy RL without Entropy☆87Updated 2 years ago
- A platform for intelligent agent learning based on a 3D open-world FPS game developed by Inspir.AI.☆57Updated 2 years ago
- Accelerating Research in Plasticity-Motivated Deep Reinforcement Learning.☆22Updated 2 weeks ago
- Learning-based agent for Google Research Football (足球游戏智能体)☆112Updated 2 years ago
- [NeurIPS 2022 Oral] The official implementation of POR in "A Policy-Guided Imitation Approach for Offline Reinforcement Learning"☆58Updated 2 years ago
- Official implementation for "PEAC: Unsupervised Pre-training for Cross-Embodiment Reinforcement Learning" (NeurIPS 2024)☆13Updated 7 months ago
- A distributed GPU-centric experience replay system for large AI models.☆18Updated last year
- Benchmarked implementations of Offline RL Algorithms.☆72Updated 2 months ago
- ☆14Updated last year
- Implementation of the paper "Q-Adapter: Customizing Pre-trained LLMs to New Preferences with Forgetting Mitigation"☆16Updated 7 months ago
- This repo support auto line plot for multi-seed event file from TensorBoard☆11Updated 2 years ago
- Exploring techniques to generate diverse conventions in multi-agent settings☆14Updated last year