TARTRL / TLaunchLinks
Launch programs on multiple hosts. (多机启动程序)
☆14Updated 2 years ago
Alternatives and similar repositories for TLaunch
Users that are interested in TLaunch are comparing it to the libraries listed below
Sorting:
- A large-scale multi-modal pre-trained model☆133Updated 2 years ago
- A Really Scalable RL Framework to 10k+ CPUs☆38Updated last year
- A Massively Parallel Large Scale Self-Play Framework☆361Updated 3 years ago
- Learning-based agent for Google Research Football (足球游戏智能体)☆123Updated 2 years ago
- A collection of LLM with RL papers☆278Updated last year
- [NeurIPS 2023 FMDM Workshop] Skill Reinforcement Learning and Planning for Open-World Long-Horizon Tasks☆196Updated last year
- TextStarCraft2,a pure language env which support llms play starcraft2☆295Updated 8 months ago
- This is the source code of RPG (Reward-Randomized Policy Gradient)☆42Updated 3 years ago
- ☆91Updated 3 years ago
- RLA is a tool for managing your RL experiments automatically☆72Updated 2 years ago
- ☆25Updated 3 years ago
- Online Decision Transformer☆274Updated last year
- Re-implementations of SOTA RL algorithms.☆136Updated 2 years ago
- A python module designed for agile RL algorithm developing.☆26Updated last year
- ☆12Updated 3 years ago
- A novel parallel UCT algorithm with linear speedup and negligible performance loss.☆123Updated 4 years ago
- Source code for the paper "Divergence-Augmented Policy Optimization"☆37Updated 6 years ago
- [NeurIPS 2022] 1st Place Solution for the 3rd Neural MMO Challenge☆29Updated 3 years ago
- Agent Learning Framework https://alf.readthedocs.io☆356Updated last month
- A parallel framework for population-based multi-agent reinforcement learning.☆546Updated 2 years ago
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023) 足球游戏智能体☆64Updated 2 years ago
- This project is implementation code of AlphaStar☆204Updated last year
- ☆248Updated last year
- Official Code Release for Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games☆54Updated last year
- ☆89Updated 2 years ago
- Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted)☆166Updated 2 years ago
- Code for the paper "Phasic Policy Gradient"☆267Updated 2 years ago
- We perform functional grounding of LLMs' knowledge in BabyAI-Text☆275Updated 2 months ago
- ☆14Updated last year
- RLA is a tool for managing your RL experiments automatically☆31Updated last year