TARTRL / TLaunchLinks
Launch programs on multiple hosts. (多机启动程序)
☆14Updated 2 years ago
Alternatives and similar repositories for TLaunch
Users that are interested in TLaunch are comparing it to the libraries listed below
Sorting:
- A Really Scalable RL Framework to 10k+ CPUs☆38Updated last year
- A collection of LLM with RL papers☆278Updated last year
- A large-scale multi-modal pre-trained model☆133Updated 2 years ago
- Learning-based agent for Google Research Football (足球游戏智能体)☆123Updated 2 years ago
- [NeurIPS 2023 FMDM Workshop] Skill Reinforcement Learning and Planning for Open-World Long-Horizon Tasks☆198Updated last year
- ☆25Updated 3 years ago
- RLA is a tool for managing your RL experiments automatically☆72Updated 2 years ago
- TextStarCraft2,a pure language env which support llms play starcraft2☆301Updated 9 months ago
- A Massively Parallel Large Scale Self-Play Framework☆361Updated 3 years ago
- This is the source code of RPG (Reward-Randomized Policy Gradient)☆42Updated 3 years ago
- ☆14Updated last year
- ☆91Updated 3 years ago
- ☆12Updated 3 years ago
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023) 足球游戏智能体☆67Updated 2 years ago
- ☆12Updated last year
- Online Decision Transformer☆273Updated 2 years ago
- Personal Repo to keep track of RL papers☆31Updated 4 years ago
- Re-implementations of SOTA RL algorithms.☆136Updated 2 years ago
- Source code for the paper "Divergence-Augmented Policy Optimization"☆37Updated 6 years ago
- A python module designed for agile RL algorithm developing.☆26Updated last year
- CivRealm is an interactive environment for the open-source strategy game Freeciv-web based on Freeciv, a Civilization-inspired game.☆137Updated last year
- We perform functional grounding of LLMs' knowledge in BabyAI-Text☆275Updated 3 months ago
- Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted)☆167Updated 2 years ago
- PyTorch implementations for Offline Preference-Based RL (PbRL) algorithms☆21Updated 10 months ago
- ☆250Updated last year
- Official Code Release for Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games☆54Updated last year
- ☆148Updated last year
- Keeping track of RL experiments☆166Updated 3 years ago
- A list of papers regarding generalization in (deep) reinforcement learning☆153Updated 2 years ago
- Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024…☆42Updated last year