TARTRL / TLaunchLinks
Launch programs on multiple hosts. (多机启动程序)
☆14Updated 2 years ago
Alternatives and similar repositories for TLaunch
Users that are interested in TLaunch are comparing it to the libraries listed below
Sorting:
- A Really Scalable RL Framework to 10k+ CPUs☆38Updated last year
- A large-scale multi-modal pre-trained model☆132Updated 2 years ago
- A collection of LLM with RL papers☆278Updated last year
- [NeurIPS 2023 FMDM Workshop] Skill Reinforcement Learning and Planning for Open-World Long-Horizon Tasks☆192Updated last year
- Learning-based agent for Google Research Football (足球游戏智能体)☆121Updated 2 years ago
- ☆12Updated last year
- ☆91Updated 3 years ago
- RLA is a tool for managing your RL experiments automatically☆71Updated 2 years ago
- ☆25Updated 3 years ago
- A novel parallel UCT algorithm with linear speedup and negligible performance loss.☆122Updated 4 years ago
- This is the source code of RPG (Reward-Randomized Policy Gradient)☆42Updated 3 years ago
- A Massively Parallel Large Scale Self-Play Framework☆358Updated 2 years ago
- ☆12Updated 3 years ago
- A python module designed for agile RL algorithm developing.☆26Updated last year
- Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024…☆41Updated last year
- TextStarCraft2,a pure language env which support llms play starcraft2☆292Updated 7 months ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆117Updated last year
- This repo support auto line plot for multi-seed event file from TensorBoard☆12Updated 3 years ago
- PyTorch implementations for Offline Preference-Based RL (PbRL) algorithms☆21Updated 8 months ago
- The Official Code for Offline Model-based Adaptable Policy Learning (NeurIPS'21 & TPAMI)☆24Updated last year
- Re-implementations of SOTA RL algorithms.☆135Updated 2 years ago
- [NeurIPS 2023] Large Language Models Are Semi-Parametric Reinforcement Learning Agents☆38Updated last year
- Official code repository for Prompt-DT.☆117Updated 3 years ago
- [AAAI 2025 Oral] Official code for "RAT: Adversarial Attacks on Deep Reinforcement Agents for Targeted Behaviors"☆31Updated 9 months ago
- Personal Repo to keep track of RL papers☆31Updated 4 years ago
- Super-Efficient RLHF Training of LLMs with Parameter Reallocation☆324Updated 7 months ago
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023) 足球游戏智能体☆62Updated 2 years ago
- C++-based high-performance parallel environment execution engine (vectorized env) for general RL environments.☆1,208Updated last year
- RLHF implementation details of OAI's 2019 codebase☆196Updated last year
- Source code for the paper "Divergence-Augmented Policy Optimization"☆37Updated 6 years ago