TARTRL / TLaunch
Launch programs on multiple hosts.
☆14Updated 2 years ago
Alternatives and similar repositories for TLaunch
Users that are interested in TLaunch are comparing it to the libraries listed below
- A Really Scalable RL Framework to 10k+ CPUs☆34Updated last year
- A large-scale multi-modal pre-trained model☆132Updated 2 years ago
- A collection of LLM with RL papers☆277Updated last year
- [NeurIPS 2023 FMDM Workshop] Skill Reinforcement Learning and Planning for Open-World Long-Horizon Tasks☆192Updated last year
- ☆89Updated 3 years ago
- [NeurIPS 2023] Large Language Models Are Semi-Parametric Reinforcement Learning Agents☆37Updated last year
- Learning-based agent for Google Research Football (football game agent)☆121Updated 2 years ago
- A Massively Parallel Large Scale Self-Play Framework☆354Updated 2 years ago
- Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024…☆41Updated 10 months ago
- CivRealm is an interactive environment for the open-source strategy game Freeciv-web based on Freeciv, a Civilization-inspired game.☆126Updated last year
- Super-Efficient RLHF Training of LLMs with Parameter Reallocation☆317Updated 5 months ago
- RLA is a tool for managing your RL experiments automatically☆71Updated 2 years ago
- RLHF implementation details of OAI's 2019 codebase☆190Updated last year
- TextStarCraft2, a pure-language environment that lets LLMs play StarCraft II☆288Updated 5 months ago
- ☆84Updated 2 years ago
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023) (football game agent)☆62Updated 2 years ago
- ☆12Updated last year
- This is the source code of RPG (Reward-Randomized Policy Gradient)☆42Updated 3 years ago
- Official code repository for Prompt-DT.☆115Updated 3 years ago
- ☆12Updated 3 years ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆114Updated last year
- Implementation of ICLR 2025 paper "Q-Adapter: Customizing Pre-trained LLMs to New Preferences with Forgetting Mitigation"☆18Updated 11 months ago
- Code for Contrastive Preference Learning (CPL)☆175Updated 10 months ago
- This repo supports automatic line plots for multi-seed TensorBoard event files☆12Updated 3 years ago
- Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted)☆163Updated last year
- Learning to Modulate pre-trained Models in RL (Decision Transformer, LoRA, Fine-tuning)☆59Updated 11 months ago
- Awesome In-Context RL: A curated list of In-Context Reinforcement Learning☆230Updated 2 weeks ago
- ☆25Updated 3 years ago
- A set of competitive environments for Reinforcement Learning research.☆29Updated 2 years ago
- SRL: Scaling Distributed Reinforcement Learning to Over Ten Thousand Cores☆15Updated last year