TARTRL / TLaunch
Launch programs on multiple hosts.
☆14 · Updated 2 years ago
Alternatives and similar repositories for TLaunch
Users who are interested in TLaunch are comparing it to the libraries listed below.
- A Really Scalable RL Framework to 10k+ CPUs ☆32 · Updated last year
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023), a football game agent ☆58 · Updated last year
- A large-scale multi-modal pre-trained model ☆132 · Updated 2 years ago
- ☆89 · Updated 2 years ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL ☆114 · Updated 10 months ago
- A collection of LLM with RL papers ☆276 · Updated last year
- This is the source code of RPG (Reward-Randomized Policy Gradient) ☆42 · Updated 2 years ago
- Implementation of ICLR 2025 paper "Q-Adapter: Customizing Pre-trained LLMs to New Preferences with Forgetting Mitigation" ☆17 · Updated 9 months ago
- A Massively Parallel Large Scale Self-Play Framework ☆351 · Updated 2 years ago
- Official implementation of "Direct Preference-based Policy Optimization without Reward Modeling" (NeurIPS 2023) ☆42 · Updated 11 months ago
- Learning-based agent for Google Research Football (football game agent) ☆120 · Updated 2 years ago
- Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024… ☆36 · Updated 7 months ago
- [NeurIPS 2023 FMDM Workshop] Skill Reinforcement Learning and Planning for Open-World Long-Horizon Tasks ☆187 · Updated last year
- Learning to Modulate pre-trained Models in RL (Decision Transformer, LoRA, Fine-tuning) ☆59 · Updated 9 months ago
- ☆22 · Updated 5 years ago
- Extreme Q-Learning: Max Entropy RL without Entropy ☆87 · Updated 2 years ago
- ☆79 · Updated last year
- Codebase for "Uni[MASK]: Unified Inference in Sequential Decision Problems" ☆56 · Updated last year
- ☆35 · Updated 2 years ago
- Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted) ☆163 · Updated last year
- Python interface for accessing the near real-world offline reinforcement learning (NeoRL) benchmark datasets ☆123 · Updated 7 months ago
- We perform functional grounding of LLMs' knowledge in BabyAI-Text ☆267 · Updated 10 months ago
- CivRealm is an interactive environment for the open-source strategy game Freeciv-web based on Freeciv, a Civilization-inspired game. ☆117 · Updated 10 months ago
- Source code for the paper "Divergence-Augmented Policy Optimization" ☆37 · Updated 5 years ago
- MTM Masked Trajectory Models for Prediction, Representation, and Control. ☆157 · Updated 2 years ago
- Code for Contrastive Preference Learning (CPL) ☆173 · Updated 7 months ago
- RLA is a tool for managing your RL experiments automatically ☆71 · Updated 2 years ago
- Accelerating Research in Plasticity-Motivated Deep Reinforcement Learning. ☆27 · Updated last month
- A novel parallel UCT algorithm with linear speedup and negligible performance loss. ☆120 · Updated 4 years ago
- The Controllable Agent project trains RL Agents able to optimize any reward function specified in real time, without any further learning… ☆64 · Updated 2 years ago