TARTRL / TLaunch
Launch programs on multiple hosts.
☆14 Updated last year
Alternatives and similar repositories for TLaunch:
Users interested in TLaunch are comparing it to the libraries listed below:
- A Really Scalable RL Framework to 10k+ CPUs ☆25 Updated last year
- An RL-Friendly Vision-Language Model for Minecraft ☆30 Updated 4 months ago
- Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024… ☆33 Updated 3 months ago
- ☆88 Updated 2 years ago
- CivRealm is an interactive environment for the open-source strategy game Freeciv-web based on Freeciv, a Civilization-inspired game. ☆106 Updated 6 months ago
- A large-scale multi-modal pre-trained model ☆130 Updated 2 years ago
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023), a football game agent ☆51 Updated last year
- Reinforcement learning and planning for Minecraft. ☆170 Updated last year
- Minimal RLHF implementation built on top of minGPT. ☆29 Updated 8 months ago
- RLHF implementation details of OAI's 2019 codebase ☆183 Updated last year
- MTM Masked Trajectory Models for Prediction, Representation, and Control. ☆150 Updated last year
- Source code for the paper "Divergence-Augmented Policy Optimization" ☆37 Updated 5 years ago
- Code release for Efficient Planning in a Compact Latent Action Space (ICLR2023) https://arxiv.org/abs/2208.10291. ☆104 Updated last year
- Learning-based agent for Google Research Football (football game agent) ☆111 Updated last year
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL ☆109 Updated 6 months ago
- Collection of papers and resources for data augmentation (DA) in visual reinforcement learning (RL). ☆74 Updated 11 months ago
- This is the source code of RPG (Reward-Randomized Policy Gradient) ☆43 Updated 2 years ago
- The Controllable Agent project trains RL Agents able to optimize any reward function specified in real time, without any further learning… ☆61 Updated last year
- This is the source code of Agar.io environment. ☆23 Updated 3 years ago
- source code for AAMAS 2023 Imperfect-information Card Game Competition ☆12 Updated 11 months ago
- Official code repository for Prompt-DT. ☆106 Updated 2 years ago
- Baselines for Neural MMO -- new users should treat this repo as a starter project ☆46 Updated 7 months ago
- Learning to Modulate pre-trained Models in RL (Decision Transformer, LoRA, Fine-tuning) ☆54 Updated 5 months ago
- [NeurIPS 2023] Large Language Models Are Semi-Parametric Reinforcement Learning Agents ☆34 Updated 10 months ago
- Official implementation of "Direct Preference-based Policy Optimization without Reward Modeling" (NeurIPS 2023) ☆41 Updated 7 months ago
- We perform functional grounding of LLMs' knowledge in BabyAI-Text ☆246 Updated 6 months ago
- Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted) ☆159 Updated last year
- ☆21 Updated 4 years ago
- Selected list of papers on World Models that I found interesting and/or useful. ☆20 Updated last month
- A minimal and stable PPO. ☆133 Updated last year