mindspore-lab / mindrl
A high-performance, scalable MindSpore reinforcement learning framework.
☆45Updated 8 months ago
Alternatives and similar repositories for mindrl:
Users who are interested in mindrl are comparing it to the libraries listed below.
- A Really Scalable RL Framework to 10k+ CPUs☆25Updated last year
- A simple 2D ball collision engine.☆12Updated last year
- ☆38Updated 10 months ago
- TextStarCraft2, a pure-language environment that lets LLMs play StarCraft II☆250Updated 2 months ago
- Enabling Mixed Opponent Strategy Script and Self-play on SMAC☆22Updated last month
- An easier PyTorch deep reinforcement learning library.☆188Updated 2 months ago
- ☆31Updated 2 months ago
- ☆161Updated last year
- GitHub's code repository is all you need☆345Updated last year
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023). Football game agent.☆51Updated last year
- ☆30Updated last year
- A distributed GPU-centric experience replay system for large AI models.☆17Updated last year
- rl-papers☆48Updated last year
- Reinforcement learning and planning for Minecraft.☆170Updated last year
- ☆62Updated last year
- Super-Efficient RLHF Training of LLMs with Parameter Reallocation☆237Updated 2 months ago
- Launch programs on multiple hosts.☆14Updated last year
- ☆16Updated 3 years ago
- Online Decision Transformer☆249Updated last year
- A Massively Parallel Large Scale Self-Play Framework☆334Updated 2 years ago
- A large-scale multi-modal pre-trained model☆130Updated 2 years ago
- A comprehensive list of PAPERS, CODEBASES, and DATASETS on Decision Making using Foundation Models including LLMs and VLMs.☆360Updated 10 months ago
- A parallel framework for population-based multi-agent reinforcement learning.☆518Updated last year
- A New Approach to Solving SMAC Task: Generating Decision Tree Code from Large Language Models☆33Updated 3 months ago
- ☆31Updated last month
- MindSpore large-scale recommender system library.☆10Updated last year
- LLM-PySC2 is NKAI Decision Team and NUDT Decision Team's Python component of the StarCraft II LLM Decision Environment. It exposes Deepmi…☆110Updated last month
- Preference Transformer: Modeling Human Preferences using Transformers for RL (accepted at ICLR 2023)☆159Updated last year
- xingtian is a componentized library for the development and verification of reinforcement learning algorithms☆310Updated last year
- ☆90Updated 2 years ago