lyingCS / ruc_gsai_rlLinks
这是中国人民大学高瓴人工智能学院本科课程《强化学习》的期末项目安排,项目内容是训练一个适用于国标麻将的强化学习智能体。
☆20Updated last year
Alternatives and similar repositories for ruc_gsai_rl
Users that are interested in ruc_gsai_rl are comparing it to the libraries listed below
Sorting:
- Honor of Kings AI Open Environment of Tencent☆778Updated last year
- AI demo for playing ARPG/Soul-like game with RL frame☆371Updated last year
- [ICLR 2025 Oral] PyTorch code for the paper "Open-World Reinforcement Learning over Long Short-Term Imagination"☆184Updated last month
- A non-embedded AI for Clash Royale based on RL and CV.☆364Updated last year
- DQN_play_sekiro☆556Updated last year
- 腾讯开悟智能体比赛(王者荣耀AI比赛,稳定版)☆61Updated 2 weeks ago
- ☆54Updated last year
- 南京大学人工智能学院本科生开放日面试经验分享☆34Updated 6 months ago
- PKU course, Reinforced Learning, final project☆27Updated 4 years ago
- 本仓库是关于大模型面试中常见面试试题和面试经验的整理。这里收集了各类与大模型相关的面试题目,并提供详细的解答和分析。本仓库由上海交大交影社区维护☆110Updated last year
- ☆16Updated 8 months ago
- Computer Vision(04711432) | Peking Univ. ECE Course Materials☆15Updated 3 years ago
- Run TRex with PPO☆39Updated 6 months ago
- 强化学习第二版习题解答与代码案例 Solutions and codes for Reinforcement Learning second edition☆162Updated 4 years ago
- Douzero with ResNet and GPU support for Windows☆46Updated 3 years ago
- 机器人走迷宫,Pytorch,强化学习,DQN。☆98Updated 4 years ago
- A New Approach to Solving SMAC Task: Generating Decision Tree Code from Large Language Models☆50Updated 8 months ago
- ☆36Updated 6 months ago
- Deep Learning For Computer Vision Winter 2022 By Prof. Justin Johnson☆24Updated 3 years ago
- This repo is a live list of papers on game playing and large multimodality model - "A Survey on Game Playing Agents and Large Models: Met…☆160Updated last year
- Learn to play Sekiro with reinforcement learning.☆17Updated 3 years ago
- Shanghai Jiao Tong University 2023-2024, CS3601 Operating System☆21Updated last year
- source code for AAMAS 2023 Imperfect-information Card Game Competition☆13Updated last year
- LLM-PySC2 is NKAI Decision Team and NUDT Decision Team's Python component of the StarCraft II LLM Decision Environment. It exposes Deepmi…☆143Updated 7 months ago
- Baseline for NeurIPS_Auto_Bidding_General_Track☆38Updated last year
- Playing Hollow Knight with reinforcement learning.☆105Updated 2 years ago
- 这是一个open-r1的复现项目,对0.5B、1.5B、3B、7B的qwen模型进行GRPO训练,观察到一些有趣的现象。☆52Updated 7 months ago
- 北京交通大学本科毕设latex模板(非官方)☆53Updated 3 years ago
- all the notes, ppts and homework for CS224n☆129Updated last year
- ☆49Updated 7 months ago