lyingCS / ruc_gsai_rlLinks
这是中国人民大学高瓴人工智能学院本科课程《强化学习》的期末项目安排,项目内容是训练一个适用于国标麻将的强化学习智能体。
☆22Updated last year
Alternatives and similar repositories for ruc_gsai_rl
Users that are interested in ruc_gsai_rl are comparing it to the libraries listed below
Sorting:
- [ICLR 2025 Oral] PyTorch code for the paper "Open-World Reinforcement Learning over Long Short-Term Imagination"☆188Updated 3 months ago
- 南京大学人工智能学院本科生开放日面试经验分享☆35Updated 7 months ago
- A non-embedded AI for Clash Royale based on RL and CV.☆383Updated last year
- Honor of Kings AI Open Environment of Tencent☆789Updated last year
- 腾讯开悟智能体比赛(王者荣耀AI比赛,稳定版)☆66Updated last month
- [ACL 2025] Cross-Lingual Pitfalls: Automatic Probing Cross-Lingual Weakness of Multilingual Large Language Models☆42Updated 7 months ago
- PKU course, Reinforced Learning, final project☆27Updated 4 years ago
- ☆24Updated 3 months ago
- 本仓库是关于大模型面试中常见面试试题和面试经验的整理。这里收集了各类与大模型相关的面试题目,并提供详细的解答和分析。本仓库由上海交大交影社区维护☆115Updated last year
- TextStarCraft2,a pure language env which support llms play starcraft2☆295Updated 8 months ago
- Playing Hollow Knight with reinforcement learning.☆110Updated 2 years ago
- AI demo for playing ARPG/Soul-like game with RL frame☆377Updated last year
- all the notes, ppts and homework for CS224n☆134Updated last year
- 复旦大学2025级研究生新生入学教育测试☆72Updated 4 months ago
- 这是一个open-r1的复现项目,对0.5B、1.5B、3B、7B的qwen模型进行GRPO训练,观察到一些有趣的现象。☆54Updated 9 months ago
- ☆58Updated last year
- Summer Training 2023, SAST 9.☆44Updated 2 years ago
- 美赛爬虫,美国大学生数学建模竞赛证书爬取及信息OCR识别分析☆18Updated 3 years ago
- ☆58Updated 6 months ago
- LLM大模型(重点)以及搜广推等 AI 算法中手写的面试题,(非 LeetCode),比如 Self-Attention, AUC等,一般比 LeetCode 更考察一个人的综合能力,又更贴近业务和基础知识一点☆467Updated last year
- Run TRex with PPO☆39Updated 8 months ago
- modern AI for beginners☆188Updated 4 months ago
- ☆54Updated last year
- This repository provides a comprehensive library for parallel training and LoRA algorithm implementations, supporting multiple parallel s…☆52Updated last week
- A Survey on Large Language Model-Based Game Agents☆802Updated 2 months ago
- 在没有sudo权限的情况下,在linux上使用clash☆169Updated last year
- Hierarchical Expert Prompt for Large-Language-Models: An Approch Defeat Elite AI in TextStarCraft-II for the First Time☆52Updated last year
- Experiment task scheduling made easy.☆30Updated last week
- NJUAI-Master-Courses☆30Updated 2 years ago
- DeepBattler - Your BEST LLM Battlegrounds Coach/Friend!☆286Updated last month