XR-stb / DQN_WUKONGLinks
基于强化学习的黑神话悟空AI
☆84Updated 7 months ago
Alternatives and similar repositories for DQN_WUKONG
Users that are interested in DQN_WUKONG are comparing it to the libraries listed below
Sorting:
- DQN_play_sekiro☆563Updated last year
- AI demo for playing ARPG/Soul-like game with RL frame☆383Updated last year
- 从小说中提取对话数据集☆318Updated 4 months ago
- 强化学习玩超级马里奥☆84Updated 3 years ago
- A non-embedded AI for Clash Royale based on RL and CV.☆384Updated last year
- ☆55Updated 2 years ago
- ☆21Updated 10 months ago
- mcc_second_guandan☆97Updated 3 years ago
- Honor of Kings AI Open Environment of Tencent☆797Updated last year
- NLP_Study_Demo☆169Updated last year
- 基于stablebaseline3强化学习框架和gym-super-mario-bros马里奥游戏包,训练马里奥通关。☆183Updated last month
- DQN model used to train and beat Super Mario Bros. for the NES using PyTorch☆36Updated 3 years ago
- a simple project to beat boss in Blackmyth Wukong, using yolo8 to detect boss movement and a script to react to certain detections☆154Updated last year
- bilibili video course src code☆423Updated 2 years ago
- This is the code of using machine learning to play Sekiro .☆103Updated 4 years ago
- 腾讯开悟智能体比赛(王者荣耀AI比赛,稳定版)☆71Updated 2 months ago
- 基于OpenVINO,本地部署大模型智能体Agent,控制TonyPi人形机器人☆153Updated 8 months ago
- Chat-甄嬛是利用《甄嬛传》剧本中所有关于甄嬛的台词和语句,基于ChatGLM2进行LoRA微调得到的模仿甄嬛语气的聊天语言模型。☆782Updated 8 months ago
- DQN_play_sekiro☆17Updated 2 years ago
- A Multi-modal RAG Project with Dataset from Honor of Kings, one of the most popular smart phone games in China☆72Updated last year
- Play atari Tennis game by dqn☆78Updated 3 years ago
- 基于《西游记》原文、白话文、ChatGPT生成数据制作的,以InternLM2微调的角色扮演多LLM聊天室。 本项目将介绍关于角色扮演类 LLM 的一切,从数据获取、数据处理,到使用 XTuner 微调并部署至 OpenXLab,再到使用 LMDeploy 部署,以 op…☆106Updated last year
- use PPO Reinforcement Learning to play FlappyBird, code with pytorch☆27Updated 2 years ago
- 基于文心一言和树莓派Pico的最简易桌面宠物☆86Updated 4 months ago
- 从0开始,将chatgpt的技术路线跑一遍。☆272Updated last year
- 用基于策略梯度得强化学习方法训练AI玩王者荣耀☆1,800Updated 4 years ago
- Llama3-Chinese是以Meta-Llama-3-8B为底座,使用 DORA + LORA+ 的训练方法,在50w高质量中文多轮SFT数据 + 10w英文多轮SFT数据 + 2000单轮自我认知数据训练而来的大模型。☆295Updated last year
- D3QN 强化学习打只狼☆31Updated 3 years ago
- simple decoder-only GTP model in pytorch☆42Updated last year
- ☆465Updated 7 months ago