XR-stb / DQN_WUKONGLinks
基于强化学习的黑神话悟空AI
☆78Updated 3 months ago
Alternatives and similar repositories for DQN_WUKONG
Users that are interested in DQN_WUKONG are comparing it to the libraries listed below
Sorting:
- AI demo for playing ARPG/Soul-like game with RL frame☆354Updated 11 months ago
- DQN_play_sekiro☆538Updated last year
- 从小说中提取对话数据集☆246Updated last week
- 强化学习玩超级马里奥☆77Updated 3 years ago
- ☆18Updated 6 months ago
- ☆259Updated last month
- A non-embedded AI for Clash Royale based on RL and CV.☆309Updated last year
- use PPO Reinforcement Learning to play FlappyBird, code with pytorch☆26Updated 2 years ago
- 腾讯开悟智能体比赛(王者荣耀AI比赛,稳定版)☆53Updated last month
- 🚀WebUI integrated platform for latest LLMs | 各大语言模型的全流程工具 WebUI 整合包。支持主流大模型API接口和开源模型。支持知识库,数据库,角色扮演,mj文生图,LoRA和全参数微调,数据集制作,live2d等全流程应用…☆548Updated 10 months ago
- Chat-甄嬛是利用《甄嬛传》剧本中所有关于甄嬛的台词和语句,基于ChatGLM2进行LoRA微调得到的模仿甄嬛语气的聊天语言模型。☆735Updated 3 months ago
- 基于大语言模型(LLM)和多智能体(Multi-Agent),探究AI写小说能力的边界☆327Updated last year
- Sample GLM4V + ChatTTS AI assistant☆85Updated last year
- 基于stablebaseline3强化学习框架和gym-super-mario-bros马里奥游戏包,训练马里奥通关。☆133Updated 3 months ago
- NLP_Study_Demo☆159Updated last year
- 基于《西游记》原文、白话文、ChatGPT生成数据制作的,以InternLM2微调的角色扮演多LLM聊天室。 本项目将介绍关于角色扮演类 LLM 的一切,从数据获取、数据处理,到使用 XTuner 微调并部署至 OpenXLab,再到使用 LMDeploy 部署,以 op…☆103Updated last year
- fine-tune deepseek r1☆123Updated 7 months ago
- Phi3 中文后训练模型仓库☆322Updated 9 months ago
- [EMNLP'24] CharacterGLM: Customizing Chinese Conversational AI Characters with Large Language Models☆478Updated 8 months ago
- a simple project to beat boss in Blackmyth Wukong, using yolo8 to detect boss movement and a script to react to certain detections☆150Updated last year
- The plan which extend ChatHaruhi into Zero-shot Roleplaying model☆108Updated last year
- 《大模型项目实战:多领域智能应用开发》配套资源☆173Updated last week
- 一种快速、轻松的AI辅助标注工具LabelQuick☆268Updated 7 months ago
- 基于已有基座模型微调的算命大模型☆183Updated last year
- Play atari Tennis game by dqn☆76Updated 3 years ago
- 基于OpenVINO,本地部署大模型智能体Agent,控制TonyPi人形机器人☆144Updated 3 months ago
- Llama3-Tutorial(XTuner、LMDeploy、OpenCompass)☆512Updated last year
- bilibili video course src code☆375Updated last year
- MCP Server for the Bilibili API, supporting various operations.☆157Updated 4 months ago
- Llama3-Chinese是以Meta-Llama-3-8B为底座,使用 DORA + LORA+ 的训练方法,在50w高质量中文多轮SFT数据 + 10w英文多轮SFT数据 + 2000单轮自我认知数据训练而来的大模型。☆295Updated last year