XR-stb / DQN_WUKONG
A reinforcement-learning AI for Black Myth: Wukong
☆77Updated 2 months ago
Alternatives and similar repositories for DQN_WUKONG
Users that are interested in DQN_WUKONG are comparing it to the libraries listed below
- DQN_play_sekiro☆537Updated 11 months ago
- AI demo for playing ARPG/Souls-like games with an RL framework☆354Updated 11 months ago
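- Extract dialogue datasets from novels☆239Updated last year
- An AI model that plays Honor of Kings (王者荣耀)☆221Updated 4 months ago
- A locally deployed LLM agent, based on OpenVINO, that controls the TonyPi humanoid robot☆144Updated 2 months ago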
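- Chat-甄嬛 is a chat language model that imitates Zhen Huan's tone, obtained by LoRA fine-tuning ChatGLM2 on all of Zhen Huan's lines and utterances from the script of 《甄嬛传》 (Empresses in the Palace).☆727Updated 3 months ago
- Playing Super Mario with reinforcement learning☆76Updated 3 years ago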
- ☆18Updated 5 months ago
- A simple project to beat bosses in Black Myth: Wukong, using YOLOv8 to detect boss movements and a script to react to certain detections☆150Updated 11 months ago
- Trains Mario to clear levels using the Stable-Baselines3 reinforcement learning framework and the gym-super-mario-bros game package.☆123Updated 2 months ago
- Uses PPO reinforcement learning to play FlappyBird, implemented in PyTorch☆25Updated 2 years ago
- [EMNLP'24] CharacterGLM: Customizing Chinese Conversational AI Characters with Large Language Models☆476Updated 7 months ago
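- 🚀WebUI integrated platform for latest LLMs | An all-in-one WebUI toolkit covering the full workflow for major language models. Supports mainstream LLM API interfaces and open-source models. Supports knowledge bases, databases, role-play, mj text-to-image, LoRA and full-parameter fine-tuning, dataset creation, live2d, and other end-to-end applications…☆547Updated 9 months ago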
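- This is the code for using machine learning to play Sekiro.☆102Updated 4 years ago
- [Item-by-item processing complete] A QA dataset of selected questions from Ruozhiba (弱智吧), with every entry manually reviewed and revised☆221Updated 4 months ago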
- API and websocket server for sensevoice. It has inherited some enhanced features, such as VAD detection, real-time streaming recognition,…☆496Updated 10 months ago
- Using deep reinforcement learning to play the Snake game (贪吃蛇).☆81Updated 3 years ago
- A multi-modal RAG project with a dataset from Honor of Kings, one of the most popular smartphone games in China☆68Updated last year
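- A tool for converting text corpora into training sets (txt to dataset)☆93Updated last year
- A project for getting small-parameter LLMs to do role-play with one click; everything from dataset construction to training is included in the project☆25Updated last year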
- A plan to extend ChatHaruhi into a zero-shot role-playing model☆108Updated last year
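- Llama3-Chinese is a large model built on the Meta-Llama-3-8B base and trained with the DORA + LORA+ method on 500k high-quality Chinese multi-turn SFT samples + 100k English multi-turn SFT samples + 2,000 single-turn self-cognition samples.☆295Updated last year
- Tencent Kaiwu (开悟) agent competition (Honor of Kings AI competition, stable version)☆52Updated 3 weeks ago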
- ☆248Updated 3 months ago
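- Phi2-Chinese-0.2B: train your own small Chinese Phi2 chat model from scratch, with support for langchain integration to load a local knowledge base for retrieval-augmented generation (RAG).☆567Updated last year
- A small 0.2B Chinese dialogue model (ChatLM-Chinese-0.2B), with all code open-sourced for the full pipeline: dataset sources, data cleaning, tokenizer training, model pre-training, SFT instruction fine-tuning, RLHF optimization, and more. Supports SFT fine-tuning for downstream tasks, with a fine-tuning example for triple information extraction.☆1,586Updated last year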
- DQN_play_sekiro☆15Updated last year
- ☆308Updated last year
- Reproductions of LLM-related algorithms, plus some study notes☆2,069Updated last month
- An automated exam-grading project using OCR☆332Updated last year