waylandzhang / learn-reinforcement-learning
Reading and study notes, plus video-sharing notes, for the book "Reinforcement Learning"
☆48 Updated 3 weeks ago
Alternatives and similar repositories for learn-reinforcement-learning:
Users who are interested in learn-reinforcement-learning are comparing it to the repositories listed below
- ☆366 Updated 11 months ago
- ☆191 Updated this week
- A simple, cross-platform RAG framework and tutorial ☆173 Updated last month
- ☆38 Updated last month
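- Deepen your understanding of the Transformer model through a guided walkthrough of it ☆178 Updated last month
- LLM/MLOps/LLMOps ☆84 Updated 7 months ago
- Unlock the many uses of the HuggingFace ecosystem ☆89 Updated 4 months ago
- AI training courseware resources ☆83 Updated this week
- Pretrain a wiki LLM using transformers ☆37 Updated 7 months ago
- The simplest desktop pet, based on ERNIE Bot (文心一言) and Raspberry Pi Pico ☆71 Updated 2 months ago
- Companion data and code for the book "Natural Language Processing: Large Model Theory and Practice" ☆61 Updated 3 months ago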
- This is a multi-agent tutorial based on the CAMEL framework, aimed at understanding how to build an Agent Society from the ground up! ☆179 Updated this week
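- DPO training for Tongyi Qianwen (Qwen) ☆46 Updated 7 months ago
- Theory and practice of large model (LLM) inference and deployment ☆244 Updated last month
- Companion resources for the book "Large Model Projects in Practice: Multi-Domain Intelligent Application Development" ☆130 Updated last week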
- A simple, cross-platform agent framework and tutorial ☆91 Updated this week
- Implement a miniLLM from zero to one (hands-on LLM learning) ☆65 Updated 11 months ago
- wow-fullstack, an amazing full-stack development tutorial ☆170 Updated 3 weeks ago
- ☆75 Updated 2 months ago
- This repository walks you through building traditional NLP neural networks from scratch using PyTorch linear layers ☆35 Updated 4 months ago
- ☆22 Updated last month
- An attempt to write an LLM from scratch, with reference to llama and nanogpt ☆58 Updated 11 months ago
- 异步图书 book: "GPT Illustrated: How Large Models Are Built" ☆131 Updated last year
- Qwen2.5 0.5B GRPO ☆39 Updated 2 months ago
- AI Native Application Development In Action: Based On Model Context Protocol (MCP) ☆36 Updated this week
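- A role-playing multi-LLM chat room fine-tuned from InternLM2, built on data from the original text of "Journey to the West", its vernacular translation, and ChatGPT-generated data. This project covers everything about role-playing LLMs, from data collection and data processing, to fine-tuning with XTuner and deploying to OpenXLab, to deploying with LMDeploy, with op… ☆98 Updated last year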
- Happy experimenting with MLLM and LLM models! ☆103 Updated 6 months ago
- ☆13 Updated 4 months ago
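- A place to store code for AI projects and for data mining competitions ☆97 Updated last month
- This project aims to give beginners in the large model field a comprehensive body of knowledge, covering both basic and advanced content, so that developers can quickly master the large model technology stack and gain a thorough understanding of the field. ☆56 Updated 3 months ago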