sun1638650145 / deep-rl-class-zhLinks
Hugging Face 深度强化学习课程(中文版)
☆22Updated 3 years ago
Alternatives and similar repositories for deep-rl-class-zh
Users that are interested in deep-rl-class-zh are comparing it to the libraries listed below
Sorting:
- An easier PyTorch deep reinforcement learning library.☆244Updated 11 months ago
- pytorch分布式训练☆72Updated 2 years ago
- deep learning☆149Updated 7 months ago
- 通过动画学强化学习笔记☆63Updated 9 months ago
- Gemma-SFT, gemma-2b/gemma-7b微调(finetune,transformers)/LORA(peft)/推理(inference)☆33Updated last year
- 本项目用于大模型数学解题能力方面的数据集合成,模型训练及评测,相关文章记录。☆97Updated last year
- LoRA☆18Updated 2 years ago
- 千问14B和7B的逐行解释☆63Updated 2 years ago
- The Roadmap for LLMs☆86Updated 2 years ago
- ☆76Updated 2 years ago
- 解锁HuggingFace生态的百般用法☆97Updated 11 months ago
- GRAIN: Gradient-based Intra-attention Pruning on Pre-trained Language Models☆19Updated 2 years ago
- bilibili video course src code☆407Updated 2 years ago
- 大型语言模型实战指南:应用实践与场景落地☆83Updated last year
- 大语言模型应用:RAG、NL2SQL、聊天机器人、预训练、MOE混合专家模型、微调训练、强化学习、天池数据竞赛☆73Updated 10 months ago
- 通义千问的DPO训练☆60Updated last year
- The open source implementation of DeepSeek-R1. 开源复现 DeepSeek-R1☆273Updated 9 months ago
- ☆19Updated last year
- ☆169Updated last year
- 本项目致力于为大模型领域的初学者提供全面的知识体系,包括基础和高阶内容,以便开发者能迅速掌握大模型技术栈并全面了解 相关知识。☆62Updated 11 months ago
- "桃李“: 国际中文教育大模型☆189Updated 2 years ago
- 一些 LLM 方面的从零复现笔记☆238Updated 7 months ago
- 阿里通义千问(Qwen-7B-Chat/Qwen-7B), 微调/LORA/推理☆129Updated last year
- ☆281Updated 2 weeks ago
- everything about llm & aigc☆109Updated last week
- simple decoder-only GTP model in pytorch☆43Updated last year
- 在中文开源大模型的基础上进行定制化的微调,拥有自己专属的语言模型。☆51Updated 2 years ago
- A Multi-modal RAG Project with Dataset from Honor of Kings, one of the most popular smart phone games in China☆70Updated last year
- ☆118Updated last year
- ☆194Updated 10 months ago