sun1638650145 / deep-rl-class-zh
Hugging Face Deep Reinforcement Learning Course (Chinese edition)
☆22 Updated 3 years ago
Alternatives and similar repositories for deep-rl-class-zh
Users who are interested in deep-rl-class-zh are comparing it to the repositories listed below.
- ☆174 Updated last year
- deep learning ☆149 Updated 8 months ago
- Notes on learning reinforcement learning through animations ☆65 Updated 11 months ago
- Unlocking the many uses of the HuggingFace ecosystem ☆98 Updated last year
- GRAIN: Gradient-based Intra-attention Pruning on Pre-trained Language Models ☆19 Updated 2 years ago
- PyTorch distributed training ☆73 Updated 2 years ago
- qwen models finetuning ☆105 Updated 10 months ago
- ☆77 Updated 2 years ago
- A survey of large language model training and serving ☆37 Updated 2 years ago
- Awesome Colab Projects Collection ☆29 Updated 2 years ago
- SuperCLUE Langya Leaderboard (琅琊榜): an anonymous head-to-head evaluation benchmark for Chinese general-purpose large models ☆145 Updated last year
- ☆157 Updated 2 years ago
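- The first Chinese llama2 13b model (Base + Chinese dialogue SFT, enabling fluent multi-turn natural-language interaction between humans and the model) ☆91 Updated 2 years ago
- Dataset synthesis, model training, and evaluation for the mathematical problem-solving ability of large models, along with notes on related articles. ☆100 Updated last year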
- Source code for a bilibili video course ☆419 Updated 2 years ago
- Gemma-SFT: gemma-2b/gemma-7b fine-tuning (transformers), LoRA (peft), and inference ☆33 Updated last year
- LoRA ☆18 Updated 2 years ago
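- This project aims to give beginners in the large-model field a comprehensive body of knowledge, covering both basic and advanced material, so that developers can quickly master the large-model technology stack. ☆62 Updated last year
- Text deduplication ☆77 Updated last year
- An open-source implementation (reproduction) of DeepSeek-R1 ☆273 Updated 10 months ago
- A Practical Guide to Large Language Models: Application Practice and Real-World Deployment ☆85 Updated last year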
- The Roadmap for LLMs ☆86 Updated 2 years ago
- ☆284 Updated last month
- A line-by-line walkthrough of the Baichuan2 code, suitable for beginners ☆213 Updated 2 years ago
- Parameter-efficient fine-tuning of ChatGLM-6B based on LoRA and P-Tuning v2 ☆55 Updated 2 years ago
- A multi-modal RAG project with a dataset from Honor of Kings, one of the most popular smartphone games in China ☆72 Updated last year
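- A collection of currently available open-source Chinese dialogue datasets ☆199 Updated 2 years ago
- Line-by-line explanations of Qwen (千问) 14B and 7B ☆63 Updated 2 years ago
- PICA, a multi-turn empathetic dialogue model ☆97 Updated 2 years ago
- A compilation of AGI research materials ☆24 Updated 4 months ago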