cfcys / nanoGPT-Tutorial-CN

更友好的nanoGPT的中文教程

☆119

Alternatives and similar repositories for nanoGPT-Tutorial-CN:

Users that are interested in nanoGPT-Tutorial-CN are comparing it to the libraries listed below

wei-potato / Train-llm-from-scratch
使用deepspeed从头开始训练一个LLM,经过pretrain和sft阶段,验证llm学习知识、理解语言、回答问题的能力
☆141Updated 7 months ago
mst272 / LLM-Dojo
欢迎来到 LLM-Dojo，这里是一个开源大模型学习场所，使用简洁且易阅读的代码构建模型训练框架(支持各种主流模型如Qwen、Llama、GLM等等)、RLHF框架(DPO/CPO/KTO/PPO)等各种功能。👩‍🎓👨‍🎓
☆527Updated this week
JerryYin777 / NanoGPT-Pytorch2.0-Implementation
This is a repo for my NanoGPT Pytorch2.0 Implementation when torch2.0 released soon, faster and simpler, a good tutorial learning GPT.
☆52Updated last year
Qcompiler / MIXQ
MIXQ: Taming Dynamic Outliers in Mixed-Precision Quantization by Online Prediction
☆78Updated 3 months ago
huxiaosheng123 / open-llama2
从预训练到强化学习的中文llama2
☆86Updated last year
zhihu / ZhiLight
A highly optimized LLM inference acceleration engine for Llama and its variants.
☆855Updated this week
Ledzy / BAdam
[NeurIPS 2024] BAdam: A Memory Efficient Full Parameter Optimization Method for Large Language Models
☆243Updated 2 months ago
Phoenix8215 / A-White-Paper-on-Neural-Network-Deployment
模型部署白皮书(CUDA|ONNX|TensorRT|C++)🚀🚀🚀
☆172Updated 5 months ago
Coobiw / MPP-LLaVA
Personal Project: MPP-Qwen14B & MPP-Qwen-Next(Multimodal Pipeline Parallel based on Qwen-LM). Support [video/image/multi-image] {sft/conv…
☆408Updated 2 months ago
xxw1995 / chatglm3-finetune
最容易上手的0门槛 chatglm3 & agent & langchain 项目
☆220Updated last year
datawhalechina / llm-deploy
大模型/LLM推理和部署理论与实践
☆183Updated last week
bbruceyuan / LLMs-101
从零到一实现一个 miniLLM～（动手学习LLM）
☆60Updated 9 months ago
princepride / scratch-pytorch-step-by-step
教你只用最基本的python语法和numpy一步步实现深度学习框架
☆98Updated 6 months ago
SmartFlowAI / LLM101n-CN
LLM101n: Let's build a Storyteller 中文版
☆124Updated 6 months ago
Tongjilibo / build_MiniLLM_from_scratch
从0到1构建一个MiniLLM (pretrain+sft+dpo实践中)
☆373Updated 5 months ago
REXWindW / my_llm
尝试自己从头写一个LLM，参考llama和nanogpt
☆56Updated 9 months ago
AI-Study-Han / Zero-Chatgpt
从0开始，将chatgpt的技术路线跑一遍。
☆204Updated 5 months ago
NetEase-Media / grps_trtllm
【高性能OpenAI LLM服务】通过GPRS+TensorRT-LLM+Tokenizers.cpp实现纯C++版高性能OpenAI LLM服务，支持chat和function call模式，支持ai agent，支持分布式多卡推理，支持多模态，支持gradio聊天界面。
☆95Updated last week
bbruceyuan / AI-Interview-Code
LLM大模型（重点）以及搜广推等 AI 算法中手写的面试题，（非 LeetCode），比如 Self-Attention, AUC等，一般比 LeetCode 更考察一个人的综合能力，又更贴近业务和基础知识一点
☆134Updated last month
Qcompiler / vllm-mixed-precision
Support mixed-precsion inference with vllm
☆80Updated last month
Phoenix8215 / BuildCudaNeuralNetworkFromScratch
Build CUDA Neural Network From Scratch
☆17Updated 5 months ago
bytedance / ABQ-LLM
An acceleration library that supports arbitrary bit-width combinatorial quantization operations
☆214Updated 4 months ago
zhaibowen / Retriever
Retriever-0.1B
☆82Updated 8 months ago
datawhalechina / llm-research
☆52Updated 10 months ago
Mxoder / LLM-from-scratch
一些 LLM 方面的从零复现笔记
☆164Updated 5 months ago
enze5088 / Chatterbox
Chinese large language model
☆117Updated last year
NetEase-Media / grps
【深度学习模型部署框架】支持tf/torch/trt/trtllm/vllm以及更多nn框架，支持dynamic batching、streaming模式，支持python/c++双语言，可限制，可拓展，高性能。帮助用户快速地将模型部署到线上，并通过http/rpc接口方式…
☆155Updated last month
datawhalechina / unlock-hf
解锁HuggingFace生态的百般用法
☆87Updated 2 months ago