cfcys / nanoGPT-Tutorial-CN
更友好的nanoGPT的中文教程
☆119Updated 9 months ago
Alternatives and similar repositories for nanoGPT-Tutorial-CN:
Users that are interested in nanoGPT-Tutorial-CN are comparing it to the libraries listed below
- 使用deepspeed从头开始训练一个LLM,经过pretrain和sft阶段,验证llm学习知识、理解语言、回答问题的能力☆141Updated 7 months ago
- 欢迎来到 LLM-Dojo,这里是一个开源大模型学习场所,使用简洁且易阅读的代码构建模型训练框架(支持各种主流模型如Qwen、Llama、GLM等等)、RLHF框架(DPO/CPO/KTO/PPO)等各种功能。👩🎓👨🎓☆527Updated this week
- This is a repo for my NanoGPT Pytorch2.0 Implementation when torch2.0 released soon, faster and simpler, a good tutorial learning GPT.☆52Updated last year
- MIXQ: Taming Dynamic Outliers in Mixed-Precision Quantization by Online Prediction☆78Updated 3 months ago
- 从预训练到强化学习的中文llama2☆86Updated last year
- A highly optimized LLM inference acceleration engine for Llama and its variants.☆855Updated this week
- [NeurIPS 2024] BAdam: A Memory Efficient Full Parameter Optimization Method for Large Language Models☆243Updated 2 months ago
- 模型部署白皮书(CUDA|ONNX|TensorRT|C++)🚀🚀🚀☆172Updated 5 months ago
- Personal Project: MPP-Qwen14B & MPP-Qwen-Next(Multimodal Pipeline Parallel based on Qwen-LM). Support [video/image/multi-image] {sft/conv…☆408Updated 2 months ago
- 最容易上手的0门槛 chatglm3 & agent & langchain 项目☆220Updated last year
- 大模型/LLM推理和部署理论与实践☆183Updated last week
- 从零到一实现一个 miniLLM~(动手学习LLM)☆60Updated 9 months ago
- 教你只用最基本的python语法和numpy一步步实现深度学习框架☆98Updated 6 months ago
- LLM101n: Let's build a Storyteller 中文版☆124Updated 6 months ago
- 从0到1构建一个MiniLLM (pretrain+sft+dpo实践中)☆373Updated 5 months ago
- 尝试自己从头写一个LLM,参考llama和nanogpt☆56Updated 9 months ago
- 从0开始,将chatgpt的技术路线跑一遍。☆204Updated 5 months ago
- 【高性能OpenAI LLM服务】通过GPRS+TensorRT-LLM+Tokenizers.cpp实现纯C++版高性能OpenAI LLM服务,支持chat和function call模式,支持ai agent,支持分布式多卡推理,支持多模态,支持gradio聊天界面。☆95Updated last week
- LLM大模型(重点)以及搜广推等 AI 算法中手写的面试题,(非 LeetCode),比如 Self-Attention, AUC等,一般比 LeetCode 更考察一个人的综合能力,又更贴近业务和基础知识一点☆134Updated last month
- Support mixed-precsion inference with vllm☆80Updated last month
- Build CUDA Neural Network From Scratch☆17Updated 5 months ago
- An acceleration library that supports arbitrary bit-width combinatorial quantization operations☆214Updated 4 months ago
- Retriever-0.1B☆82Updated 8 months ago
- ☆52Updated 10 months ago
- 一些 LLM 方面的从零复现笔记☆164Updated 5 months ago
- Chinese large language model☆117Updated last year
- 【深度学习模型部署框架】支持tf/torch/trt/trtllm/vllm以及更多nn框架 ,支持dynamic batching、streaming模式,支持python/c++双语言,可限制,可拓展,高性能。帮助用户快速地将模型部署到线上,并通过http/rpc接口方式…☆155Updated last month
- 解锁HuggingFace生态的百般用法☆87Updated 2 months ago