cfcys / nanoGPT-Tutorial-CN
A friendlier Chinese-language tutorial for nanoGPT
☆107 Updated 8 months ago
Alternatives and similar repositories for nanoGPT-Tutorial-CN
Users who are interested in nanoGPT-Tutorial-CN are comparing it to the repositories listed below:
- Train an LLM from scratch with DeepSpeed, through pretraining and SFT stages, to verify the LLM's ability to learn knowledge, understand language, and answer questions ☆135 Updated 6 months ago
- A repo for my NanoGPT implementation on PyTorch 2.0, written around the torch 2.0 release: faster and simpler, and a good tutorial for learning GPT. ☆50 Updated 11 months ago
- The easiest-to-start, zero-barrier chatglm3 & agent & langchain project ☆219 Updated 11 months ago
- MIXQ: Taming Dynamic Outliers in Mixed-Precision Quantization by Online Prediction ☆76 Updated 2 months ago
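- Chinese llama2, from pretraining to reinforcement learning ☆84 Updated last year
- Model deployment white paper (CUDA|ONNX|TensorRT|C++) 🚀🚀🚀 ☆166 Updated 4 months ago
- Chinese large language model ☆118 Updated last year
- Hand-coding interview questions from LLMs (the focus) and other AI algorithm areas such as search, advertising, and recommendation (not LeetCode), for example Self-Attention and AUC; these generally test a candidate's overall ability more than LeetCode does and sit closer to business practice and fundamentals ☆70 Updated 3 weeks ago
- Support mixed-precision inference with vLLM ☆79 Updated last week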
- A highly optimized LLM inference acceleration engine for Llama and its variants. ☆762 Updated this week
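- Welcome to LLM-Dojo, an open-source place for learning large models, which uses concise, readable code to build a model training framework (supporting mainstream models such as Qwen, Llama, and GLM), an RLHF framework (DPO/CPO/KTO/PPO), and other features. 👩🎓👨🎓 ☆452 Updated last week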
- [NeurIPS 2024] BAdam: A Memory Efficient Full Parameter Optimization Method for Large Language Models ☆228 Updated last month
- Personal Project: MPP-Qwen14B & MPP-Qwen-Next (Multimodal Pipeline Parallel based on Qwen-LM). Support [video/image/multi-image] {sft/conv… ☆393 Updated last month
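- [High-performance OpenAI LLM service] A pure C++ high-performance OpenAI LLM service built with GPRS+TensorRT-LLM+Tokenizers.cpp, supporting chat and function call modes, AI agents, distributed multi-GPU inference, multimodality, and a gradio chat interface. ☆88 Updated this week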
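- Chat-Style-Bot is a large language model that imitates chat style: by analyzing and learning from WeChat chat history, it can imitate your way of speaking (catchphrases and so on) and can be connected to WeChat to chat with your friends automatically. ☆75 Updated 5 months ago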
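- Theory and practice of large model/LLM inference and deployment ☆140 Updated this week
- A beginner's introductory tutorial on model compression ☆224 Updated 2 months ago
- Implement a miniLLM from zero to one (hands-on LLM learning) ☆55 Updated 8 months ago
- Teaches you to implement a deep learning framework step by step using only basic Python syntax and numpy ☆95 Updated 5 months ago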
- ☆52 Updated 9 months ago
- A deployment, monitoring and autoscaling service towards serverless LLM serving. ☆144 Updated 3 weeks ago
- LLM101n: Let's build a Storyteller, Chinese edition ☆121 Updated 5 months ago
- Chinese translation of Hands-On-Large-Language-Models (hands-on-llms): hands-on learning of large models ☆254 Updated last week
- HuatuoGPT2, One-stage Training for Medical Adaption of LLMs. (An Open Medical GPT) ☆347 Updated 4 months ago
- YiJian-Community: a full-process automated large model safety evaluation tool designed for academic research ☆105 Updated 3 months ago
- This project aims to build on representative work by previous researchers to evaluate SFT data along multiple dimensions and filter it automatically. ☆44 Updated 10 months ago
- Build a MiniLLM from 0 to 1 (pretrain+sft+dpo, practice in progress) ☆358 Updated 4 months ago
- This tool (enhance_long) aims to enhance LLaMA 2's long-context extrapolation capability at the lowest cost, preferably without … ☆45 Updated last year
- Align Anything: Training All-modality Model with Feedback ☆484 Updated this week
- ShadowKV: KV Cache in Shadows for High-Throughput Long-Context LLM Inference ☆145 Updated 2 months ago