cfcys / nanoGPT-Tutorial-CN
A friendlier Chinese-language tutorial for nanoGPT
☆99Updated 6 months ago
Related projects
Alternatives and complementary repositories for nanoGPT-Tutorial-CN
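- Trains an LLM from scratch with deepspeed through pretraining and SFT stages, verifying the LLM's ability to learn knowledge, understand language, and answer questions☆155Updated 4 months ago
- A repo for my NanoGPT implementation in PyTorch 2.0, written soon after torch 2.0 was released; faster and simpler, and a good tutorial for learning GPT.☆60Updated 9 months ago
- Chinese llama2, from pretraining to reinforcement learning☆94Updated last year
- Welcome to LLM-Dojo, an open-source place for learning about large models. It uses concise, readable code to build a model training framework (supporting mainstream models such as Qwen, Llama, and GLM), an RLHF framework (DPO/CPO/KTO/PPO), and other features. 👩‍🎓👨‍🎓☆351Updated 2 weeks ago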
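- Live and simulated quantitative trading for China A-shares. Built on a brokerage API, this project implements a small-cap strategy demo for online live trading. You can use it as a reference to take your own local strategies live with ease. #A-shares #quant☆78Updated last month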
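- [grps with trtllm] A pure C++ high-performance OpenAI LLM service built with GRPS + TensorRT-LLM + Tokenizers.cpp, supporting chat and function-call modes, AI agents, distributed multi-GPU inference, multimodal input, and a Gradio chat interface.☆92Updated 3 weeks ago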
- The easiest-to-start, zero-barrier chatglm3 & agent & langchain project☆244Updated 9 months ago
- Chinese large language model☆132Updated last year
- Support mixed-precision inference with vllm☆97Updated 2 weeks ago
- [ACL 2024] User-friendly evaluation framework: Eval Suite & Benchmarks: UHGEval, HaluEval, HalluQA, etc.☆184Updated last week
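- This project aims to combine representative work by previous researchers to evaluate SFT data along multiple dimensions and filter it automatically.☆55Updated 8 months ago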
- [NeurIPS 2024] BAdam: A Memory Efficient Full Parameter Optimization Method for Large Language Models☆215Updated this week
- Personal Project: MPP-Qwen14B & MPP-Qwen-Next(Multimodal Pipeline Parallel based on Qwen-LM). Support [video/image/multi-image] {sft/conv…☆381Updated 2 months ago
- Model deployment white paper (CUDA|ONNX|TensorRT|C++) 🚀🚀🚀☆186Updated 2 months ago
- This tool (enhance_long) aims to enhance the Llama 2 long-context extrapolation capability at the lowest cost, preferably without …☆47Updated 11 months ago
- MIXQ: Taming Dynamic Outliers in Mixed-Precision Quantization by Online Prediction☆83Updated 3 weeks ago
- Build CUDA Neural Network From Scratch☆19Updated 2 months ago
- HuatuoGPT2, One-stage Training for Medical Adaption of LLMs. (An Open Medical GPT)☆366Updated 2 months ago
- The framework to prune LLMs to any size and any config.☆99Updated 8 months ago
- Controllable Text Generation for Large Language Models: A Survey☆143Updated 2 months ago
- Outbound-call robot for customer follow-up in the insurance industry☆74Updated last year
- A deployment, monitoring and autoscaling service towards serverless LLM serving.☆162Updated this week
- A collection of papers related to knowledge fusion☆63Updated last month
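- A neo4j knowledge graph built from universities in China, with simple question answering, to help users learn about universities and fill in their gaokao (college entrance exam) applications☆57Updated 2 years ago
- [Deep learning model deployment framework] Supports tf/torch/trt/trtllm/vllm and more NN frameworks, dynamic batching and streaming modes, and both Python and C++; supports limits, is extensible, and is high-performance. Helps users quickly deploy models online and, through HTTP/RPC interfaces…☆167Updated last week
- Teaches you to implement a deep learning framework step by step using only basic Python syntax and numpy☆120Updated 3 months ago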
- An implementation of transformer, bert, gpt, and diffusion models for learning purposes☆147Updated last month
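- Chat-Style-Bot is a large language model that imitates chat style: by analyzing and learning from WeChat chat history, it can imitate your speaking style (catchphrases and so on) and can connect to WeChat to chat with your friends automatically. Chat-Style-Bot is a chat style imitating llm. By analyzing an…☆85Updated 4 months ago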
- A python package that takes tables from a web page and processes them to get high quality tables☆53Updated 2 years ago
- Aiming to build the most comprehensive machine learning blog.☆153Updated this week