cfcys / nanoGPT-Tutorial-CN
A friendlier Chinese-language tutorial for nanoGPT
☆97 Updated 5 months ago
Related projects
Alternatives and complementary repositories for nanoGPT-Tutorial-CN
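- Train an LLM from scratch with DeepSpeed, going through the pretraining and SFT stages, to verify the LLM's ability to learn knowledge, understand language, and answer questions ☆154 Updated 3 months ago
- Chinese Llama 2, from pretraining to reinforcement learning ☆94 Updated last year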
- A repo for my NanoGPT implementation in PyTorch 2.0, written around the torch 2.0 release; faster and simpler, and a good tutorial for learning GPT. ☆59 Updated 8 months ago
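- [grps with trtllm] A pure C++ high-performance LLM service built by integrating TensorRT-LLM and Tokenizers.cpp; compatible with the OpenAI API protocol; supports chat and function-call modes, AI agents, distributed multi-GPU inference, multimodal input, and a Gradio chat… ☆87 Updated last week
- The easiest-to-pick-up, zero-barrier chatglm3 & agent & langchain project ☆244 Updated 9 months ago
- Welcome to LLM-Dojo, an open-source place for learning about large models. It uses concise, easy-to-read code to build a model training framework (supporting mainstream models such as Qwen, Llama, GLM, and others), RLHF frameworks (DPO/CPO/KTO/PPO), and other features. 👩🎓👨🎓 ☆324 Updated this week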
- Chinese large language model ☆130 Updated last year
- Model deployment white paper (CUDA|ONNX|TensorRT|C++) 🚀🚀🚀 ☆185 Updated last month
- [NeurIPS 2024] BAdam: A Memory Efficient Full Parameter Optimization Method for Large Language Models ☆209 Updated last month
- Personal Project: MPP-Qwen14B & MPP-Qwen-Next (Multimodal Pipeline Parallel based on Qwen-LM). Support [video/image/multi-image] {sft/conv… ☆376 Updated last month
- Build CUDA Neural Network From Scratch ☆17 Updated 2 months ago
- Support mixed-precision inference with vllm ☆94 Updated this week
- MIXQ: Taming Dynamic Outliers in Mixed-Precision Quantization by Online Prediction ☆76 Updated last week
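- This tool (enhance_long) aims to enhance the LlaMa2 long-context extrapolation capability at the lowest possible cost, preferably without … ☆47 Updated 11 months ago
- Quantitative live trading and simulated live trading for China A-shares. Built on brokerage APIs, this project implements online live trading for a small-cap strategy demo. You can use it as a reference to easily take your own local strategy live. #A-shares #quant ☆64 Updated 3 weeks ago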
- Aiming to build the most comprehensive machine learning blog. ☆153 Updated this week
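- This project draws on representative work by previous researchers to evaluate SFT data along multiple dimensions and filter it automatically. ☆55 Updated 8 months ago
- Builds a neo4j knowledge graph of Chinese universities and supports simple question answering on it, to help users learn about universities and fill in their gaokao (college entrance exam) applications ☆57 Updated 2 years ago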
- HuatuoGPT2, One-stage Training for Medical Adaption of LLMs. (An Open Medical GPT) ☆361 Updated 2 months ago
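- [Deep learning model deployment framework] Supports tf/torch/trt/trtllm/vllm and more NN frameworks, dynamic batching and streaming modes, and both Python and C++; limitable, extensible, and high-performance. Helps users quickly deploy models online and, via HTTP/RPC interfaces… ☆156 Updated this week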
- [EMNLP 2023] FreeAL: Towards Human-Free Active Learning in the Era of Large Language Models ☆85 Updated 10 months ago
- The framework to prune LLMs to any size and any config. ☆99 Updated 8 months ago
- A collection of papers related to knowledge fusion ☆63 Updated last month
- Outbound follow-up call bot for the insurance industry ☆74 Updated last year
- A Python package that takes tables from a web page and processes them to get high-quality tables ☆53 Updated 2 years ago
- RASA-based Chinese task-oriented chatbot ☆98 Updated last week
- [ACL 2024] User-friendly evaluation framework: Eval Suite & Benchmarks: UHGEval, HaluEval, HalluQA, etc. ☆182 Updated last month
- [NeurIPS'24] Knowledge Graph Fine-Tuning with Open-World Knowledge ☆101 Updated last month
- Teaches you to implement a deep learning framework step by step, using only the most basic Python syntax and NumPy ☆120 Updated 3 months ago