大模型/LLM推理和部署理论与实践
☆377Jul 14, 2025Updated 7 months ago
Alternatives and similar repositories for llm-deploy
Users that are interested in llm-deploy are comparing it to the libraries listed below
Sorting:
- 模型压缩的小白入门教程,PDF下载地址 https://github.com/datawhalechina/awesome-compression/releases☆356Nov 25, 2025Updated 3 months ago
- 解锁HuggingFace生态的百般用法☆98Dec 14, 2024Updated last year
- 《大模型白盒子构建指南》:一个全手搓的Tiny-Universe☆4,563Feb 12, 2026Updated 3 weeks ago
- wow-fullstack,令人惊叹的全栈开发教程☆246Updated this week
- 通过带领大家解读Transformer模型来加深对模型的理解☆235Jun 3, 2025Updated 9 months ago
- 向量检索与 RAG 实践:技术、实现与应用☆151Nov 5, 2024Updated last year
- 《开源大模型食用指南》针对中国宝宝量身打造的基于Linux环境快速微调(全参数/Lora)、部署国内外开源大模型(LLM)/多模态大模型(MLLM)教程☆28,676Feb 24, 2026Updated last week
- 大模型基础: 一文了解大模型基础知识☆6,806Dec 18, 2025Updated 2 months ago
- This is a multi agent tutorial based on the CAMEL framework, aimed at understanding how to build an Agent Society from the ground up!☆738Jan 16, 2026Updated last month
- A simple and trans-platform rag framework and tutorial☆229Jan 17, 2026Updated last month
- 本项目是一个面向小白开发者的大模型应用开发教程,在线阅读地址:https://datawhalechina.github.io/llm-universe/☆12,061Feb 24, 2026Updated last week
- 仅需Python基础,从0构建大语言模型;从0逐步构建GLM4\Llama3\RWKV6, 深入理解大模型原理☆3,984Aug 15, 2024Updated last year
- HuggingLLM, Hugging Future.☆3,056Aug 30, 2025Updated 6 months ago
- DeepSeek 系列工作解读、扩展和复现。☆699Mar 29, 2025Updated 11 months ago
- 🐳 LeetCode 算法笔记:面试、刷题、学算法。在线阅读地址:https://datawhalechina.github.io/leetcode-notes/☆1,060Oct 9, 2025Updated 4 months ago
- 共学《MCP极简开发》项目代码☆46Jul 21, 2025Updated 7 months ago
- ☆85Apr 9, 2024Updated last year
- 一个构建“听话”提示词的教程☆57Feb 20, 2025Updated last year
- 一份全栈式大语言模型参考指南,用最简洁的代码帮助你端到端定义模型从零训练到工程落地的每一个细节☆130Jan 15, 2026Updated last month
- 本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)☆23,265Updated this week
- ffmpeg+cuvid+tensorrt+multicamera☆12Dec 31, 2024Updated last year
- Building BERT Model with PyTorch☆23Dec 9, 2024Updated last year
- 模型压缩的小白入门教程☆22Jul 7, 2024Updated last year
- Linux操作系统学习笔记☆20Jan 11, 2024Updated 2 years ago
- ☆13Dec 21, 2024Updated last year
- ggml学习笔记,ggml是一个机器学 习的推理框架☆18Mar 24, 2024Updated last year
- The Role Playing Project of Honor-of-Kings Based on LnternLM2。峡谷小狐仙--王者荣耀领域的角色扮演聊天机器人,结合多模态技术将英雄妲己的形象带入大模型中。☆28Jul 24, 2024Updated last year
- ☆103Mar 20, 2024Updated last year
- 本项目为量化开源课程,可以帮助人们快速掌握量化金融知识以及使用Python进行量化开发的能力。☆2,068Jan 15, 2026Updated last month
- ☆287Nov 26, 2025Updated 3 months ago
- Quantize yolov5 using pytorch_quantization.🚀🚀🚀☆14Oct 24, 2023Updated 2 years ago
- 大模型API性能指标比较 - 深入分析TTFT、TPS等关键指标☆20Sep 12, 2024Updated last year
- A simple and trans-platform agent framework and tutorial☆200Jan 17, 2026Updated last month
- ☆33Jul 8, 2025Updated 7 months ago
- Hugging Vision, Hugging AGI.☆178Nov 13, 2025Updated 3 months ago
- 制作懂人情世故的大语言模型 | 涵盖提示词工程、RAG、Agent、LLM微调教程☆1,697Apr 29, 2025Updated 10 months ago
- 动手学Ollama,CPU玩转大模型部署,在线阅读地址:https://datawhalechina.github.io/handy-ollama/☆2,241Jan 15, 2026Updated last month
- Datawhale成员整理的面经,内容包括机器学习,CV,NLP,推荐,开发等,欢迎大家star☆3,539Dec 2, 2025Updated 3 months ago
- 主要记录大语言大模型(LLMs) 算法(应用)工程师相关的知识及面试题☆12,768Apr 30, 2025Updated 10 months ago