大模型/LLM推理和部署理论与实践
☆402Jul 14, 2025Updated 11 months ago
Alternatives and similar repositories for llm-deploy
Users that are interested in llm-deploy are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 模型压缩的小白入门教程,PDF下载地址 https://github.com/datawhalechina/awesome-compression/releases☆369Nov 25, 2025Updated 6 months ago
- 解锁HuggingFace生态的百般用法☆97Dec 14, 2024Updated last year
- 《大模型白盒子构建指南》:一个全手搓的Tiny-Universe☆4,902Feb 12, 2026Updated 4 months ago
- 通过带领大家解读Transformer模型来加深对模型的理解☆248Jun 3, 2025Updated last year
- wow-fullstack,令人惊叹的全栈开发教程☆279Jun 8, 2026Updated last week
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- 向量检索与 RAG 实践:技术、实现与应用☆164Nov 5, 2024Updated last year
- 《开源大模型食用指南》针对中国宝宝量身打造的基于Linux环境快速微调(全参数/Lora)、部署国内外开源大模型(LLM)/多模态大模型(MLLM)教程☆30,837Jun 3, 2026Updated last week
- 大模型基础: 一文了解大模型基础知识☆7,369Dec 18, 2025Updated 5 months ago
- 本项目是一个面向小白开发者的大模型应用开发教程,在线阅读地址:https://datawhalechina.github.io/llm-universe/☆13,232Feb 24, 2026Updated 3 months ago
- This is a multi agent tutorial based on the CAMEL framework, aimed at understanding how to build an Agent Society from the ground up!☆768Jan 16, 2026Updated 4 months ago
- 仅需Python基础,从0构建大语言模型;从0逐步构建GLM4\Llama3\RWKV6, 深入理解大模型原理☆4,205Mar 26, 2026Updated 2 months ago
- ☆87Apr 9, 2024Updated 2 years ago
- 一个构建“听话”提示词的教程☆59Feb 20, 2025Updated last year
- HuggingLLM, Hugging Future.☆3,065Aug 30, 2025Updated 9 months ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- DeepSeek 系列工作解读、扩展和复现。☆733Mar 9, 2026Updated 3 months ago
- A simple and trans-platform rag framework and tutorial☆232Jan 17, 2026Updated 4 months ago
- Building BERT Model with PyTorch☆22Dec 9, 2024Updated last year
- Hugging Vision, Hugging AGI.☆183Nov 13, 2025Updated 7 months ago
- 本项目旨在分享大模型相 关技术原理以及实战经验(大模型工程化、大模型应用落地)☆24,518May 25, 2026Updated 3 weeks ago
- 🐳 LeetCode 算法笔记:面试、刷题、学算法。在线阅读地址:https://datawhalechina.github.io/leetcode-notes/☆1,115Oct 9, 2025Updated 8 months ago
- A simple and trans-platform agent framework and tutorial☆207Jan 17, 2026Updated 4 months ago
- 共学《MCP极简开发》项目代码☆48Jul 21, 2025Updated 10 months ago
- ffmpeg+cuvid+tensorrt+multicamera☆12Dec 31, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- 动手学Ollama,CPU玩转大模型部署,在线阅读地址:https://datawhalechina.github.io/handy-ollama/☆2,445Jan 15, 2026Updated 5 months ago
- 模型压缩的小白入门教程☆22Jul 7, 2024Updated last year
- 本项目为量化开源课程,可以帮助人们快速掌握量化金融知识以及使用Python进行量化开发的能力。☆2,466Jan 15, 2026Updated 5 months ago
- ☆20Dec 29, 2023Updated 2 years ago
- ☆103Mar 20, 2024Updated 2 years ago
- Linux操作系统学习笔记☆19Jan 11, 2024Updated 2 years ago
- ☆297May 14, 2026Updated last month
- 一份全栈式大语言模型参考指南,用最简洁的代码帮助你端到端定义模型从零训练到工程落地的每一个细节☆191Jan 15, 2026Updated 5 months ago
- Datawhale成员整理的面经,内容包括机器学习,CV,NLP,推荐,开发等,欢迎大家star☆3,712Dec 2, 2025Updated 6 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- 训练一个对中文支持更好的LLaVA模型,并开源训练代码和数据。☆82Sep 6, 2024Updated last year
- 制作懂人情世故的大语言模型 | 涵盖提示词工程、RAG、Agent、LLM微调教程☆1,796Apr 29, 2025Updated last year
- ☆11Dec 21, 2024Updated last year
- 本仓库将带大家从零开始,用pytorch的线性层搭建传统的NLP神经网络☆42Dec 8, 2024Updated last year
- 该项目围绕 Coze 打造 AI 私人提效助理展开,整合实用 AI 工作流并做拆解,同时准备提示词手册和案例手册,旨在展示项目可行性,帮助学习者更好地理解和实操相关技能。☆256Jan 26, 2026Updated 4 months ago
- 本教程将全面指导你如何快速搭建自己的AI应用环境,从Docker桌面版的安装与配置开始,到本地部署Dify并自定义AI助手功能,让你轻松实现“猜病例”、“甜蜜哄人”、“新生入学指南”、“小红书读书卡片”与“面试宝典”等多种特色AI应用。并教会你从基础智能体到使用工作流,再到…☆487Dec 21, 2025Updated 5 months ago
- 面向开发者的 LLM 入门教程,吴恩达大模型系列课程中文版☆24,238Jun 12, 2025Updated last year