ForceInjection/AI-fundamentals

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ForceInjection/AI-fundamentals)

ForceInjection / AI-fundamentals

AI 基础知识 - GPU 架构、CUDA 编程、大模型基础及AI Agent 相关知识。

☆1,972

Alternatives and similar repositories for AI-fundamentals

Users that are interested in AI-fundamentals are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Infrasys-AI / AIInfra
View on GitHub
AIInfra（AI 基础设施）指AI系统从底层芯片等硬件，到上层软件栈支持AI大模型训练和推理。
☆7,714Dec 22, 2025Updated 7 months ago
CalvinXKY / InfraTech
View on GitHub
分享AI Infra知识&代码练习：PyTorch、vLLM/SGLang、slime/vime框架入门⚡️、性能加速🚀、大模型基础🧠、AI软硬件🔧等
☆3,135Updated this week
kvcache-ai / Mooncake
View on GitHub
Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
☆5,999Updated this week
xlite-dev / LeetCUDA
View on GitHub
📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉
☆11,631Updated this week
GeeeekExplorer / nano-vllm
View on GitHub
Nano vLLM
☆14,635Apr 26, 2026Updated 3 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
ForceInjection / kubernetes-hands-on-course
View on GitHub
Kubernetes 动手教程
☆17May 18, 2026Updated 2 months ago
Project-HAMi / HAMi
View on GitHub
Heterogeneous GPU Sharing on Kubernetes
☆4,064Updated this week
cr7258 / ai-infra-learning
View on GitHub
This repository organizes materials, recordings, and schedules related to AI-infra learning meetings.
☆531Mar 1, 2026Updated 4 months ago
pacoxu / AI-Infra
View on GitHub
init to record my learning path of AI Infra, especially on inference.
☆247Updated this week
jinbooooom / ai-infra-hpc
View on GitHub
hpc 教程，包含集合通信(mpi、nccl)、cuda 编程、向量化 SIMD、RDMA 通信等
☆616Apr 27, 2026Updated 2 months ago
zhaochenyang20 / Awesome-ML-SYS-Tutorial
View on GitHub
My learning notes for ML SYS.
☆6,772Updated this week
sgl-project / mini-sglang
View on GitHub
A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.
☆4,628May 17, 2026Updated 2 months ago
ai-dynamo / dynamo
View on GitHub
A Datacenter Scale Distributed Inference Serving Framework
☆7,580Updated this week
Infrasys-AI / AISystem
View on GitHub
AISystem 主要是指AI系统，包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术
☆17,280Sep 3, 2025Updated 10 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
liguodongiot / llm-action
View on GitHub
本项目旨在分享大模型相关技术原理以及实战经验（大模型工程化、大模型应用落地）
☆24,800Jul 19, 2026Updated last week
kubernetes-sigs / lws
View on GitHub
LeaderWorkerSet: An API for deploying a group of pods as a unit of replication
☆769Updated this week
sgl-project / sglang
View on GitHub
SGLang is a high-performance serving framework for large language models and multimodal models.
☆30,733Updated this week
PaddleJitLab / CUDATutorial
View on GitHub
A self-learning tutorail for CUDA High Performance Programing.
☆1,052Jan 14, 2026Updated 6 months ago
LMCache / LMCache
View on GitHub
LMCache: Supercharge Your LLM with the Fastest KV Cache Layer
☆10,880Updated this week
InftyAI / Manta
View on GitHub
💫 A lightweight p2p-based cache system for model distributions on Kubernetes. Reframing now to make it an unified cache system with POSI…
☆27Dec 6, 2024Updated last year
flashinfer-ai / flashinfer
View on GitHub
FlashInfer: Kernel Library for LLM Serving
☆6,032Updated this week
sgl-project / rbg
View on GitHub
A workload for deploying LLM inference services on Kubernetes
☆263Updated this week
CalvinXKY / BasicCUDA
View on GitHub
A tutorial for CUDA&PyTorch
☆478Mar 23, 2026Updated 4 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
tile-ai / tilelang
View on GitHub
Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels
☆6,908Updated this week
taco-project / FlexKV
View on GitHub
☆307Updated this week
volcano-sh / kthena
View on GitHub
Kubernetes-native AI serving platform for scalable model serving.
☆396Updated this week
vllm-project / vllm
View on GitHub
A high-throughput and memory-efficient inference and serving engine for LLMs
☆87,138Updated this week
wdndev / llm_interview_note
View on GitHub
主要记录大语言大模型（LLMs）算法（应用）工程师相关的知识及面试题
☆14,759Jun 14, 2026Updated last month
jingyaogong / minimind
View on GitHub
🧠「大模型」2小时完全从0训练64M的小参数LLM！Train a 64M-parameter LLM from scratch in just 2h!
☆53,838Updated this week
volcano-sh / volcano
View on GitHub
A Cloud Native Batch System (Project under CNCF)
☆5,807Updated this week
Tongkaio / CUDA_Kernel_Samples
View on GitHub
CUDA 算子手撕与面试指南
☆1,044Aug 23, 2025Updated 11 months ago
datawhalechina / self-llm
View on GitHub
《开源大模型食用指南》针对中国宝宝量身打造的基于Linux环境快速微调（全参数/Lora）、部署国内外开源大模型（LLM）/多模态大模型（MLLM）教程
☆31,416Jul 15, 2026Updated last week
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
llm-d / llm-d
View on GitHub
Achieve state of the art inference performance with modern accelerators on Kubernetes
☆3,875Updated this week
ai-dynamo / nixl
View on GitHub
NVIDIA Inference Xfer Library (NIXL)
☆1,151Updated this week
vllm-project / vllm-ascend
View on GitHub
Community maintained hardware plugin for vLLM on Ascend
☆2,478Updated this week
luhengshiwo / LLMForEverybody
View on GitHub
每个人都能看懂的大模型知识分享，LLMs春/秋招大模型面试前必看，让你和面试官侃侃而谈
☆7,017May 31, 2026Updated last month
bytedance / InfiniStore
View on GitHub
KV cache store for distributed LLM inference
☆425Nov 13, 2025Updated 8 months ago
datawhalechina / hello-agents
View on GitHub
📚 《从零开始构建智能体》——从零开始的智能体原理与实践教程
☆68,564Jul 17, 2026Updated last week
nicexlab / GeminiFS
View on GitHub
GeminiFS: A Companion File System for GPUs
☆84Jul 8, 2026Updated 2 weeks ago