chenzomi12 / chenzomi12.github.io

☆200

Related projects

Alternatives and complementary repositories for chenzomi12.github.io

chenzomi12 / AIFoundation
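AIFoundation covers the full-stack core technologies by which AI systems support large-model training and inference at the system level, from the lowest layers up.
☆275 Updated last month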
zjhellofss / KuiperLLama
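A good project for campus recruiting (autumn and spring hiring) and internships: build, hands-on and from scratch, a large-model inference framework that supports LLama2/3 and Qwen2.5.
☆220 Updated this week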
datawhalechina / awesome-compression
A beginner's introductory tutorial on model compression
☆184 Updated this week
ifromeast / cuda_learning
learning how CUDA works
☆162 Updated 2 months ago
sunkx109 / llama
Inference code for LLaMA models
☆107 Updated last year
RussWong / CUDATutorial
A CUDA tutorial for learning CUDA programming from scratch
☆195 Updated 4 months ago
SmartFlowAI / LLM101n-CN
LLM101n: Let's build a Storyteller (Chinese edition)
☆116 Updated 2 months ago
ModelTC / llmc
[EMNLP 2024 Industry Track] This is the official PyTorch implementation of "LLMC: Benchmarking Large Language Model Quantization with a V…
☆315 Updated this week
harleyszhang / dl_note
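Deep learning systems notes, covering the mathematical foundations of deep learning, detailed explanations of basic neural network components, deep learning training and tuning strategies, and detailed explanations of model compression algorithms.
☆381 Updated last week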
pcg-mlp / KsanaLLM
☆282 Updated last week
BBuf / how-to-learn-deep-learning-framework
how to learn PyTorch and OneFlow
☆347 Updated 7 months ago
PaddleJitLab / CUDATutorial
A self-learning tutorial for CUDA High Performance Programming.
☆246 Updated this week
CalvinXKY / BasicCUDA
A tutorial for CUDA&PyTorch
☆117 Updated last week
zjhellofss / kuiperdatawhale
☆220 Updated last month
wangzhaode / llm-export
llm-export can export LLM models to ONNX.
☆226 Updated this week
OpenPPL / ppl.llm.serving
☆123 Updated this week
Eddie-Wang1120 / Professional-CUDA-C-Programming-Code-and-Notes
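Code implementations for Professional CUDA C Programming (CUDA C 编程权威指南), covering most of the code from chapters 2 through 8 of the book together with the author's notes. Everything was implemented by hand by the author, so mistakes are inevitable; please use it with caution, and corrections are very welcome. If it helps, please Star it; it helps the author a lot. Thanks!
☆279 Updated 2 years ago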
MAhaitao999 / CUDA_Programming
Code for the book 《CUDA编程基础与实践》 (CUDA Programming: Basics and Practice)
☆94 Updated 2 years ago
sunkx109 / llama.cpp
llama 2 Inference
☆37 Updated last year
alibaba / rtp-llm
RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.
☆541 Updated 3 weeks ago
Eddie-Wang1120 / HPC-Learning-Notes
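Study notes on high-performance computing, including notes and code demos for related topics; still being improved. If it helps, please Star it; it helps the author a lot. Thanks!
☆371 Updated last year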
FlagOpen / FlagGems
FlagGems is an operator library for large language models implemented in Triton Language.
☆329 Updated this week
QINZHAOYU / CudaSteps
A CUDA learning path based on 《cuda编程-基础与实践》 (CUDA Programming: Basics and Practice) by 樊哲勇 (Fan Zheyong).
☆246 Updated 9 months ago
mlc-ai / mlc-zh
☆588 Updated 5 months ago
DeepLink-org / DIOPI
☆68 Updated 3 weeks ago
wdndev / llama3-from-scratch-zh
Implementing llama3 from scratch, Chinese edition
☆531 Updated 4 months ago
intelligent-machine-learning / glake
GLake: optimizing GPU memory management and IO transmission.
☆376 Updated 3 months ago
OpenPPL / ppl.llm.kernel.cuda
☆136 Updated this week
OpenPPL / ppl.nn.llm
☆140 Updated 6 months ago
owenliang / qwen-vllm
Demo of Tongyi Qianwen (Qwen) inference deployment with VLLM
☆438 Updated 7 months ago