ForceInjection / AI-fundermentals
AI fundamentals: GPU architecture, CUDA programming, and large-model fundamentals
☆148Updated last week
Alternatives and similar repositories for AI-fundermentals
Users that are interested in AI-fundermentals are comparing it to the libraries listed below
- ☆455Updated this week
- A self-learning tutorial for CUDA High Performance Programming.☆674Updated last week
- how to learn PyTorch and OneFlow☆441Updated last year
- A high-performance deep learning training platform with task-level time-sharing scheduling of GPU compute☆668Updated last year
- GLake: optimizing GPU memory management and IO transmission.☆470Updated 3 months ago
- LLM notes, including model inference, transformer model structure, and llm framework code analysis notes.☆794Updated this week
- This repository organizes materials, recordings, and schedules related to AI-infra learning meetings.☆49Updated this week
- ☆317Updated last week
- KV cache store for distributed LLM inference☆288Updated last month
- UltraScale Playbook, Chinese edition☆45Updated 3 months ago
- Efficient and easy multi-instance LLM serving☆445Updated this week
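- A good project for campus recruiting (autumn and spring hiring) and internships: build, from scratch, an LLM inference framework that supports LLama2/3 and Qwen2.5.☆381Updated last week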
- The repository has collected a batch of noteworthy MLSys bloggers (Algorithms/Systems)☆249Updated 6 months ago
- A hands-on tutorial that teaches you to write a GPT from scratch and train a large language model☆82Updated 5 months ago
- This repo archives my notes, code, and materials from CS learning.☆38Updated this week
- A prefill & decode disaggregated LLM serving framework with shared GPU memory and fine-grained compute isolation.☆93Updated last month
- ☆123Updated 4 months ago
- Materials for learning SGLang☆475Updated this week
- A guide to hand-writing CUDA operators and preparing for interviews☆461Updated 5 months ago
- RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.☆809Updated last month
- A pupil in the computer world. (Felix Fu)☆238Updated last year
- Community maintained hardware plugin for vLLM on Ascend☆865Updated this week
- ☆279Updated 9 months ago
- FlagScale is a large model toolkit based on open-sourced projects.☆321Updated this week
- Learning Machine Learning, The Chinese Taoist Way☆428Updated 5 years ago
- Triton documentation in Simplified Chinese☆73Updated 2 months ago
- A lightweight Llama-like LLM inference framework based on Triton kernels.☆133Updated last week
- Disaggregated serving system for Large Language Models (LLMs).☆639Updated 3 months ago
- LLM Inference benchmark☆421Updated 11 months ago
- A curated collection of high-quality full-stack LLM resources☆585Updated this week