FelixFu520 / README
A pupil in the computer world.(Felix Fu)
☆212Updated 8 months ago
Alternatives and similar repositories for README:
Users that are interested in README are comparing it to the libraries listed below
- A tutorial for CUDA&PyTorch☆126Updated 3 weeks ago
- llm theoretical performance analysis tools and support params, flops, memory and latency analysis.☆77Updated last month
- ☆108Updated 10 months ago
- Compare different hardware platforms via the Roofline Model for LLM inference tasks.☆93Updated 11 months ago
- Triton Documentation in Chinese Simplified / Triton 中文文档☆53Updated last month
- learning how CUDA works☆197Updated 6 months ago
- Summary of some awesome work for optimizing LLM inference☆56Updated last week
- FlagScale is a large model toolkit based on open-sourced projects.☆217Updated this week
- Inference code for LLaMA models☆113Updated last year
- 《CUDA编程基础与实践》一书的代码☆106Updated 2 years ago
- A minimalist and extensible PyTorch extension for implementing custom backend operators in PyTorch.☆31Updated 10 months ago
- how to learn PyTorch and OneFlow☆392Updated 10 months ago
- CUDA 算子手撕与面试指南☆147Updated last month
- This repository is established to store personal notes and annotated papers during daily research.☆109Updated this week
- 使用 CUDA C++ 实现的 llama 模型推理框架☆44Updated 3 months ago
- LLM101n: Let's build a Storyteller 中文版☆124Updated 6 months ago
- 高性能计算课程&CUDA编程实例&深度学习推理框架☆36Updated last year
- Official implementation of ICML 2024 paper "ExCP: Extreme LLM Checkpoint Compression via Weight-Momentum Joint Shrinking".☆46Updated 7 months ago
- Tutorials for writing high-performance GPU operators in AI frameworks.☆128Updated last year
- Ascend PyTorch adapter (torch_npu). Mirror of https://gitee.com/ascend/pytorch☆299Updated this week
- ☆120Updated last year
- ☆97Updated 6 months ago
- pytorch distribute tutorials☆103Updated this week
- 模型压缩的小白入门教程☆236Updated 2 months ago
- ☆13Updated last year
- LLM notes, including model inference, transformer model structure, and llm framework code analysis notes.☆473Updated this week
- A light llama-like llm inference framework based on the triton kernel.☆83Updated this week
- LLM Inference benchmark☆391Updated 6 months ago
- ☆33Updated last year