InfiniTensor / operators
算子库
☆15Updated last month
Alternatives and similar repositories for operators:
Users that are interested in operators are comparing it to the libraries listed below
- ☆100Updated last week
- ☆226Updated last month
- ☆47Updated 3 months ago
- ☆24Updated 2 months ago
- ☆49Updated last month
- 解读cudnn文档,掌握其用法☆16Updated 10 months ago
- easy cuda code☆66Updated 2 months ago
- 分层解耦的深度学习推理引擎☆72Updated 3 weeks ago
- some hpc project for learning☆20Updated 6 months ago
- ☆22Updated this week
- 先进编译实验室的个人主页☆44Updated last month
- ☆26Updated this week
- CUDA SGEMM optimization note☆13Updated last year
- 晚上下班不刷手机,学点什么。系列一:CUDA 计算框架 CUFX (Cuda Framework eXtended)。☆14Updated 2 months ago
- 《自己动手写AI编译器》☆21Updated 4 months ago
- ☆105Updated 3 months ago
- CUDA 算子手撕与面试指南☆206Updated last month
- Some common CUDA kernel implementations (Not the fastest).☆16Updated 3 weeks ago
- A light llama-like llm inference framework based on the triton kernel.☆96Updated this week
- Personal Notes for Learning HPC & Parallel Computation [Active Adding New Content]☆61Updated 2 years ago
- ☆70Updated last year
- CUDA PTX-ISA Document 中文翻译版☆37Updated 2 months ago
- b站上的课程☆71Updated last year
- A domain-specific language (DSL) based on Triton but providing higher-level abstractions.☆18Updated last week
- A PyTorch-like deep learning framework. Just for fun.☆145Updated last year
- Hands-On Practical MLIR Tutorial☆17Updated 7 months ago
- TileGraph is an experimental DNN compiler that utilizes static code generation and kernel fusion techniques.☆12Updated 5 months ago
- hands on model tuning with TVM and profile it on a Mac M1, x86 CPU, and GTX-1080 GPU.☆45Updated last year
- 使用 CUDA C++ 实现的 llama 模型推理框架☆48Updated 4 months ago
- 笔记☆37Updated last month