lcy-seso / LearningNotes
Ying's notes
☆7Updated last month
Alternatives and similar repositories for LearningNotes:
Users that are interested in LearningNotes are comparing it to the libraries listed below
- ☆68Updated 3 months ago
- Subpart source code of deepcore v0.7☆27Updated 4 years ago
- ☆38Updated 3 years ago
- An extension of TVMScript for writing simple, high-performance GPU kernels with Tensor Cores.☆50Updated 9 months ago
- System for automated integration of deep learning backends.☆47Updated 2 years ago
- Benchmark code for the "Online normalizer calculation for softmax" paper☆91Updated 6 years ago
- A home for the final text of all TVM RFCs.☆102Updated 7 months ago
- ☆43Updated last year
- Mille Crepe Bench: layer-wise performance analysis for deep learning frameworks.☆17Updated 5 years ago
- High performance NCCL plugin for Bagua.☆15Updated 3 years ago
- High-speed GEMV kernels, with up to 2.7x speedup over the PyTorch baseline.☆106Updated 9 months ago
- ☆148Updated 3 months ago
- ☆102Updated last month
- ☆92Updated 7 months ago
- ☆9Updated last year
- play gemm with tvm☆90Updated last year
- FP8 flash attention implemented on the Ada architecture using the CUTLASS library☆63Updated 8 months ago
- DietCode Code Release☆63Updated 2 years ago
- llama INT4 cuda inference with AWQ☆54Updated 3 months ago
- ☆79Updated 2 years ago
- ☆79Updated this week
- ☆23Updated 5 months ago
- A standalone GEMM kernel for fp16 activation and quantized weight, extracted from FasterTransformer☆91Updated 3 weeks ago
- Repository for SysML19 Artifact Evaluation☆53Updated 6 years ago
- Performance of the C++ interface of flash attention and flash attention v2 in large language model (LLM) inference scenarios.☆35Updated 2 months ago
- Automatic Mapping Generation, Verification, and Exploration for ISA-based Spatial Accelerators☆108Updated 2 years ago
- Place for meetup slides☆140Updated 4 years ago
- Dissecting NVIDIA GPU Architecture☆92Updated 2 years ago
- Automatic Schedule Exploration and Optimization Framework for Tensor Computations☆176Updated 3 years ago
- The release repository of superneurons☆52Updated 4 years ago