ptillet / triton-llvm-releasesLinks
☆22Updated 2 years ago
Alternatives and similar repositories for triton-llvm-releases
Users that are interested in triton-llvm-releases are comparing it to the libraries listed below
Sorting:
- Benchmark tests supporting the TiledCUDA library.☆17Updated last year
- FlexAttention w/ FlashAttention3 Support☆27Updated last year
- APPy (Annotated Parallelism for Python) enables users to annotate loops and tensor expressions in Python with compiler directives akin to…☆25Updated last week
- GPTQ inference TVM kernel☆40Updated last year
- CUDA 12.2 HMM demos☆20Updated last year
- TileFusion is an experimental C++ macro kernel template library that elevates the abstraction level in CUDA C for tile processing.☆100Updated 4 months ago
- Framework to reduce autotune overhead to zero for well known deployments.☆85Updated 2 months ago
- ☆50Updated last year
- PyTorch implementation of the Flash Spectral Transform Unit.☆20Updated last year
- 方便扩展的Cuda算子理解和优化框架,仅用在学习使用☆18Updated last year
- ☆22Updated last year
- Inference framework for MoE layers based on TensorRT with Python binding☆41Updated 4 years ago
- ☆23Updated 6 months ago
- Open deep learning compiler stack for cpu, gpu and specialized accelerators☆19Updated this week
- ☆16Updated last year
- An experimental CPU backend for Triton (https//github.com/openai/triton)☆47Updated 3 months ago
- ☆71Updated 7 months ago
- Ahead of Time (AOT) Triton Math Library☆84Updated last week
- FractalTensor is a programming framework that introduces a novel approach to organizing data in deep neural networks (DNNs) as a list of …☆30Updated 11 months ago
- ☆60Updated this week
- ☆14Updated 3 weeks ago