0xD0GF00D / DocumentSASSLinks
Unofficial description of the CUDA assembly (SASS) instruction sets.
☆94Updated 2 months ago
Alternatives and similar repositories for DocumentSASS
Users that are interested in DocumentSASS are comparing it to the libraries listed below
Sorting:
- ☆96Updated last year
- Dissecting NVIDIA GPU Architecture☆95Updated 2 years ago
- ☆52Updated 5 years ago
- Intel® Extension for MLIR. A staging ground for MLIR dialects and tools for Intel devices using the MLIR toolchain.☆135Updated last week
- A highly-flexible GPU simulator for AMD GPUs.☆151Updated this week
- ☆44Updated 4 years ago
- ☆35Updated 3 years ago
- ☆250Updated this week
- Assembler for NVIDIA Volta and Turing GPUs☆218Updated 3 years ago
- ☆39Updated 2 weeks ago
- ☆146Updated this week
- A framework that support executing unmodified CUDA source code on non-NVIDIA devices.☆127Updated 5 months ago
- TPP experimentation on MLIR for linear algebra☆131Updated last week
- ☆100Updated this week
- MatMul Performance Benchmarks for a Single CPU Core comparing both hand engineered and codegen kernels.☆131Updated last year
- GPUOcelot: A dynamic compilation framework for PTX☆192Updated 3 months ago
- MLIR Sample dialect☆123Updated 3 months ago
- Assembler and Decompiler for NVIDIA (Maxwell Pascal Volta Turing Ampere) GPUs.☆79Updated 2 years ago
- IREE plugin repository for the AMD AIE accelerator☆97Updated this week
- Trying to figure various CPU things out☆78Updated last year
- A lightweight, Pythonic, frontend for MLIR☆81Updated last year
- Bridging polyhedral analysis tools to the MLIR framework☆111Updated last year
- tutorials about polyhedral compilation.☆41Updated 4 months ago
- An unofficial cuda assembler, for all generations of SASS, hopefully :)☆506Updated 2 years ago
- An experimental CPU backend for Triton (https//github.com/openai/triton)☆43Updated 2 months ago
- IREE's PyTorch Frontend, based on Torch Dynamo.☆86Updated this week
- An extension library of WMMA API (Tensor Core API)☆97Updated 10 months ago
- The University of Bristol HPC Simulation Engine☆96Updated last week
- GPU Performance Advisor☆65Updated 2 years ago
- GVProf: A Value Profiler for GPU-based Clusters☆49Updated last year