0xD0GF00D / DocumentSASS
Unofficial description of the CUDA assembly (SASS) instruction sets.
☆142Updated 2 months ago
Alternatives and similar repositories for DocumentSASS
Users that are interested in DocumentSASS are comparing it to the libraries listed below
- ☆107Updated last year
- Dissecting NVIDIA GPU Architecture☆105Updated 3 years ago
- ☆150Updated this week
- ☆278Updated 3 months ago
- ☆43Updated 2 months ago
- A framework that supports executing unmodified CUDA source code on non-NVIDIA devices.☆135Updated 8 months ago
- ☆45Updated 4 years ago
- Assembler for NVIDIA Volta and Turing GPUs☆230Updated 3 years ago
- AMD RAD's experimental RMA library for Triton.☆63Updated this week
- Intel® Extension for MLIR. A staging ground for MLIR dialects and tools for Intel devices using the MLIR toolchain.☆143Updated 2 weeks ago
- TPP experimentation on MLIR for linear algebra☆136Updated last month
- GPUOcelot: A dynamic compilation framework for PTX☆206Updated 7 months ago
- IREE's PyTorch Frontend, based on Torch Dynamo.☆98Updated this week
- A tool for generating information about the matrix multiplication instructions in AMD Radeon™ and AMD Instinct™ accelerators☆114Updated 3 months ago
- MLIR Sample dialect☆129Updated 7 months ago
- An experimental CPU backend for Triton☆148Updated 3 months ago
- GPU Performance Advisor☆66Updated 3 years ago
- Benchmark Framework for Buddy Projects☆55Updated 2 months ago
- A repository where GPU applications are aggregated using a common build flow that supports multiple CUDA versions.☆77Updated last week
- An unofficial cuda assembler, for all generations of SASS, hopefully :)☆534Updated 2 years ago
- ☆54Updated 5 years ago
- amdgpu example code in hip/asm☆42Updated this week
- CUTLASS and CuTe Examples☆75Updated 2 months ago
- tutorials about polyhedral compilation.☆53Updated 7 months ago
- Triton to TVM transpiler.☆22Updated 11 months ago
- CUDA Matrix Multiplication Optimization☆222Updated last year
- LLVM/MLIR based compiler instrumentation of AMD GPU kernels☆20Updated 2 months ago
- Assembler and Decompiler for NVIDIA (Maxwell Pascal Volta Turing Ampere) GPUs.☆84Updated 2 years ago
- ☆38Updated 3 years ago
- A language and compiler for irregular tensor programs.☆149Updated 9 months ago