geohot / cuda_ioctl_sniffer
Sniff CUDA ioctls
☆184Updated last year
Alternatives and similar repositories for cuda_ioctl_sniffer:
Users that are interested in cuda_ioctl_sniffer are comparing it to the libraries listed below
- Enabling tinygrad compatibility with the Google Edge TPU☆75Updated 4 months ago
- ctypes wrappers for HIP, CUDA, and OpenCL☆128Updated 6 months ago
- Nvidia Instruction Set Specification Generator☆235Updated 6 months ago
- ☆418Updated last month
- GPUOcelot: A dynamic compilation framework for PTX☆157Updated 3 weeks ago
- A profiler to disclose and quantify hardware features on GPUs.☆165Updated 2 years ago
- OpenAI Triton backend for Intel® GPUs☆154Updated this week
- It's a core. Made on Twitch.☆255Updated 3 years ago
- A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")☆291Updated this week
- An experimental CPU backend for Triton (https//github.com/openai/triton)☆37Updated 8 months ago
- Tenstorrent MLIR compiler☆85Updated this week
- You like pytorch? You like micrograd? You love tinygrad! ❤️☆49Updated 4 years ago
- Unofficial description of the CUDA assembly (SASS) instruction sets.☆47Updated 2 months ago
- RDNA3 emulator☆49Updated last week
- An unofficial cuda assembler, for all generations of SASS, hopefully :)☆415Updated last year
- ☆99Updated 2 months ago
- ☆228Updated last month
- ☆131Updated this week
- Intel® Extension for MLIR. A staging ground for MLIR dialects and tools for Intel devices using the MLIR toolchain.☆126Updated this week
- rocWMMA☆97Updated this week
- ROCm BLAS marshalling library☆125Updated this week
- Unified compiler/runtime for interfacing with PyTorch Dynamo.☆99Updated this week
- Assembler for NVIDIA Volta and Turing GPUs☆204Updated 3 years ago
- ☆62Updated this week
- ROCm Platform Runtime: ROCr a HPC market enhanced HSA based runtime☆234Updated this week
- The missing pieces (as far as boilerplate reduction goes) of the upstream MLIR python bindings.☆75Updated this week
- A GPU-driven system framework for scalable AI applications☆111Updated 3 months ago
- Development repository for the Triton language and compiler☆102Updated this week
- This is the AMD-maintained fork of the LLVM git repository. This repository accepts pull requests and issues related to AMD fork-specific…☆128Updated this week
- MatMul Performance Benchmarks for a Single CPU Core comparing both hand engineered and codegen kernels.☆127Updated last year