geohot / cuda_ioctl_sniffer
Sniff CUDA ioctls
☆192 Updated 2 years ago
Alternatives and similar repositories for cuda_ioctl_sniffer:
Users who are interested in cuda_ioctl_sniffer are comparing it to the repositories listed below
- Enabling tinygrad compatibility with the Google Edge TPU ☆77 Updated 8 months ago
- GPUOcelot: A dynamic compilation framework for PTX ☆187 Updated 3 months ago
- ctypes wrappers for HIP, CUDA, and OpenCL ☆129 Updated 10 months ago
- Nvidia Instruction Set Specification Generator ☆260 Updated 10 months ago
- It's a core. Made on Twitch. ☆259 Updated 3 years ago
- RDNA3 emulator ☆54 Updated 3 weeks ago
- An unofficial cuda assembler, for all generations of SASS, hopefully :) ☆493 Updated 2 years ago
- A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser") ☆324 Updated this week
- CudaPAD is a PTX/SASS viewer for NVIDIA Cuda kernels and provides an on-the-fly view of the assembly. ☆118 Updated 2 years ago
- This is the AMD-maintained fork of the LLVM git repository. This repository accepts pull requests and issues related to AMD fork-specific… ☆143 Updated this week
- Custom PTX Instruction Benchmark ☆123 Updated 2 months ago
- Unofficial description of the CUDA assembly (SASS) instruction sets. ☆91 Updated last month
- ROCm Communication Collectives Library (RCCL) ☆330 Updated this week
- Letting computers listen to you and really care ☆370 Updated 2 years ago
- ☆244 Updated 2 months ago
- ☆106 Updated last month
- Unpacking AMD's dkms packages ☆27 Updated last year
- You like pytorch? You like micrograd? You love tinygrad! ❤️ ☆49 Updated 4 years ago
- ROCm Platform Runtime: ROCr a HPC market enhanced HSA based runtime ☆247 Updated last week
- A profiler to disclose and quantify hardware features on GPUs. ☆168 Updated 2 years ago
- ☆142 Updated this week
- Tenstorrent MLIR compiler ☆122 Updated this week
- CUDA checkpoint and restore utility ☆330 Updated 3 months ago
- Development repository for the Triton language and compiler ☆118 Updated this week
- Super fast FP32 matrix multiplication on RDNA3 ☆49 Updated last month
- Apple G13 GPU architecture docs and tools ☆585 Updated last month
- rocWMMA ☆110 Updated last week
- An experimental CPU backend for Triton (https://github.com/openai/triton) ☆40 Updated last month
- AMDGPU Driver with KFD used by the ROCm project. Also contains the current Linux Kernel that matches this base driver ☆359 Updated 3 weeks ago
- Assembler for NVIDIA Volta and Turing GPUs ☆218 Updated 3 years ago