geohot / cuda_ioctl_snifferLinks
Sniff CUDA ioctls
☆205Updated 2 years ago
Alternatives and similar repositories for cuda_ioctl_sniffer
Users that are interested in cuda_ioctl_sniffer are comparing it to the libraries listed below
Sorting:
- ctypes wrappers for HIP, CUDA, and OpenCL☆130Updated last year
- Nvidia Instruction Set Specification Generator☆285Updated last year
- GPUOcelot: A dynamic compilation framework for PTX☆204Updated 5 months ago
- Enabling tinygrad compatibility with the Google Edge TPU☆78Updated 11 months ago
- ☆449Updated 3 months ago
- ☆53Updated this week
- RDNA3 emulator☆54Updated 3 months ago
- Infrastructure for Machine Learning Guided Optimization (MLGO) in LLVM.☆709Updated this week
- Super fast FP32 matrix multiplication on RDNA3☆70Updated 4 months ago
- Unofficial description of the CUDA assembly (SASS) instruction sets.☆124Updated 2 weeks ago
- A profiler to disclose and quantify hardware features on GPUs.☆173Updated 3 years ago
- ☆115Updated this week
- Custom PTX Instruction Benchmark☆126Updated 5 months ago
- An experimental CPU backend for Triton (https//github.com/openai/triton)☆43Updated 4 months ago
- A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")☆345Updated this week
- Apple G13 GPU architecture docs and tools☆597Updated 2 months ago
- Apple AMX Instruction Set☆1,121Updated 7 months ago
- Tenstorrent MLIR compiler☆165Updated this week
- Intel® Extension for MLIR. A staging ground for MLIR dialects and tools for Intel devices using the MLIR toolchain.☆138Updated this week
- It's a core. Made on Twitch.☆261Updated 3 years ago
- Learning about CUDA by writing PTX code.☆133Updated last year
- Tenstorrent's MLIR Based Compiler. We aim to enable developers to run AI on all configurations of Tenstorrent hardware, through an open-s…☆96Updated this week
- Apple GPU microarchitecture☆540Updated 10 months ago
- Samples demonstrating how to use the Compute Sanitizer Tools and Public API☆85Updated last year
- Apple Firestorm/Icestorm CPU microarchitecture docs☆241Updated 2 years ago
- An unofficial cuda assembler, for all generations of SASS, hopefully :)☆525Updated 2 years ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆111Updated this week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆145Updated this week
- ☆148Updated this week
- High-Performance SGEMM on CUDA devices☆98Updated 6 months ago