geohot / cuda_ioctl_sniffer
Sniff CUDA ioctls
☆178Updated last year
Related projects ⓘ
Alternatives and complementary repositories for cuda_ioctl_sniffer
- ctypes wrappers for HIP, CUDA, and OpenCL☆126Updated 4 months ago
- GPUOcelot: A dynamic compilation framework for PTX☆147Updated last month
- Enabling tinygrad compatibility with the Google Edge TPU☆75Updated 2 months ago
- ☆382Updated last week
- Nvidia Instruction Set Specification Generator☆216Updated 4 months ago
- It's a core. Made on Twitch.☆251Updated 3 years ago
- A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")☆270Updated this week
- Scripts and environment for the tinybox☆92Updated 6 months ago
- Intel® Extension for MLIR. A staging ground for MLIR dialects and tools for Intel devices using the MLIR toolchain.☆124Updated this week
- RDNA3 emulator☆46Updated last week
- ☆128Updated this week
- IREE's PyTorch Frontend, based on Torch Dynamo.☆55Updated this week
- Tutorials on tinygrad☆180Updated last week
- An unofficial cuda assembler, for all generations of SASS, hopefully :)☆405Updated last year
- Apple GPU microarchitecture☆473Updated last month
- The missing pieces (as far as boilerplate reduction goes) of the upstream MLIR python bindings.☆69Updated last week
- Unofficial description of the CUDA assembly (SASS) instruction sets.☆39Updated 3 weeks ago
- ☆88Updated this week
- rocWMMA☆91Updated this week
- You like pytorch? You like micrograd? You love tinygrad! ❤️☆49Updated 3 years ago
- ROCm BLAS marshalling library☆118Updated this week
- ☆224Updated 2 months ago
- Letting computers listen to you and really care☆367Updated 2 years ago
- An experimental CPU backend for Triton (https//github.com/openai/triton)☆35Updated 6 months ago
- Samples demonstrating how to use the Compute Sanitizer Tools and Public API☆67Updated last year
- Because tinygrad got out of hand with line count☆145Updated last month
- TPP experimentation on MLIR for linear algebra☆110Updated last month
- An experimental CPU backend for Triton☆56Updated last week
- C/C++ frontend for MLIR. Also features polyhedral optimizations, parallel optimizations, and more!☆488Updated last month
- Open source cross-platform compiler for compute-intensive loops used in AI algorithms, from Microsoft Research☆103Updated last year