geohot / cuda_ioctl_sniffer
Sniff CUDA ioctls
☆189Updated last year
Alternatives and similar repositories for cuda_ioctl_sniffer:
Users that are interested in cuda_ioctl_sniffer are comparing it to the libraries listed below
- Enabling tinygrad compatibility with the Google Edge TPU☆75Updated 5 months ago
- ctypes wrappers for HIP, CUDA, and OpenCL☆128Updated 7 months ago
- Nvidia Instruction Set Specification Generator☆243Updated 7 months ago
- GPUOcelot: A dynamic compilation framework for PTX☆169Updated last week
- ☆428Updated 2 months ago
- Unofficial description of the CUDA assembly (SASS) instruction sets.☆59Updated 3 months ago
- It's a core. Made on Twitch.☆258Updated 3 years ago
- Intel® Extension for MLIR. A staging ground for MLIR dialects and tools for Intel devices using the MLIR toolchain.☆130Updated this week
- The missing pieces (as far as boilerplate reduction goes) of the upstream MLIR python bindings.☆79Updated this week
- RDNA3 emulator☆51Updated 2 weeks ago
- A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")☆304Updated this week
- ☆233Updated last week
- An unofficial cuda assembler, for all generations of SASS, hopefully :)☆426Updated last year
- A profiler to disclose and quantify hardware features on GPUs.☆166Updated 2 years ago
- This is the AMD-maintained fork of the LLVM git repository. This repository accepts pull requests and issues related to AMD fork-specific…☆135Updated this week
- Letting computers listen to you and really care☆369Updated 2 years ago
- Tenstorrent MLIR compiler☆91Updated this week
- Scripts and environment for the tinybox☆92Updated 9 months ago
- An experimental CPU backend for Triton (https//github.com/openai/triton)☆38Updated 9 months ago
- Infrastructure for Machine Learning Guided Optimization (MLGO) in LLVM.☆657Updated this week
- Provides a set of benchmarks that can be used to measure the memory bandwidth performance of CPU's☆84Updated 10 months ago
- Tutorials on tinygrad☆342Updated this week
- Apple G13 GPU architecture docs and tools☆577Updated 9 months ago
- You like pytorch? You like micrograd? You love tinygrad! ❤️☆49Updated 4 years ago
- ☆137Updated this week
- TPP experimentation on MLIR for linear algebra☆119Updated this week
- High-Performance SGEMM on CUDA devices☆76Updated last month
- CudaPAD is a PTX/SASS viewer for NVIDIA Cuda kernels and provides an on-the-fly view of the assembly.☆113Updated 2 years ago
- Development repository for the Triton language and compiler☆108Updated this week
- Unified compiler/runtime for interfacing with PyTorch Dynamo.☆100Updated this week