dougallj / applegpu
Apple G13 GPU architecture docs and tools
☆579Updated this week
Alternatives and similar repositories for applegpu:
Users that are interested in applegpu are comparing it to the libraries listed below
- Apple GPU microarchitecture☆504Updated 6 months ago
- Apple Firestorm/Icestorm CPU microarchitecture docs☆237Updated last year
- Apple AMX Instruction Set☆1,058Updated 2 months ago
- Dissecting the M1's GPU for 3D acceleration☆1,002Updated 2 years ago
- CLI Tools For ANE☆118Updated 3 years ago
- Exploring the scalable matrix extension of the Apple M4 processor☆168Updated 4 months ago
- Reverse engineered Linux driver for the Apple Neural Engine (ANE).☆399Updated last year
- Nvidia Instruction Set Specification Generator☆253Updated 8 months ago
- Reverse engineering Rosetta 2 on M1 Mac☆394Updated 3 years ago
- ☆437Updated last week
- Kernel extension that enables TSO for Apple silicon processes☆259Updated last year
- ☆273Updated 2 months ago
- Sniff CUDA ioctls☆190Updated last year
- It's a core. Made on Twitch.☆258Updated 3 years ago
- BLIS fork with kernels for Apple M1. (Perhaps) The first open-source BLAS with Apple Matrix Coprocessor support.☆33Updated 2 years ago
- Clspv is a compiler for OpenCL C to Vulkan compute shaders☆661Updated last week
- Run a CoreML MLModel on the Asahi Neural Engine☆47Updated last year
- Preloader for Linux on M1☆99Updated 4 years ago
- Study and Implementations of Numerical Algorithms on Apple M1 and A* Devices☆135Updated 2 years ago
- A profiler to disclose and quantify hardware features on GPUs.☆167Updated 2 years ago
- Documentation of NVIDIA chip/hardware interfaces☆1,269Updated 6 months ago
- Enabling tinygrad compatibility with the Google Edge TPU☆76Updated 6 months ago
- A tool and a library for bi-directional translation between SPIR-V and LLVM IR☆530Updated this week
- Measures microarchitectural details such as ROB size. Like https://github.com/travisdowns/robsize but without runtime code generation, wh…☆128Updated 4 years ago
- GPUOcelot: A dynamic compilation framework for PTX☆181Updated last month
- ctypes wrappers for HIP, CUDA, and OpenCL☆129Updated 8 months ago
- A CLI for extracting libraries from Apple's dyld shared cache file☆483Updated last year
- Emulating double-precision arithmetic on Apple GPUs☆49Updated last year
- Everything we learnt from hacking Arm Mali GPUs.☆170Updated 5 months ago
- Unofficial description of the CUDA assembly (SASS) instruction sets.☆73Updated 2 weeks ago