dougallj / applegpuLinks
Apple G13 GPU architecture docs and tools
☆616Updated 4 months ago
Alternatives and similar repositories for applegpu
Users that are interested in applegpu are comparing it to the libraries listed below
Sorting:
- Apple GPU microarchitecture☆550Updated last year
- Apple AMX Instruction Set☆1,152Updated 9 months ago
- Apple Firestorm/Icestorm CPU microarchitecture docs☆243Updated 2 years ago
- Dissecting the M1's GPU for 3D acceleration☆1,012Updated 3 years ago
- Reverse engineered Linux driver for the Apple Neural Engine (ANE).☆424Updated last year
- Reverse engineering Rosetta 2 on M1 Mac☆418Updated 4 years ago
- CLI Tools For ANE☆121Updated 4 years ago
- Exploring the scalable matrix extension of the Apple M4 processor☆204Updated 11 months ago
- ☆299Updated last week
- Nvidia Instruction Set Specification Generator☆293Updated last year
- Library to manipulate Apple Metal Shading Language IR☆54Updated 3 months ago
- Kernel extension that enables TSO for Apple silicon processes☆265Updated 2 years ago
- ☆449Updated 6 months ago
- BLIS fork with kernels for Apple M1. (Perhaps) The first open-source BLAS with Apple Matrix Coprocessor support.☆35Updated 2 years ago
- Run a CoreML MLModel on the Asahi Neural Engine☆55Updated 2 years ago
- Clspv is a compiler for OpenCL C to Vulkan compute shaders☆689Updated last week
- Everything we learnt from hacking Arm Mali GPUs.☆199Updated last year
- Sniff CUDA ioctls☆211Updated 2 years ago
- Measures microarchitectural details such as ROB size. Like https://github.com/travisdowns/robsize but without runtime code generation, wh…☆131Updated 4 years ago
- Emulating double-precision arithmetic on Apple GPUs☆55Updated 2 years ago
- A profiler to disclose and quantify hardware features on GPUs.☆174Updated 3 years ago
- Tools for people envious of nvidia's blob driver.☆487Updated last year
- Instruction latency & throughput profiler for AArch64☆39Updated last month
- RDNA3 emulator☆54Updated 5 months ago
- It's a core. Made on Twitch.☆264Updated 3 years ago
- A tool and a library for bi-directional translation between SPIR-V and LLVM IR☆578Updated last week
- Study and Implementations of Numerical Algorithms on Apple M1 and A* Devices☆145Updated 2 years ago
- Kernel Extension allows to pin thread on a certain cpu core on Apple Silicon machines☆19Updated 10 months ago
- GPUOcelot: A dynamic compilation framework for PTX☆209Updated 7 months ago
- uops.info Code Analyzer☆290Updated last year