dougallj / applegpuLinks
Apple G13 GPU architecture docs and tools
☆642Updated 8 months ago
Alternatives and similar repositories for applegpu
Users that are interested in applegpu are comparing it to the libraries listed below
Sorting:
- Apple AMX Instruction Set☆1,193Updated last year
- Apple GPU microarchitecture☆578Updated last year
- Apple Firestorm/Icestorm CPU microarchitecture docs☆251Updated 2 years ago
- Dissecting the M1's GPU for 3D acceleration☆1,018Updated 3 years ago
- Reverse engineered Linux driver for the Apple Neural Engine (ANE).☆454Updated last year
- Exploring the scalable matrix extension of the Apple M4 processor☆221Updated last year
- CLI Tools For ANE☆125Updated 4 years ago
- ☆312Updated 4 months ago
- Reverse engineering Rosetta 2 on M1 Mac☆426Updated 4 years ago
- BLIS fork with kernels for Apple M1. (Perhaps) The first open-source BLAS with Apple Matrix Coprocessor support.☆36Updated 3 years ago
- Nvidia Instruction Set Specification Generator☆311Updated last year
- Kernel extension that enables TSO for Apple silicon processes☆265Updated 2 years ago
- ☆450Updated 10 months ago
- Library to manipulate Apple Metal Shading Language IR☆57Updated 7 months ago
- Run a CoreML MLModel on the Asahi Neural Engine☆60Updated 2 years ago
- Everything we learnt from hacking Arm Mali GPUs.☆209Updated last year
- Emulating double-precision arithmetic on Apple GPUs☆58Updated 2 years ago
- Sniff CUDA ioctls☆224Updated 2 years ago
- Measures microarchitectural details such as ROB size. Like https://github.com/travisdowns/robsize but without runtime code generation, wh…☆132Updated 5 years ago
- Kernel Extension allows to pin thread on a certain cpu core on Apple Silicon machines☆20Updated last year
- A tool and a library for bi-directional translation between SPIR-V and LLVM IR☆590Updated last week
- a quick and dirty little program to convert Apple CoreML model to ANE hwx file☆36Updated 2 weeks ago
- A profiler to disclose and quantify hardware features on GPUs.☆175Updated 3 years ago
- Tool for messing around with Apple GPU assembly☆27Updated 5 years ago
- Instruction latency & throughput profiler for AArch64☆42Updated 5 months ago
- Infrastructure for Machine Learning Guided Optimization (MLGO) in LLVM.☆757Updated this week
- Clspv is a compiler for OpenCL C to Vulkan compute shaders☆701Updated last week
- Running linear algebra as fast as possible on Apple silicon☆28Updated 2 years ago
- Unofficial description of the CUDA assembly (SASS) instruction sets.☆201Updated 6 months ago
- GPUOcelot: A dynamic compilation framework for PTX☆219Updated last year