dougallj / applegpuLinks
Apple G13 GPU architecture docs and tools
☆596Updated last month
Alternatives and similar repositories for applegpu
Users that are interested in applegpu are comparing it to the libraries listed below
Sorting:
- Apple GPU microarchitecture☆530Updated 9 months ago
- Apple AMX Instruction Set☆1,098Updated 6 months ago
- Apple Firestorm/Icestorm CPU microarchitecture docs☆242Updated 2 years ago
- Dissecting the M1's GPU for 3D acceleration☆1,009Updated 3 years ago
- CLI Tools For ANE☆121Updated 4 years ago
- Reverse engineering Rosetta 2 on M1 Mac☆414Updated 3 years ago
- Exploring the scalable matrix extension of the Apple M4 processor☆184Updated 8 months ago
- Nvidia Instruction Set Specification Generator☆280Updated last year
- ☆286Updated 6 months ago
- ☆448Updated 3 months ago
- BLIS fork with kernels for Apple M1. (Perhaps) The first open-source BLAS with Apple Matrix Coprocessor support.☆35Updated 2 years ago
- Kernel extension that enables TSO for Apple silicon processes☆264Updated 2 years ago
- Sniff CUDA ioctls☆196Updated 2 years ago
- It's a core. Made on Twitch.☆261Updated 3 years ago
- Everything we learnt from hacking Arm Mali GPUs.☆189Updated 9 months ago
- Emulating double-precision arithmetic on Apple GPUs☆55Updated 2 years ago
- GPUOcelot: A dynamic compilation framework for PTX☆201Updated 5 months ago
- Run a CoreML MLModel on the Asahi Neural Engine☆54Updated 2 years ago
- Unofficial description of the CUDA assembly (SASS) instruction sets.☆105Updated 4 months ago
- RDNA3 emulator☆54Updated 2 months ago
- A profiler to disclose and quantify hardware features on GPUs.☆172Updated 3 years ago
- Running linear algebra as fast as possible on Apple silicon☆20Updated last year
- Preloader for Linux on M1☆99Updated 4 years ago
- A tool and a library for bi-directional translation between SPIR-V and LLVM IR☆564Updated this week
- Measures microarchitectural details such as ROB size. Like https://github.com/travisdowns/robsize but without runtime code generation, wh…☆129Updated 4 years ago
- A new (MLIR based) high-level IR for clang.☆510Updated this week
- Metal-cpp is a low-overhead C++ interface for Metal that helps developers add Metal functionality to graphics apps, games, and game engin…☆317Updated 6 months ago
- Infrastructure for Machine Learning Guided Optimization (MLGO) in LLVM.☆706Updated last month
- Clspv is a compiler for OpenCL C to Vulkan compute shaders☆675Updated 2 weeks ago
- Test Apple Neural Engine☆36Updated 6 years ago