Apple GPU microarchitecture
☆579Sep 22, 2024Updated last year
Alternatives and similar repositories for metal-benchmarks
Users that are interested in metal-benchmarks are comparing it to the libraries listed below
Sorting:
- Apple G13 GPU architecture docs and tools☆646May 16, 2025Updated 9 months ago
- Print all known information about the GPU on Apple-designed chips☆97Oct 29, 2025Updated 4 months ago
- FlashAttention (Metal Port)☆589Sep 22, 2024Updated last year
- Apple AMX Instruction Set☆1,198Dec 26, 2024Updated last year
- Emulating double-precision arithmetic on Apple GPUs☆57May 17, 2023Updated 2 years ago
- Study and Implementations of Numerical Algorithms on Apple M1 and A* Devices☆149Dec 4, 2022Updated 3 years ago
- State of the art sorting and segmented sorting, including OneSweep. Implemented in CUDA, D3D12, and Unity style compute shaders. Theoreti…☆431Dec 14, 2024Updated last year
- Tool for messing around with Apple GPU assembly☆27Jan 24, 2021Updated 5 years ago
- Reverse engineered Linux driver for the Apple Neural Engine (ANE).☆474Mar 12, 2024Updated last year
- Extract Metal functions from .metallib files.☆177May 24, 2023Updated 2 years ago
- Library to manipulate Apple Metal Shading Language IR☆57Jun 28, 2025Updated 8 months ago
- Everything we actually know about the Apple Neural Engine (ANE)☆2,408Oct 21, 2025Updated 4 months ago
- Fundamentals of physically based rendering with Metal 4☆26Jul 7, 2025Updated 8 months ago
- A set of extensions and utilities to work with CoreVideo types.☆27Jul 3, 2024Updated last year
- A demonstration of how to play HDR video with Metal and AVFoundation☆23Mar 10, 2025Updated last year
- Reference implementation of the Transformer architecture optimized for Apple Neural Engine (ANE)☆2,698Apr 25, 2023Updated 2 years ago
- A nearly complete collection of prefix sum algorithms implemented in CUDA, D3D12, Unity and WGPU. Theoretically portable to all wave/warp…☆284Jan 29, 2025Updated last year
- Nvidia Instruction Set Specification Generator☆314Jul 9, 2024Updated last year
- Tools and samples for understanding Apple's Metal shading language and its LLVM Bitcode shader files☆87Jun 29, 2023Updated 2 years ago
- Implementation of "Efficiency-Aware Russian Roulette and Splitting" (SIGGRAPH 2022)☆49Jun 6, 2023Updated 2 years ago
- A profiler to disclose and quantify hardware features on GPUs.☆176May 15, 2022Updated 3 years ago
- Exploring the scalable matrix extension of the Apple M4 processor☆222Nov 7, 2024Updated last year
- ☆338Sep 25, 2025Updated 5 months ago
- A simple demonstration of Metal 3.0 mesh shaders☆59Mar 31, 2023Updated 2 years ago
- Dissecting the M1's GPU for 3D acceleration☆1,021Apr 4, 2022Updated 3 years ago
- "Learn Metal with C++" samples, ported to iOS☆198Sep 9, 2025Updated 6 months ago
- Running linear algebra as fast as possible on Apple silicon☆28Aug 18, 2023Updated 2 years ago
- A C++ Compute/Graphics Library and Toolchain enabling same-source CUDA/Host/Metal/OpenCL/Vulkan C++ programming and execution.☆338Mar 1, 2026Updated last week
- Type safety for Metal 🤘☆40May 2, 2025Updated 10 months ago
- A Swift framework that simplifies working with Apple's Metal API.☆79Feb 7, 2025Updated last year
- Solve Puzzles. Learn Metal 🤘☆598Sep 24, 2024Updated last year
- Test Apple Neural Engine☆37Nov 10, 2018Updated 7 years ago
- Source code for the paper "ReSTIR Subsurface Scattering for Real-Time Path Tracing" (HPG 2024)☆51Dec 22, 2024Updated last year
- Specification and reference implementation for the OpenPBR Surface shading model☆660Mar 3, 2026Updated last week
- SH for HLSL 2021☆240Mar 1, 2025Updated last year
- Drawing graphics efficiently on Apple Vision using the Metal rendering API☆301Jul 9, 2025Updated 8 months ago
- A minimal example of rendering an immersive spatial experience with Metal, ARKit, and visionOS Compositing Services☆223Jun 30, 2024Updated last year
- Fast O(1) offset allocator with minimal fragmentation☆1,019Apr 30, 2024Updated last year
- Alternate lighting models using custom SCNProgram for SceneKit☆15Jan 25, 2023Updated 3 years ago