philipturner / metal-benchmarks
Apple GPU microarchitecture
☆579 · Sep 22, 2024 · Updated last year
Alternatives and similar repositories for metal-benchmarks
Users who are interested in metal-benchmarks are comparing it to the libraries listed below.
- Apple G13 GPU architecture docs and tools ☆642 · May 16, 2025 · Updated 9 months ago
- Print all known information about the GPU on Apple-designed chips ☆96 · Oct 29, 2025 · Updated 3 months ago
- FlashAttention (Metal Port) ☆580 · Sep 22, 2024 · Updated last year
- Apple AMX Instruction Set ☆1,194 · Dec 26, 2024 · Updated last year
- Emulating double-precision arithmetic on Apple GPUs ☆58 · May 17, 2023 · Updated 2 years ago
- Study and Implementations of Numerical Algorithms on Apple M1 and A* Devices ☆149 · Dec 4, 2022 · Updated 3 years ago
- State of the art sorting and segmented sorting, including OneSweep. Implemented in CUDA, D3D12, and Unity style compute shaders. Theoreti… ☆427 · Dec 14, 2024 · Updated last year
- Tool for messing around with Apple GPU assembly ☆27 · Jan 24, 2021 · Updated 5 years ago
- Reverse engineered Linux driver for the Apple Neural Engine (ANE). ☆454 · Mar 12, 2024 · Updated last year
- Extract Metal functions from .metallib files. ☆177 · May 24, 2023 · Updated 2 years ago
- Nanite on macOS ☆65 · May 24, 2023 · Updated 2 years ago
- Everything we actually know about the Apple Neural Engine (ANE) ☆2,360 · Oct 21, 2025 · Updated 3 months ago
- Fundamentals of physically based rendering with Metal 4 ☆21 · Jul 7, 2025 · Updated 7 months ago
- A set of extensions and utilities to work with CoreVideo types. ☆26 · Jul 3, 2024 · Updated last year
- A nearly complete collection of prefix sum algorithms implemented in CUDA, D3D12, Unity and WGPU. Theoretically portable to all wave/warp… ☆281 · Jan 29, 2025 · Updated last year
- Nvidia Instruction Set Specification Generator ☆311 · Jul 9, 2024 · Updated last year
- Renderer for molecular nanotechnology ☆90 · Jan 13, 2026 · Updated last month
- Implementation of "Efficiency-Aware Russian Roulette and Splitting" (SIGGRAPH 2022) ☆49 · Jun 6, 2023 · Updated 2 years ago
- A profiler to disclose and quantify hardware features on GPUs. ☆175 · May 15, 2022 · Updated 3 years ago
- Exploring the scalable matrix extension of the Apple M4 processor ☆221 · Nov 7, 2024 · Updated last year
- A simple demonstration of Metal 3.0 mesh shaders ☆59 · Mar 31, 2023 · Updated 2 years ago
- Dissecting the M1's GPU for 3D acceleration ☆1,019 · Apr 4, 2022 · Updated 3 years ago
- Running linear algebra as fast as possible on Apple silicon ☆28 · Aug 18, 2023 · Updated 2 years ago
- ☆312 · Sep 25, 2025 · Updated 4 months ago
- A C++ Compute/Graphics Library and Toolchain enabling same-source CUDA/Host/Metal/OpenCL/Vulkan C++ programming and execution. ☆338 · Feb 9, 2026 · Updated last week
- Type safety for Metal 🤘 ☆39 · May 2, 2025 · Updated 9 months ago
- Solve Puzzles. Learn Metal 🤘 ☆598 · Sep 24, 2024 · Updated last year
- "Learn Metal with C++" samples, ported to iOS ☆194 · Sep 9, 2025 · Updated 5 months ago
- Source code for the paper "ReSTIR Subsurface Scattering for Real-Time Path Tracing" (HPG 2024) ☆50 · Dec 22, 2024 · Updated last year
- Specification and reference implementation for the OpenPBR Surface shading model ☆657 · Feb 3, 2026 · Updated last week
- Drawing graphics efficiently on Apple Vision using the Metal rendering API ☆300 · Jul 9, 2025 · Updated 7 months ago
- A minimal example of rendering an immersive spatial experience with Metal, ARKit, and visionOS Compositing Services ☆223 · Jun 30, 2024 · Updated last year
- Fast O(1) offset allocator with minimal fragmentation ☆1,013 · Apr 30, 2024 · Updated last year
- Alternate lighting models using custom SCNProgram for SceneKit ☆15 · Jan 25, 2023 · Updated 3 years ago
- A collection of reverse-engineered documentation for the instruction sets for various generations of Mali GPUs. ☆38 · Mar 10, 2018 · Updated 7 years ago
- ☆115 · Jan 18, 2024 · Updated 2 years ago
- Supplemental code accompanying Ray Tracing Gems II, Chapter 14: The Reference Path Tracer ☆219 · May 21, 2025 · Updated 8 months ago
- ☆24 · Mar 6, 2023 · Updated 2 years ago
- MLX: An array framework for Apple silicon ☆23,918 · Updated this week