CoffeeBeforeArch / bits_of_architecture
Slides from the "Bits of Architecture" series on YouTube
☆28 Updated 3 years ago
Alternatives and similar repositories for bits_of_architecture
Users that are interested in bits_of_architecture are comparing it to the libraries listed below
- ☆121 Updated 2 years ago
- Omnitrace: Application Profiling, Tracing, and Analysis ☆340 Updated this week
- X86 CPU topics overview for developers, oriented towards performance ☆203 Updated last week
- Learn LLVM 17, published by Packt ☆212 Updated last year
- MLIR Sample dialect ☆135 Updated 2 weeks ago
- The University of Bristol HPC Simulation Engine ☆104 Updated 4 months ago
- Unofficial description of the CUDA assembly (SASS) instruction sets. ☆193 Updated 5 months ago
- A repository where GPU applications are aggregated using a common build flow that supports multiple CUDA versions. ☆87 Updated 2 months ago
- Source code for 'Data Parallel C++: Mastering DPC++ for Programming of Heterogeneous Systems using C++ and SYCL' by James Reinders, Ben A… ☆282 Updated 9 months ago
- This is the AMD-maintained fork of the LLVM git repository. This repository accepts pull requests and issues related to AMD fork-specific… ☆197 Updated this week
- Demonstration of various hardware effects on CUDA GPUs. ☆390 Updated 2 years ago
- ☆161 Updated this week
- A highly-flexible GPU simulator for AMD GPUs. ☆207 Updated this week
- MLIR-based toolkit targeting Intel heterogeneous hardware ☆49 Updated 10 months ago
- ☆197 Updated this week
- A collection of performance analysis tools, recipes, handy scripts, microbenchmarks & more ☆143 Updated 6 months ago
- Slides and other materials from CppCon2021 ☆118 Updated 2 years ago
- Intel® Extension for MLIR. A staging ground for MLIR dialects and tools for Intel devices using the MLIR toolchain. ☆147 Updated 3 weeks ago
- Multi-GPU dynamic scheduler using PGAS style cross-GPU communication ☆29 Updated 2 years ago
- A framework that supports executing unmodified CUDA source code on non-NVIDIA devices. ☆139 Updated last year
- Trying to figure various CPU things out ☆90 Updated 3 weeks ago
- Serial and parallel implementations of matrix multiplication ☆44 Updated 4 years ago
- Companion Repository for the Lecture Slides for the Clang Libraries ☆122 Updated 3 months ago
- ☆25 Updated last year
- Utilities to measure read access times of caches, memory, and hardware prefetches for simple and fused operations ☆85 Updated 2 years ago
- AMD lab notes with code examples to demonstrate use of AMD GPUs ☆109 Updated last year
- Conversions to MLIR EmitC ☆134 Updated last year
- [DEPRECATED] Moved to ROCm/rocm-systems repo ☆31 Updated this week
- A profiler to disclose and quantify hardware features on GPUs. ☆175 Updated 3 years ago
- Tutorial on building a GPU compiler backend in LLVM ☆50 Updated last year