CoffeeBeforeArch / bits_of_architecture
Slides from the "Bits of Architecture" series on YouTube
☆22Updated 2 years ago
Alternatives and similar repositories for bits_of_architecture:
Users that are interested in bits_of_architecture are comparing it to the libraries listed below
- X86 CPU topics overview for developers , oriented towards performance☆197Updated 3 weeks ago
- "Hardware, Software, and Compilers! Oh My!" tutorial files☆16Updated 5 years ago
- A profiler to disclose and quantify hardware features on GPUs.☆167Updated 2 years ago
- Slides and other materials from CppCon2021☆103Updated last year
- ☆91Updated 2 years ago
- Serial and parallel implementations of matrix multiplication☆40Updated 4 years ago
- assembler for NVIDIA FERMI. Imported from Google Code☆72Updated 10 years ago
- A simple trace-based cache simulator☆12Updated 2 months ago
- Code examples for tutoring modern C++☆93Updated last month
- SYCL Reference Manual☆27Updated 11 months ago
- ROB size testing utility☆144Updated 3 years ago
- Simple OpenCL Samples that Build with Khronos Headers and Libs☆99Updated last week
- Companion Repository for the Lecture Slides for the Clang Libraries☆99Updated last week
- Task graph-based asynchronous programming system using C++ coroutine☆89Updated last year
- ☆56Updated last week
- SYCL Conformance Tests☆68Updated last week
- This is a mirror of the official libpfm4 git repository, https://sourceforge.net/p/perfmon2/libpfm4/ci/master/tree/ with some local branc…☆56Updated 5 months ago
- Unofficial description of the CUDA assembly (SASS) instruction sets.☆76Updated 3 weeks ago
- Trying to figure various CPU things out☆75Updated last year
- A simplified cache simulator for instructional purposes☆12Updated 4 years ago
- A framework that support executing unmodified CUDA source code on non-NVIDIA devices.☆117Updated 2 months ago
- ☆51Updated 5 years ago
- Source Code for 'Modern Arm Assembly Language Programming' by Daniel Kusswurm☆89Updated 3 years ago
- A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofline model analysis.☆50Updated last week
- CudaPAD is a PTX/SASS viewer for NVIDIA Cuda kernels and provides an on-the-fly view of the assembly.☆117Updated 2 years ago
- A small library and kernel module for easy access to x86 performance monitor counters under Linux.☆96Updated 11 months ago
- The translator that supports translating NVPTX to SPIR-V. This translator is modified from LLVM-SPIR-V Translator.☆37Updated 3 years ago
- The University of Bristol HPC Simulation Engine☆96Updated 2 weeks ago
- Code for paper "Engineering a High-Performance GPU B-Tree" accepted to PPoPP 2019☆55Updated 2 years ago
- a CUDA implementation of a priority queue☆84Updated 4 years ago