CoffeeBeforeArch / bits_of_architectureLinks
Slides from the "Bits of Architecture" series on YouTube
☆28Updated 3 years ago
Alternatives and similar repositories for bits_of_architecture
Users that are interested in bits_of_architecture are comparing it to the libraries listed below
Sorting:
- X86 CPU topics overview for developers , oriented towards performance☆202Updated 8 months ago
- ☆116Updated 2 years ago
- Omnitrace: Application Profiling, Tracing, and Analysis☆334Updated last week
- Slides and other materials from CppCon 2022☆557Updated 2 months ago
- Unofficial description of the CUDA assembly (SASS) instruction sets.☆160Updated 4 months ago
- Slides and other materials from CppCon2021☆116Updated 2 years ago
- Code examples for tutoring modern C++☆100Updated 3 months ago
- Advanced Matrix Extensions (AMX) Guide☆105Updated 3 years ago
- Demonstration of various hardware effects on CUDA GPUs.☆389Updated last year
- Learn LLVM 17, published by Packt☆209Updated last year
- Graphics Processing Unit (GPU) Architecture Guide☆248Updated 3 years ago
- A simple trace-based cache simulator☆16Updated 10 months ago
- A profiler to disclose and quantify hardware features on GPUs.☆174Updated 3 years ago
- Examples from the "C++ From Scratch" Series☆99Updated 2 years ago
- PTX-EMU is a simple emulator for CUDA program.☆38Updated 6 months ago
- Multi-GPU dynamic scheduler using PGAS style cross-GPU communication☆29Updated 2 years ago
- ☆203Updated 2 months ago
- Source code for 'Data Parallel C++: Mastering DPC++ for Programming of Heterogeneous Systems using C++ and SYCL' by James Reinders, Ben A…☆281Updated 7 months ago
- Slides and other materials from CppCon 2023☆330Updated last year
- Asynchronous Programming with C++, Published by Packt☆73Updated 11 months ago
- A lightweight memory allocator for hardware-accelerated machine learning☆174Updated last month
- NVIDIA tools guide☆147Updated 10 months ago
- Tutorial on building a gpu compiler backend in LLVM☆49Updated 10 months ago
- A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofline model analysis.☆55Updated 8 months ago
- A framework that support executing unmodified CUDA source code on non-NVIDIA devices.☆137Updated 10 months ago
- My notes on various HPC papers.☆24Updated 2 years ago
- ☆22Updated last year
- Nvidia Instruction Set Specification Generator☆298Updated last year
- This is a mirror of the official libpfm4 git repository, https://sourceforge.net/p/perfmon2/libpfm4/ci/master/tree/ with some local branc…☆67Updated last year
- ☆71Updated last year