CoffeeBeforeArch / bits_of_architecture
Slides from the "Bits of Architecture" series on YouTube
☆21Updated 2 years ago
Alternatives and similar repositories for bits_of_architecture:
Users that are interested in bits_of_architecture are comparing it to the libraries listed below
- "Hardware, Software, and Compilers! Oh My!" tutorial files☆17Updated 4 years ago
- Companion Repository for the Lecture Slides for the Clang Libraries☆88Updated 10 months ago
- Code examples for tutoring modern C++☆91Updated last month
- A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofline model analysis.☆47Updated last year
- A simple trace-based cache simulator☆10Updated 2 weeks ago
- CudaPAD is a PTX/SASS viewer for NVIDIA Cuda kernels and provides an on-the-fly view of the assembly.☆109Updated 2 years ago
- Slides and other materials from CppCon2021☆100Updated last year
- ☆56Updated 2 weeks ago
- A profiler to disclose and quantify hardware features on GPUs.☆165Updated 2 years ago
- This is a mirror of the official libpfm4 git repository, https://sourceforge.net/p/perfmon2/libpfm4/ci/master/tree/ with some local branc…☆56Updated 2 months ago
- A small library and kernel module for easy access to x86 performance monitor counters under Linux.☆97Updated 8 months ago
- Omnitrace: Application Profiling, Tracing, and Analysis☆307Updated last week
- C++ files from the "C++ Crash Course" YouTube series by CoffeeBeforeArch☆100Updated 2 years ago
- Simple OpenCL Samples that Build with Khronos Headers and Libs☆96Updated this week
- X86 CPU topics overview for developers , oriented towards performance☆194Updated 3 months ago
- SYCL Reference Manual☆27Updated 8 months ago
- ☆83Updated last year
- Very low-overhead timer/counter interfaces for C on Intel 64 processors.☆120Updated 5 years ago
- Unofficial description of the CUDA assembly (SASS) instruction sets.☆47Updated 2 months ago
- ROB size testing utility☆140Updated 3 years ago
- ☆51Updated 5 years ago
- Clang supporting syntax plugins☆18Updated 2 years ago
- Fast integer division with divisor not known at compile time. To be used primarily in CUDA kernels.☆71Updated 9 years ago
- Graphics Processing Unit (GPU) Architecture Guide☆172Updated 2 years ago
- LLVM (Low Level Virtual Machine) Guide. Learn all about the compiler infrastructure, which is designed for compile-time, link-time, run-t…☆156Updated last year
- Utilities for accessing AMD's Machine-Readable GPU ISA Specifications.☆23Updated 4 months ago
- Intel® Extension for MLIR. A staging ground for MLIR dialects and tools for Intel devices using the MLIR toolchain.☆126Updated this week
- A collection of performance analysis tools, recipes, handy scripts, microbenchmarks & more☆126Updated this week
- CUDAAdvisor: a GPU profiling tool☆48Updated 6 years ago
- A graphics tracing and replay framework to explore system-level effects on heterogeneous CPU+GPU memory systems.☆14Updated 6 years ago