CoffeeBeforeArch / bits_of_architectureLinks

Slides from the "Bits of Architecture" series on YouTube

☆23

Alternatives and similar repositories for bits_of_architecture

Users that are interested in bits_of_architecture are comparing it to the libraries listed below

Sorting:

CoffeeBeforeArch / spring_2020_tutorial
"Hardware, Software, and Compilers! Oh My!" tutorial files
☆16Updated 5 years ago
ProjectPhysX / PTXprofiler
A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofline model analysis.
☆52Updated 2 months ago
banach-space / cpp-tutor
Code examples for tutoring modern C++
☆96Updated last week
mdadams / clang_libraries_companion
Companion Repository for the Lecture Slides for the Clang Libraries
☆100Updated 2 months ago
boostcon / cppnow_presentations_2023
☆86Updated last year
amd / amd-lab-notes
AMD lab notes with code examples to demonstrate use of AMD GPUs
☆98Updated 11 months ago
CoffeeBeforeArch / cache_simulator
A simple trace-based cache simulator
☆13Updated 5 months ago
GPUOpen-Tools / isa_spec_manager
Utilities for accessing AMD's Machine-Readable GPU ISA Specifications.
☆33Updated 3 months ago
intel / vc-intrinsics
☆57Updated last week
vortexgpgpu / NVPTX-SPIRV-Translator
The translator that supports translating NVPTX to SPIR-V. This translator is modified from LLVM-SPIR-V Translator.
☆39Updated 3 years ago
ychen306 / vegen
☆29Updated 2 years ago
ashvardanian / ParallelReductionsBenchmark
Thrust, CUB, TBB, AVX2, AVX-512, CUDA, OpenCL, OpenMP, Metal, and Rust - all it takes to sum a lot of numbers fast!
☆98Updated last week
microsoft / ArchProbe
A profiler to disclose and quantify hardware features on GPUs.
☆170Updated 3 years ago
bashbaug / SimpleOpenCLSamples
Simple OpenCL Samples that Build with Khronos Headers and Libs
☆105Updated last month
spcl / haystack
Haystack is an analytical cache model that given a program computes the number of cache misses.
☆46Updated 5 years ago
mabdullahsoyturk / HPC-Paper-Notes
My notes on various HPC papers.
☆22Updated 2 years ago
KhronosGroup / SYCL_Reference
SYCL Reference Manual
☆28Updated last year
CoffeeBeforeArch / parallel_cpp
☆98Updated 2 years ago
ROCm / omnitrace
Omnitrace: Application Profiling, Tracing, and Analysis
☆312Updated last week
wcohen / libpfm4
This is a mirror of the official libpfm4 git repository, https://sourceforge.net/p/perfmon2/libpfm4/ci/master/tree/ with some local branc…
☆64Updated 7 months ago
CUDACommunity / CUDACommunityMeetup2021
☆23Updated 3 years ago
unisa-hpc / sycl-bench
SYCL Benchmark Suite
☆64Updated 3 months ago
CoffeeBeforeArch / cpp_20_samples
Code examples using new features from C++20
☆35Updated 4 years ago
gthparch / macsim
A heterogeneous architecture timing model simulator.
☆156Updated 5 months ago
hkust-adsl / gass
☆35Updated 3 years ago
intel / ittapi
Intel® Instrumentation and Tracing Technology (ITT) and Just-In-Time (JIT) APIs
☆113Updated last week
decodecudabinary / Decoding-CUDA-Binary
☆52Updated 5 years ago
intel / graph-compiler
MLIR-based toolkit targeting intel heterogeneous hardware
☆44Updated 3 months ago
dian-lun-lin / taro
Task graph-based asynchronous programming system using C++ coroutine
☆90Updated last year
OpenGPGPU / opengpgpu
☆68Updated 7 months ago