CoffeeBeforeArch / bits_of_architectureLinks
Slides from the "Bits of Architecture" series on YouTube
☆28Updated 3 years ago
Alternatives and similar repositories for bits_of_architecture
Users that are interested in bits_of_architecture are comparing it to the libraries listed below
Sorting:
- X86 CPU topics overview for developers , oriented towards performance☆202Updated 9 months ago
- ☆119Updated 2 years ago
- Omnitrace: Application Profiling, Tracing, and Analysis☆335Updated last week
- A profiler to disclose and quantify hardware features on GPUs.☆175Updated 3 years ago
- Source code for 'Data Parallel C++: Mastering DPC++ for Programming of Heterogeneous Systems using C++ and SYCL' by James Reinders, Ben A…☆281Updated 8 months ago
- MLIR-based toolkit targeting intel heterogeneous hardware☆49Updated 9 months ago
- A framework that support executing unmodified CUDA source code on non-NVIDIA devices.☆137Updated 11 months ago
- Unofficial description of the CUDA assembly (SASS) instruction sets.☆173Updated 4 months ago
- Serial and parallel implementations of matrix multiplication☆44Updated 4 years ago
- Learn LLVM 17, published by Packt☆210Updated last year
- MLIR Sample dialect☆132Updated 9 months ago
- Slides and other materials from CppCon 2022☆559Updated 3 months ago
- Demonstration of various hardware effects on CUDA GPUs.☆390Updated 2 years ago
- Gallatin is a general-purpose memory manager for CUDA that allows for threads to quickly malloc and free memory of arbitrary size inside …☆25Updated 2 months ago
- GPUOcelot: A dynamic compilation framework for PTX☆218Updated 10 months ago
- A GPU benchmark suite for assessing on-chip GPU memory bandwidth☆109Updated 8 years ago
- Code examples for tutoring modern C++☆99Updated 4 months ago
- CudaPAD is a PTX/SASS viewer for NVIDIA Cuda kernels and provides an on-the-fly view of the assembly.☆125Updated 2 years ago
- A repository where GPU applications are aggregated using a common build flow that supports multiple CUDA versions.☆86Updated last month
- A collection of performance analysis tools, recipes, handy scripts, microbenchmarks & more☆142Updated 5 months ago
- C++ files from the "C++ Crash Course" YouTube series by CoffeeBeforeArch☆107Updated 3 years ago
- Learn OpenMP examples step by step☆101Updated 10 months ago
- Intel® Extension for MLIR. A staging ground for MLIR dialects and tools for Intel devices using the MLIR toolchain.☆145Updated this week
- ☆160Updated this week
- Multi-GPU dynamic scheduler using PGAS style cross-GPU communication☆29Updated 2 years ago
- ☆288Updated 2 months ago
- ☆54Updated 6 years ago
- Slides and other materials from CppCon2021☆117Updated 2 years ago
- SYCL for Vitis: Experimental fusion of triSYCL with Intel SYCL oneAPI DPC++ up-streaming effort into Clang/LLVM☆122Updated last year
- Tutorial on building a gpu compiler backend in LLVM☆49Updated 11 months ago