satishphd / Teaching-Intel-Intrinsics-for-SIMD-Parallelism
Teaching Vectorization and SIMD using Intel Intrinsics in a Computer Organization and Architecture class
☆14Updated 2 months ago
Alternatives and similar repositories for Teaching-Intel-Intrinsics-for-SIMD-Parallelism:
Users that are interested in Teaching-Intel-Intrinsics-for-SIMD-Parallelism are comparing it to the libraries listed below
- ☆56Updated last month
- CDSChecker: A Model Checker for C11 and C++11 Atomics☆29Updated 11 years ago
- A benchmark for cache efficient data structures.☆30Updated 6 years ago
- A fast implementation of log() and exp()☆54Updated 2 years ago
- Very low-overhead timer/counter interfaces for C on Intel 64 processors.☆133Updated 5 years ago
- A Scalable, Portable, and Memory-Efficient Lock-Free FIFO Queue (DISC '19)☆55Updated last year
- SYCL Reference Manual☆27Updated 11 months ago
- A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofline model analysis.☆50Updated last month
- A minimal (really) out-of-tree MLIR example☆44Updated last week
- Library with JIT (Just-in-time) compilation support to optimize performance of small and medium matrix multiplication☆14Updated 3 years ago
- Programatically obtain information about the pages backing a given memory region☆75Updated 3 years ago
- This is a mirror of the official libpfm4 git repository, https://sourceforge.net/p/perfmon2/libpfm4/ci/master/tree/ with some local branc…☆59Updated 6 months ago
- Little OpenMP Library☆160Updated 2 years ago
- CERE: Codelet Extractor and REplayer☆40Updated last year
- InstLatX64_Demo☆43Updated last week
- ☆22Updated 8 years ago
- immintrin_dbg.h is an include file, a wrapper around immintrin.h. It implements most of AVX, AVX2, AVX-512 vector intrinsics to enable so…☆57Updated 2 years ago
- Library for lock-free locks☆77Updated last year
- Companion Repository for the Lecture Slides for the Clang Libraries☆100Updated last month
- The Berkeley Container Library☆124Updated last year
- Interchangeable AoS and SoA containers☆22Updated 2 years ago
- ☆28Updated 2 years ago
- A small library and kernel module for easy access to x86 performance monitor counters under Linux.☆96Updated 11 months ago
- NUMAPROF is a NUMA memory profliler based on Pintool to track your remote memory accesses.☆46Updated 9 months ago
- A benchmark for standard libraries☆22Updated last year
- Collection of synchronization micro-benchmarks and traces from infrastructure applications☆41Updated 3 months ago
- Intel® Instrumentation and Tracing Technology (ITT) and Just-In-Time (JIT) API☆103Updated last month
- A C++ memory pool that is Boost-friendly and performance oriented (zero-malloc).☆22Updated 2 weeks ago
- Generate SQL from TableGen code - This is part of the tutorial "How to write a TableGen backend" in 2021 LLVM Developers' Meeting.☆29Updated 2 years ago
- CPU Ultimate Latency Test.☆109Updated 2 years ago