google / memcpy-gemm
☆15Updated last year
Related projects ⓘ
Alternatives and complementary repositories for memcpy-gemm
- ☆84Updated this week
- Portable 128-bit SIMD intrinsics☆57Updated last year
- ☆54Updated this week
- ☆16Updated 4 years ago
- C99/C++ header-only library for division via fixed-point multiplication by inverse☆49Updated 7 months ago
- ☆21Updated 2 years ago
- DSL for stencils and image processing☆13Updated 8 years ago
- Information about AVX-512 support on recent Intel processors☆43Updated 2 years ago
- SYCL Reference Manual☆26Updated 6 months ago
- Tests and benchmarks for cudnn (and in the future, other nvidia libraries)☆53Updated 4 years ago
- ROCm Thrust - run Thrust dependent software on AMD GPUs☆100Updated this week
- Pybind11 bindings for the Abseil C++ Common Libraries☆24Updated last month
- Fast and simple algorithms for computing both LCSk and LCSk+☆21Updated 6 years ago
- Emulating DMA Engines on GPUs for Performance and Portability☆34Updated 9 years ago
- AMD’s C++ library for accelerating tensor primitives☆35Updated this week
- Random number library that generate pseudo-random and quasi-random numbers.☆24Updated this week
- CorrelationVector-Cpp provides a reference C++ implementation of the CorrelationVector protocol for tracing and correlation of events thr…☆17Updated 2 years ago
- CUDA Template Functions☆18Updated 3 months ago
- ☆33Updated last year
- This repository provides code for SVD and Importance sampling-based algorithms for large scale topic modeling.☆13Updated 3 years ago
- ☆14Updated last week
- immintrin_dbg.h is an include file, a wrapper around immintrin.h. It implements most of AVX, AVX2, AVX-512 vector intrinsics to enable so…☆57Updated last year
- ☆11Updated 3 years ago
- ☆88Updated this week
- Compiler agnostic metaprogramming library providing concepts, type operations and tuples for C++ and cuda☆80Updated this week
- This repository contains my experiments with compression-related algorithms☆35Updated 8 years ago
- Vendored files from Intel's SVML☆37Updated 10 months ago
- Mirror kept for legacy. Moved to https://github.com/llvm/llvm-project☆35Updated 5 years ago
- Tools and extensions for CUDA profiling☆63Updated 4 years ago
- AMD ROCm Performance Primitives (RPP) library is a comprehensive high-performance computer vision library for AMD processors with HIP/Ope…☆55Updated this week