libxsmm / libxsmm-dnn
Reference implementation of Deep Neural Network primitives using LIBXSMM's Tensor Processing Primitives (TPP)
☆12Updated last month
Alternatives and similar repositories for libxsmm-dnn
Users that are interested in libxsmm-dnn are comparing it to the libraries listed below
Sorting:
- A Benchmark Suite for Heterogeneous System Computation☆53Updated 2 months ago
- ROCm Tracer Callback/Activity Library for Performance tracing AMD GPUs☆83Updated this week
- ☆18Updated last year
- SYCL Benchmark Suite☆64Updated 2 months ago
- A memory profiler for NVIDIA GPUs to explore memory inefficiencies in GPU-accelerated applications.☆25Updated 7 months ago
- ☆17Updated 3 years ago
- ☆60Updated 5 months ago
- A novel spatial accelerator for horizontal diffusion weather stencil computation, as described in ICS 2023 paper by Singh et al. (https:/…☆19Updated last year
- TransferBench is a utility capable of benchmarking simultaneous copies between user-specified devices (CPUs/GPUs)☆39Updated last week
- GPU Performance Advisor☆65Updated 2 years ago
- ROCm SPARSE marshalling library☆67Updated this week
- ☆43Updated 4 years ago
- Official BOLT Repository☆28Updated 9 months ago
- Simplified Interface to Complex Memory☆28Updated last year
- NUMA-aware multi-CPU multi-GPU data transfer benchmarks☆23Updated last year
- A unified framework across multiple programming platforms☆37Updated 10 months ago
- SYCL Reference Manual☆27Updated last year
- BLAS implementation for Intel FPGA