libxsmm / libxsmm-dnnLinks
Reference implementation of Deep Neural Network primitives using LIBXSMM's Tensor Processing Primitives (TPP)
☆12Updated last month
Alternatives and similar repositories for libxsmm-dnn
Users that are interested in libxsmm-dnn are comparing it to the libraries listed below
Sorting:
- Compute applications.☆24Updated 5 years ago
- SYCL Reference Manual☆28Updated last year
- A CUTLASS implementation using SYCL☆23Updated last week
- SYCL Benchmark Suite☆64Updated 3 months ago
- ☆41Updated 2 weeks ago
- A Benchmark Suite for Heterogeneous System Computation☆53Updated 3 months ago
- ROCm Tracer Callback/Activity Library for Performance tracing AMD GPUs☆83Updated last week
- ROCm SPARSE marshalling library☆67Updated this week
- development repository for the open earth compiler☆80Updated 4 years ago
- A memory profiler for NVIDIA GPUs to explore memory inefficiencies in GPU-accelerated applications.☆25Updated 7 months ago
- ☆44Updated 4 years ago
- A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofline model analysis.☆52Updated 2 months ago
- ☆55Updated 6 years ago
- nvptx-tools: a collection of tools for use with nvptx-none GCC toolchains.☆50Updated 9 months ago
- Nanos6 is a runtime that implements the OmpSs-2 parallel programming model, developed by the System Tools and Advanced Runtimes (STAR) gr…☆21Updated 6 months ago
- A unified framework across multiple programming platforms☆38Updated last week
- CUDAAdvisor: a GPU profiling tool☆49Updated 6 years ago
- Simplified Interface to Complex Memory☆28Updated last year
- Machine Intelligence Shader Autogen. AMDGPU ML shader code generator. (previously iGEMMgen)☆34Updated last week
- Set of OpenCL microbenchmarks☆29Updated last year
- oneAPI Level Zero Conformance & Performance test content☆54Updated last week
- Performance Prediction Toolkit☆52Updated 5 months ago
- GPU Performance Advisor☆65Updated 2 years ago
- Flexible GPGPU instrumentation☆87Updated 5 years ago
- Linux Cross-Memory Attach☆94Updated 8 months ago
- ☆15Updated last month
- The translator that supports translating NVPTX to SPIR-V. This translator is modified from LLVM-SPIR-V Translator.☆39Updated 3 years ago
- TLB Benchmarks☆34Updated 7 years ago
- TransferBench is a utility capable of benchmarking simultaneous copies between user-specified devices (CPUs/GPUs)☆39Updated last week
- ☆17Updated 3 years ago