vectorclass / testbench
Test bench and scripts for testing VCL
☆9Updated 11 months ago
Related projects ⓘ
Alternatives and complementary repositories for testbench
- C++ vector class library, version 1☆24Updated 2 years ago
- Miscellaneous files relating to Vector class library☆9Updated 2 years ago
- Asynchronous Task and Memory Interface, or ATMI, is a runtime framework and programming model for heterogeneous CPU-GPU systems. It provi…☆66Updated 8 months ago
- Portable 128-bit SIMD intrinsics☆55Updated last year
- SYCL-ML is a C++ library, implementing classical machine learning algorithms using SYCL.☆64Updated 4 years ago
- A GPU-based LZSS compression algorithm, highly tuned for NVIDIA GPGPUs and for streaming data, leveraging the respective strengths of CPU…☆35Updated 8 years ago
- Giddy - A lightweight GPU decompression library☆42Updated 5 years ago
- Manual for the C++ vector class library☆29Updated 11 months ago
- This repository contains my experiments with compression-related algorithms☆35Updated 8 years ago
- C99/C++ header-only library for division via fixed-point multiplication by inverse☆48Updated 6 months ago
- Add-on packages for Vector class library☆71Updated 11 months ago
- Profiling Taskflow Programs through Visualization☆47Updated last year
- Tests and benchmarks for cudnn (and in the future, other nvidia libraries)☆53Updated 3 years ago
- Mirror of the Cephes C source for reference☆86Updated 10 months ago
- OpenCL tool to detect buffer overflows in GPU kernels☆20Updated 5 years ago
- A C/C++ task-based programming model for shared memory and distributed parallel computing.☆71Updated 4 years ago
- library which simplifies host-GPU data transfer using userspace pagefault handling☆15Updated 12 years ago
- Python bindings for libNVVM☆37Updated 10 years ago
- Tools and extensions for CUDA profiling☆63Updated 4 years ago
- A thin wrapper around miOpen and cuDNN☆38Updated last year
- TTC: A high-performance Compiler for Tensor Transpositions☆20Updated 7 years ago
- C++ convenience classes to be used with CUDA code, for both the host and the kerlel parts.☆55Updated 6 years ago
- SYCL Reference Manual☆25Updated 6 months ago
- ☆75Updated last year
- Tools for parsing, assembling, and disassembling HSAIL.☆70Updated 4 years ago
- ☆54Updated 2 weeks ago
- XLA integration of Open Neural Network Exchange (ONNX)☆19Updated 6 years ago
- High-level C++ for Accelerator Clusters☆142Updated this week
- Intel Heterogeneous Research Compiler (iHRC)☆25Updated last year
- Enable Polyhedral JIT compilation☆9Updated 6 years ago