rohithj / Xeon-CafPhi
Caffe deep learning framework - optimized for Xeon Phi
☆14Updated 9 years ago
Related projects ⓘ
Alternatives and complementary repositories for Xeon-CafPhi
- GPU implementation of classical molecular dynamics proxy application.☆30Updated 7 years ago
- a heterogeneous multiGPU level-3 BLAS library☆45Updated 4 years ago
- Compute applications.☆25Updated 4 years ago
- Fork of magma to include more BLAS☆28Updated 7 years ago
- Reference workloads for modern deep learning methods.☆73Updated last year
- TTC: A high-performance Compiler for Tensor Transpositions☆20Updated 7 years ago
- LonestarGPU: Irregular algorithms parallelized for GPUs☆33Updated 5 years ago
- OpenCL porting of the GROMACS molecular simulation toolkit☆25Updated 9 years ago
- Greentea LibDNN - a universal convolution implementation supporting CUDA and OpenCL☆135Updated 7 years ago
- ☆58Updated 2 years ago
- A Sound and Complete Verification Tool for Warp-Specialized GPU Kernels☆18Updated 9 years ago
- Nitro Autotuning Framework☆9Updated 8 years ago
- sparse matrix pre-processing library☆81Updated 6 months ago
- An implementation of ARMCI using MPI one-sided communication (RMA)☆13Updated last month
- Scientific library for high-precision computations and research☆50Updated 7 years ago
- Caffe: a fast open framework for deep learning.☆14Updated 9 years ago
- Dolphin - a Deep Learning on MIC architecture Project.☆25Updated 10 years ago
- CL Offline Compiler : Compile OpenCL kernels to HSAIL☆50Updated 7 years ago
- Scalable GPU Kernel Fission/Fusion Transformation for Memory-Bound Kernels☆13Updated 9 years ago
- Machine Learning Toolkit for Extreme Scale (MaTEx)☆111Updated 6 years ago
- Kernel Fusion and Runtime Compilation Based on NNVM☆69Updated 8 years ago
- A domain-specific language and compiler for image processing☆76Updated 3 years ago
- Intel Heterogeneous Research Compiler (iHRC)☆25Updated last year
- The SparseX sparse kernel optimization library☆39Updated 5 years ago
- A fast and highly scalable GPU dynamic memory allocator☆103Updated 9 years ago
- A framework that helps implementing swizzle GPU kernels☆41Updated 4 years ago
- A Lightweight Graph Processing Framework for Multi-GPUs☆14Updated 9 years ago