eknight7 / ParallelRNNLinks
Final Project for Parallel Computing at CMU (15-618/15-418)
☆10Updated 9 years ago
Alternatives and similar repositories for ParallelRNN
Users that are interested in ParallelRNN are comparing it to the libraries listed below
Sorting:
- GPU implementation of classical molecular dynamics proxy application.☆31Updated 9 years ago
- Computing Language Utility☆72Updated 9 years ago
- a heterogeneous multiGPU level-3 BLAS library☆46Updated 6 years ago
- Benchmarking matrix multiplication implementations☆103Updated 9 years ago
- Caffe deep learning framework - optimized for Xeon Phi☆14Updated 10 years ago
- Greentea LibDNN - a universal convolution implementation supporting CUDA and OpenCL☆137Updated 8 years ago
- Generic system-wide modern C++ for heterogeneous platforms with SYCL from Khronos Group☆78Updated 5 years ago
- ☆74Updated 2 years ago
- A fast and highly scalable GPU dynamic memory allocator☆112Updated 10 years ago
- TTC: A high-performance Compiler for Tensor Transpositions☆21Updated 8 years ago
- Vectorized intersections (research code)☆16Updated 9 years ago
- A domain-specific language and compiler for image processing☆77Updated 4 years ago
- GPUfs - File system support for NVIDIA GPUs☆99Updated 7 years ago
- LonestarGPU: Irregular algorithms parallelized for GPUs☆38Updated 6 years ago
- A simple memory manager for CUDA designed to help Deep Learning frameworks manage memory☆300Updated 7 years ago
- A CUDNN minimal deep learning training code sample using LeNet.☆269Updated 2 years ago
- A portable high-level API with CUDA or OpenCL back-end☆55Updated 8 years ago
- Multiple 1-stencil implementations using nvidia cuda.☆13Updated 8 years ago
- A GPU-based LZSS compression algorithm, highly tuned for NVIDIA GPGPUs and for streaming data, leveraging the respective strengths of CPU…☆38Updated 10 years ago
- Mirror JPEG compression and decompression accelerated on GPU☆82Updated 11 years ago
- Full-speed Array of Structures access☆176Updated 2 years ago
- CNNs in Halide☆23Updated 10 years ago
- ☆101Updated 6 years ago
- ☆10Updated 3 years ago
- Communication-Minimizing 2D Convolution in GPU Registers☆30Updated 12 years ago
- Project ARES represents a joint effort between LANL and ORNL to introduce a common compiler representation and tool-chain for HPC applica…☆10Updated 9 years ago
- Easy to run kernels using OpenCL☆187Updated 9 months ago
- High optimized fft library based on CUDA(the same fast as cufft and faster some times)☆19Updated 8 years ago
- GCN ISA assembler tool for my GSoC project at Openwall☆35Updated 10 years ago
- CL Offline Compiler : Compile OpenCL kernels to HSAIL☆50Updated 8 years ago