ShivayaDevs / Photops
CUDA-based parallel image processing tool
☆20Updated 7 years ago
Related projects
Alternatives and complementary repositories for Photops
- ☆42Updated 6 years ago
- Learning and practice of high performance computing (CUDA, Vulkan, OpenCL, OpenMP, TBB, SSE/AVX, NEON, MPI, coroutines, etc.)☆56Updated last week
- Windows Visual Studio Solutions for class "Introduction to Parallel Programming"☆19Updated 5 years ago
- A few cuda examples built with cmake☆23Updated 5 years ago
- Introduction to Parallel Programming class code☆31Updated 9 years ago
- Fast matrix multiplication☆28Updated 3 years ago
- ☆64Updated 10 years ago
- CMake Examples (CMake, CMake+CUDA, CMake+CUDA+PandaRoot)☆41Updated 11 years ago
- CUDA by practice☆116Updated 4 years ago
- "Hardware, Software, and Compilers! Oh My!" tutorial files☆17Updated 4 years ago
- Full-speed Array of Structures access☆160Updated last year
- Parallel network flows using OpenMP and CUDA.☆27Updated 5 years ago
- CUDA C implementation of Principal Component Analysis (PCA) through Singular Value Decomposition (SVD) using a highly parallelisable vers…☆26Updated 5 years ago
- Implementations of 2D Image Convolution algorithm with CUDA (using global memory, shared memory and constant memory)☆17Updated 6 years ago
- Matrix Multiplication on GPU using Shared Memory considering Coalescing and Bank Conflicts☆24Updated 2 years ago
- kmeans clustering with multi-GPU capabilities☆116Updated last year
- CUDA Sparse-Matrix Vector Multiplication, using Sliced Coordinate format☆20Updated 6 years ago
- Parallel Tasking Library (PTL) - Lightweight C++11 multithreading tasking system featuring thread-pool, task-groups, and lock-free task q…☆43Updated 3 months ago
- mirror from http://lotsofcores.com book 2, since dropbox isn't good for everyone☆38Updated 8 years ago
- Parallel Programming☆28Updated 11 years ago
- Just some examples of using CUDA I have put together☆37Updated last year
- Communication-Minimizing 2D Convolution in GPU Registers☆30Updated 11 years ago
- MPI Tutorial Exercises☆43Updated 10 years ago
- CMake module collection☆30Updated 9 years ago
- A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofline model analysis.☆42Updated 10 months ago
- ☆54Updated last year
- A collection of awesome algorithms, implemented in CUDA.☆24Updated 6 years ago
- CS344 - Introduction To Parallel Programming course (Udacity) proposed solutions☆51Updated 7 years ago
- Multiple 1-stencil implementations using nvidia cuda.☆13Updated 6 years ago
- C++ convenience classes to be used with CUDA code, for both the host and the kernel parts.☆55Updated 6 years ago