arneish / parallel-PCA-openmp
A parallelized implementation of Principal Component Analysis (PCA) using Singular Value Decomposition (SVD) in OpenMP for C. The procedure used is Modified Gram Schmidt algorithm. The method for Classical Gram Schmidt is also available for use.
☆17Updated 5 years ago
Alternatives and similar repositories for parallel-PCA-openmp:
Users that are interested in parallel-PCA-openmp are comparing it to the libraries listed below
- Sequential and Parallel(using Open MP and Pthreads) Implementations(c++) of the K Means Clustering Algorithm and visualizing the results …☆21Updated 5 years ago
- Fast & memory efficient Principal Components Analysis☆8Updated 9 years ago
- Highly parallel DBSCAN (HPDBSCAN)☆39Updated 4 months ago
- CUDA C implementation of Principal Component Analysis (PCA) through Singular Value Decomposition (SVD) using a highly parallelisable vers…☆26Updated 5 years ago
- Fastest CUDA RGB to grayscale: 5-30x faster than OpenCV. For image processing/computer vision.☆15Updated 3 years ago
- Implementation of the maximum network flow problem in CUDA.☆28Updated 4 years ago
- Code accompanying my blog post on k-means in Python, C++ and CUDA☆58Updated 7 years ago
- Implementation of breadth first search on GPU with CUDA Driver API.☆47Updated 3 years ago
- A C++ RangeTree implementation.☆40Updated 4 years ago
- GPU Optimization and Memory Abstraction Framework☆32Updated 5 years ago
- C++ to OpenCL C Source-to-source Translation☆13Updated 10 years ago
- C++ fast hierarchical clustering algorithms☆84Updated last year
- Contains implementations of cache-optimized and external memory algorithms.☆11Updated 6 years ago
- ANN: A Library for Approximate Nearest Neighbor Searching☆31Updated last week
- c++ implementation of decision tree algorithm☆54Updated 4 years ago
- openmp examples☆141Updated 5 years ago
- Abstractions of memory, allocator, vector, tuple, shared_ptr, unique_ptr, bitset, variant and string working on both CPU and GPU☆31Updated 2 weeks ago
- CUDA implementation of the Floyd-Warshall All pairs shortest path graph algorithm(with path reconstruction)☆38Updated 10 years ago
- CUDA implementation of parallel Depth First Search (DFS) algorithm and it's comparison with a serial C++ DFS implementation.☆27Updated 6 years ago
- This repository contains components that will support percolation via OpenCL and CUDA☆31Updated 3 years ago
- Lock-free parallel disjoint set data structure (aka UNION-FIND) with path compression and union by rank☆63Updated 9 years ago
- High-Performance Computing: CPU Instructions, GPU OpenCL & CUDA, etc.☆14Updated 8 months ago
- C++ Library for Portable SIMD Vectorization☆80Updated 2 months ago
- A Collection of Parallel Algorithms for Computational Geometry☆12Updated 2 years ago
- ☆20Updated 3 years ago
- GPU B-Tree with support for versioning (snapshots).☆45Updated 2 months ago
- Parallel computing implementation examples☆24Updated 7 years ago
- Set of basic classes (vector, matrix, images and memory array) for CPU and GPU☆17Updated 3 years ago
- A pure C++ implementation of the lowess algorithm using templates☆21Updated 9 years ago
- A GPU-based LZSS compression algorithm, highly tuned for NVIDIA GPGPUs and for streaming data, leveraging the respective strengths of CPU…☆36Updated 9 years ago