arneish / parallel-PCA-openmp
A parallelized implementation of Principal Component Analysis (PCA) using Singular Value Decomposition (SVD), written in C with OpenMP. The procedure uses the Modified Gram-Schmidt algorithm; a Classical Gram-Schmidt method is also available.
☆17 Updated 5 years ago
Alternatives and similar repositories for parallel-PCA-openmp:
Users who are interested in parallel-PCA-openmp are comparing it to the libraries listed below:
- Shared Memory, Message Passing, and Hybrid Merge Sort: UPC, OpenMP, MPI and Hybrid Implementations ☆14 Updated last year
- Highly parallel DBSCAN (HPDBSCAN) ☆40 Updated 5 months ago
- CUDA C implementation of Principal Component Analysis (PCA) through Singular Value Decomposition (SVD) using a highly parallelisable vers… ☆26 Updated 5 years ago
- Fast & memory efficient Principal Components Analysis ☆8 Updated 9 years ago
- Sequential and Parallel(using Open MP and Pthreads) Implementations(c++) of the K Means Clustering Algorithm and visualizing the results … ☆20 Updated 6 years ago
- A Collection of Parallel Algorithms for Computational Geometry ☆12 Updated 2 years ago
- PCA implementation in c++ ☆36 Updated 13 years ago
- A pure C++ implementation of the lowess algorithm using templates ☆21 Updated 9 years ago
- A collection of code examples for learning parallel programming concepts ☆52 Updated 4 years ago
- My curated list of C++ (GPU) BLAS libraries and machine learning/reinforcement learning frameworks ☆25 Updated 5 years ago
- std::map/std::set implementation using the adaptive radix tree ☆13 Updated 3 years ago
- A minimalistic header only C++11 Neural Network library based on Eigen::Tensor ☆20 Updated 7 years ago
- Algorithms implemented in CUDA + resources about GPGPU ☆54 Updated 3 years ago
- CUDA implementation of the Floyd-Warshall All pairs shortest path graph algorithm(with path reconstruction) ☆38 Updated 10 years ago
- Abstractions of memory, allocator, vector, tuple, shared_ptr, unique_ptr, bitset, variant and string working on both CPU and GPU ☆30 Updated last month
- GPU B-Tree with support for versioning (snapshots). ☆47 Updated 4 months ago
- Fastest CUDA RGB to grayscale: 5-30x faster than OpenCV. For image processing/computer vision. ☆15 Updated 3 years ago
- A GPU-based LZSS compression algorithm, highly tuned for NVIDIA GPGPUs and for streaming data, leveraging the respective strengths of CPU… ☆35 Updated 9 years ago
- A collection of string sorting algorithms ☆55 Updated 3 years ago
- Interleaving bits from two sources using SIMD instructions. ☆14 Updated 7 years ago
- C++ fast hierarchical clustering algorithms ☆85 Updated last year
- An alternative to Boost.MPI for a user friendly C++ interface for MPI (MPICH). ☆19 Updated 7 years ago
- DSL for stencils and image processing ☆13 Updated 8 years ago
- A library with space-filling curve algorithms (analysis, neighbor-finding, visualization) and other utilities (math, geometry, image proc… ☆24 Updated 7 years ago
- AVX512F and AVX2 versions of quick sort ☆105 Updated 7 years ago
- Generalized Histograms for CUDA-capable GPUs ☆43 Updated 9 years ago
- c++ implementation of decision tree algorithm ☆54 Updated 5 years ago
- Principle Component Analysis ☆12 Updated 9 years ago
- ☆42 Updated 7 years ago
- DSL for SIMD Sorting on AVX2 & AVX512 ☆30 Updated 6 years ago