arneish / parallel-PCA-openmpLinks
A parallelized implementation of Principal Component Analysis (PCA) using Singular Value Decomposition (SVD) in OpenMP for C. The procedure used is Modified Gram Schmidt algorithm. The method for Classical Gram Schmidt is also available for use.
☆17Updated 6 years ago
Alternatives and similar repositories for parallel-PCA-openmp
Users that are interested in parallel-PCA-openmp are comparing it to the libraries listed below
Sorting:
- OpenMP-based parallel program for counting the number of triangles in a sparse graph☆18Updated 7 years ago
- pyCUDA implementation of forward propagation for Convolutional Neural Networks☆18Updated 6 years ago
- A collection of string sorting algorithms☆55Updated 3 years ago
- External memory suffix array construction using inducing☆11Updated 4 years ago
- Highly parallel DBSCAN (HPDBSCAN)☆44Updated 9 months ago
- Introduction to CUDA programming☆122Updated 8 years ago
- A red-black self-balancing interval tree☆30Updated 4 years ago
- Parallel network flows using OpenMP and CUDA.☆28Updated 6 years ago
- OpenMP-based parallel software for computing the truss decomposition of a graph.☆14Updated 7 years ago
- Fork of magma to include more BLAS☆28Updated 8 years ago
- ☆32Updated 4 years ago
- CUDA implementation of parallel Depth First Search (DFS) algorithm and it's comparison with a serial C++ DFS implementation.☆29Updated 7 years ago
- Parallel Matrix Multiplication Using OpenMP, Phtreads, and MPI☆56Updated 3 years ago
- A simple C++ fibonacci heap implementation☆31Updated 6 years ago
- Code for paper "Engineering a High-Performance GPU B-Tree" accepted to PPoPP 2019☆56Updated 3 years ago
- Bitonic Sort for C and CUDA☆16Updated 6 years ago
- Giddy - A lightweight GPU decompression library☆42Updated 5 years ago
- Medusa: Building GPU-based Parallel Sparse Graph Applications with Sequential C/C++ Code☆61Updated 4 years ago
- Fast & memory efficient Principal Components Analysis☆8Updated 10 years ago
- CUDA C implementation of Principal Component Analysis (PCA) through Singular Value Decomposition (SVD) using a highly parallelisable vers…☆27Updated 6 years ago
- Implementation of breadth first search on GPU with CUDA Driver API.☆50Updated 4 years ago
- Lock-free parallel disjoint set data structure (aka UNION-FIND) with path compression and union by rank☆65Updated 10 years ago
- Problem: LU Factorization using OpenMP and MPI: study of scalability.☆15Updated 11 years ago
- A GPU accelerated error-bounded lossy compression for scientific data.☆78Updated last month
- a CUDA implementation of a priority queue☆84Updated 4 years ago
- GBBS: Graph Based Benchmark Suite☆211Updated 6 months ago
- Online construction of run-length BWT (RLBWT) and r-index. Plus, online LZ77 parsing based on RLBWT.☆15Updated 7 years ago
- C++ implementation of concurrent Binary Search Trees☆72Updated 9 years ago
- GPULZ: Optimizing LZSS Lossless Compression for Multi-byte Data on Modern GPUs☆14Updated 2 months ago
- A library with space-filling curve algorithms (analysis, neighbor-finding, visualization) and other utilities (math, geometry, image proc…☆24Updated 7 years ago