qcri / sPCA
Scalable PCA (sPCA) is a scalable implementation of the principal component analysis (PCA) algorithm on top of Spark.
☆12 Updated 9 years ago
Alternatives and similar repositories for sPCA:
Users who are interested in sPCA are comparing it to the libraries listed below.
- Gaussian Mixture Model Implementation in Pyspark ☆32 Updated 10 years ago
- Benchmarks of artificial neural network library for Spark MLlib ☆11 Updated 9 years ago
- Repo for experiments on pyspark and sklearn ☆79 Updated 11 years ago
- Distributed Matrix Library ☆70 Updated 8 years ago
- Another, hopefully better, implementation of ALS on Spark ☆14 Updated 9 years ago
- Code to allow running BIDMach on Spark including HDFS integration and lightweight sparse model updates (Kylix). ☆15 Updated 4 years ago
- Yggdrasil: Faster Decision Trees Using Column Partitioning in Spark ☆31 Updated 6 years ago
- PyMC version 3 (PyMC 2 is in branch 2.3) ☆27 Updated 10 years ago
- scikit-learn addon to operate on set/"group"-based features ☆41 Updated 8 years ago
- National Data Science Bowl ☆20 Updated 9 years ago
- Library for building reproducible data pipelines to support experimentation ☆20 Updated 9 years ago
- CUDA kernel and JNI code which is called by Apache Spark's MLlib. ☆19 Updated 8 years ago
- Benchmarks of BLAS libraries with Scala interface ☆30 Updated 9 years ago
- more composable than other neural network libraries ☆42 Updated 8 years ago
- Demo of random projections at BerlinBuzzwords 2015 ☆22 Updated 4 years ago
- Splash Project for parallel stochastic learning ☆94 Updated 7 years ago
- Spark library for doing exploratory data analysis in a scalable way ☆43 Updated 9 years ago
- Mirror of Apache Spark ☆10 Updated 8 years ago
- Distributed solver library for large-scale structured output prediction, based on Spark. Project website: ☆17 Updated 8 years ago
- A package full of linear algebra operators for Apache Spark MLlib's linalg package ☆10 Updated 9 years ago
- Weighted matrix factorization on the GPU with Theano and scikits.cuda ☆12 Updated 10 years ago
- GPU Acceleration for Apache Spark ☆34 Updated 9 years ago
- NLP Utilities in Java ☆43 Updated 2 years ago
- Library for GPU-related statistical functions ☆84 Updated 12 years ago
- ADMM on Apache Spark ☆31 Updated 9 years ago
- SparkTDA is a package for Apache Spark providing Topological Data Analysis Functionalities. ☆47 Updated 6 years ago
- analytics tool kit ☆43 Updated 8 years ago
- A scala-based feature generation and modeling framework ☆61 Updated 6 years ago
- Scalable inference for Correlated Topic Models ☆30 Updated 9 years ago
- A primal-dual framework for distributed L1-regularized optimization ☆35 Updated 8 years ago