Spark MLlib code optimized to efficiently support sparse data
☆51Dec 22, 2016Updated 9 years ago
Alternatives and similar repositories for SparseML
Users that are interested in SparseML are comparing it to the libraries listed below
Sorting:
- Assembly of fundamental statistics implemented based on Apache Spark☆31Feb 11, 2016Updated 10 years ago
- Topic Modeling on Apache Spark☆94Mar 1, 2019Updated 7 years ago
- The machine learning component of Open Network Insight: scalable analytics combining spark for big data and C / MPI for high performance …☆13Nov 9, 2016Updated 9 years ago
- DistML provide a supplement to mllib to support model-parallel on Spark☆169Feb 6, 2017Updated 9 years ago
- Glint: High performance scala parameter server☆170Jul 20, 2018Updated 7 years ago
- Core HW bindings and optimizations for BigDL☆37Nov 24, 2025Updated 3 months ago
- ☆20Dec 1, 2016Updated 9 years ago
- Spark implementation of Ford-Fulkerson algorithm☆14Feb 11, 2018Updated 8 years ago
- A Hivemall wrapper for Spark☆31Apr 21, 2016Updated 9 years ago
- Example code for building your own MemSQL Streamliner Pipelines☆23Apr 18, 2017Updated 8 years ago
- Spark Extension : ML transformers, SQL aggregations, etc that are missing in Apache Spark☆146Jan 26, 2016Updated 10 years ago
- Spark library for doing exploratory data analysis in a scalable way☆43Jan 17, 2016Updated 10 years ago
- A package full of linear algebra operators for Apache Spark MLlib's linalg package☆10Sep 9, 2015Updated 10 years ago
- Simplifying robust end-to-end machine learning on Apache Spark.☆475Apr 18, 2017Updated 8 years ago
- Docker containers with Apache Accumulo and Apache Spark environment.☆12Jan 22, 2016Updated 10 years ago
- Advanced workshop on XGBoost with Tianqi Chen in Santa Monica, June 2, 2016☆27Nov 21, 2016Updated 9 years ago
- MLeap allows for easily putting Spark ML pipelines into production☆78Oct 27, 2016Updated 9 years ago
- ☆25Nov 8, 2019Updated 6 years ago
- A Distributed Matrix Operations Library Built on Top of Spark☆109Dec 28, 2016Updated 9 years ago
- Ytk-mp4j is a fast, user-friendly, cross-platform, multi-process, multi-thread collective message passing java library which includes gat…☆111Jun 14, 2017Updated 8 years ago
- A scalable machine learning library on Apache Spark☆796Aug 30, 2021Updated 4 years ago
- Benchmarks of BLAS libraries with Scala interface☆30Jan 21, 2016Updated 10 years ago
- Online Latent Dirichlet Allocation with Infinite Vocabulary using Variational Inference☆74Sep 28, 2015Updated 10 years ago
- TensorFlow on Spark☆296Oct 19, 2017Updated 8 years ago
- Zen aims to provide the largest scale and the most efficient machine learning platform on top of Spark, including but not limited to logi…☆170Nov 17, 2018Updated 7 years ago
- Distributed implementation of Robust PLSA using Spark☆12Apr 29, 2021Updated 4 years ago
- Import and export TensorFlow records from/to Spark☆18Jul 7, 2017Updated 8 years ago
- Distributed Matrix Library☆72Jan 28, 2017Updated 9 years ago
- HBase Indexer - indexing HBase to Solr 5.x and higher☆13Oct 27, 2017Updated 8 years ago
- An implementation of DBSCAN runing on top of Apache Spark☆183Jan 10, 2018Updated 8 years ago
- An API for Distributed Machine Learning☆155Sep 22, 2016Updated 9 years ago
- An example of building kubernetes operator (Flink) using Abstract operator's framework☆26Jul 12, 2019Updated 6 years ago
- A simple project that trains an OpenNLP Named Entity Recognition model to identify ingredients in a recipe.☆14Oct 30, 2016Updated 9 years ago
- An engineering report on using transactions in Kafka 0.11.0.0☆19Feb 27, 2018Updated 8 years ago
- A distributed implementation of AdaBoost.MH and MP-Boost using Apache Spark☆18Jul 7, 2016Updated 9 years ago
- Splash Project for parallel stochastic learning☆93Jun 16, 2017Updated 8 years ago
- Use AlluxioBlockManager to intead TachyonBlockManager as spark's off_heap.☆14Nov 3, 2016Updated 9 years ago
- Elastic Search on Spark☆112Oct 21, 2014Updated 11 years ago
- ☆12May 16, 2017Updated 8 years ago