Spark MLlib code optimized to efficiently support sparse data
☆51Dec 22, 2016Updated 9 years ago
Alternatives and similar repositories for SparseML
Users that are interested in SparseML are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Assembly of fundamental statistics implemented based on Apache Spark☆31Feb 11, 2016Updated 10 years ago
- Topic Modeling on Apache Spark☆94Mar 1, 2019Updated 7 years ago
- The machine learning component of Open Network Insight: scalable analytics combining spark for big data and C / MPI for high performance …☆13Nov 9, 2016Updated 9 years ago
- DistML provide a supplement to mllib to support model-parallel on Spark☆170Feb 6, 2017Updated 9 years ago
- Glint: High performance scala parameter server☆170Jul 20, 2018Updated 7 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Core HW bindings and optimizations for BigDL☆37Nov 24, 2025Updated 6 months ago
- ☆21Oct 13, 2016Updated 9 years ago
- ☆20Dec 1, 2016Updated 9 years ago
- Spark implementation of Ford-Fulkerson algorithm☆14Feb 11, 2018Updated 8 years ago
- A Hivemall wrapper for Spark☆31Apr 21, 2016Updated 10 years ago
- Example code for building your own MemSQL Streamliner Pipelines☆23Apr 18, 2017Updated 9 years ago
- Spark Extension : ML transformers, SQL aggregations, etc that are missing in Apache Spark☆146Jan 26, 2016Updated 10 years ago
- Spark library for doing exploratory data analysis in a scalable way☆43Jan 17, 2016Updated 10 years ago
- A package full of linear algebra operators for Apache Spark MLlib's linalg package☆10Sep 9, 2015Updated 10 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Simplifying robust end-to-end machine learning on Apache Spark.☆473Apr 18, 2017Updated 9 years ago
- Spark On Angel, arming Spark with a powerful Parameter Server, which enable Spark to train very big models☆85Jan 2, 2023Updated 3 years ago
- Scala client for InfluxDB☆22Nov 15, 2022Updated 3 years ago
- Advanced workshop on XGBoost with Tianqi Chen in Santa Monica, June 2, 2016☆27Nov 21, 2016Updated 9 years ago
- MLeap allows for easily putting Spark ML pipelines into production☆78Oct 27, 2016Updated 9 years ago
- A Distributed Matrix Operations Library Built on Top of Spark☆110Dec 28, 2016Updated 9 years ago
- A scalable machine learning library on Apache Spark☆797Aug 30, 2021Updated 4 years ago
- Benchmarks of BLAS libraries with Scala interface☆30Jan 21, 2016Updated 10 years ago
- Online Latent Dirichlet Allocation with Infinite Vocabulary using Variational Inference☆74Sep 28, 2015Updated 10 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- TensorFlow on Spark☆296Oct 19, 2017Updated 8 years ago
- Zen aims to provide the largest scale and the most efficient machine learning platform on top of Spark, including but not limited to logi…☆169Nov 17, 2018Updated 7 years ago
- Import and export TensorFlow records from/to Spark☆18Jul 7, 2017Updated 8 years ago
- Distributed Matrix Library☆73Jan 28, 2017Updated 9 years ago
- HBase Indexer - indexing HBase to Solr 5.x and higher☆13Oct 27, 2017Updated 8 years ago
- An implementation of DBSCAN runing on top of Apache Spark☆182Jan 10, 2018Updated 8 years ago
- An API for Distributed Machine Learning☆156Sep 22, 2016Updated 9 years ago
- An example of building kubernetes operator (Flink) using Abstract operator's framework☆26Jul 12, 2019Updated 6 years ago
- Distributed Streaming Matrix Factorization implemented on Spark for Recommendation Systems☆109Mar 25, 2016Updated 10 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- A simple project that trains an OpenNLP Named Entity Recognition model to identify ingredients in a recipe.☆14Oct 30, 2016Updated 9 years ago
- An engineering report on using transactions in Kafka 0.11.0.0☆19Feb 27, 2018Updated 8 years ago
- A distributed implementation of AdaBoost.MH and MP-Boost using Apache Spark☆18Jul 7, 2016Updated 9 years ago
- Splash Project for parallel stochastic learning☆93Jun 16, 2017Updated 8 years ago
- Use AlluxioBlockManager to intead TachyonBlockManager as spark's off_heap.☆14Nov 3, 2016Updated 9 years ago
- Elastic Search on Spark☆111Oct 21, 2014Updated 11 years ago
- ☆12May 16, 2017Updated 9 years ago