Spark MLlib code optimized to efficiently support sparse data
☆51Dec 22, 2016Updated 9 years ago
Alternatives and similar repositories for SparseML
Users that are interested in SparseML are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Assembly of fundamental statistics implemented based on Apache Spark☆31Feb 11, 2016Updated 10 years ago
- Topic Modeling on Apache Spark☆94Mar 1, 2019Updated 7 years ago
- The machine learning component of Open Network Insight: scalable analytics combining spark for big data and C / MPI for high performance …☆13Nov 9, 2016Updated 9 years ago
- DistML provide a supplement to mllib to support model-parallel on Spark☆170Feb 6, 2017Updated 9 years ago
- Glint: High performance scala parameter server☆170Jul 20, 2018Updated 7 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Core HW bindings and optimizations for BigDL☆37Nov 24, 2025Updated 7 months ago
- ☆20Dec 1, 2016Updated 9 years ago
- Spark implementation of Ford-Fulkerson algorithm☆14Feb 11, 2018Updated 8 years ago
- A Hivemall wrapper for Spark☆31Apr 21, 2016Updated 10 years ago
- Example code for building your own MemSQL Streamliner Pipelines☆23Apr 18, 2017Updated 9 years ago
- Spark Extension : ML transformers, SQL aggregations, etc that are missing in Apache Spark☆145Jan 26, 2016Updated 10 years ago
- Spark library for doing exploratory data analysis in a scalable way☆43Jan 17, 2016Updated 10 years ago
- A package full of linear algebra operators for Apache Spark MLlib's linalg package☆10Sep 9, 2015Updated 10 years ago
- Simplifying robust end-to-end machine learning on Apache Spark.☆473Apr 18, 2017Updated 9 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Docker containers with Apache Accumulo and Apache Spark environment.☆12Jan 22, 2016Updated 10 years ago
- Spark On Angel, arming Spark with a powerful Parameter Server, which enable Spark to train very big models☆85Jan 2, 2023Updated 3 years ago
- Scala client for InfluxDB☆22Nov 15, 2022Updated 3 years ago
- Advanced workshop on XGBoost with Tianqi Chen in Santa Monica, June 2, 2016☆27Nov 21, 2016Updated 9 years ago
- MLeap allows for easily putting Spark ML pipelines into production☆78Oct 27, 2016Updated 9 years ago
- ☆25Nov 8, 2019Updated 6 years ago
- A Distributed Matrix Operations Library Built on Top of Spark☆110Dec 28, 2016Updated 9 years ago
- Ytk-mp4j is a fast, user-friendly, cross-platform, multi-process, multi-thread collective message passing java library which includes gat…☆112Jun 14, 2017Updated 9 years ago
- A scalable machine learning library on Apache Spark☆797Aug 30, 2021Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Benchmarks of BLAS libraries with Scala interface☆30Jan 21, 2016Updated 10 years ago
- Scanning alive hosts of the given CIDR range in parallel.☆10May 8, 2025Updated last year
- Zen aims to provide the largest scale and the most efficient machine learning platform on top of Spark, including but not limited to logi…☆169Nov 17, 2018Updated 7 years ago
- Distributed implementation of Robust PLSA using Spark☆12Apr 29, 2021Updated 5 years ago
- Import and export TensorFlow records from/to Spark☆18Jul 7, 2017Updated 8 years ago
- Distributed Matrix Library☆73Jan 28, 2017Updated 9 years ago
- An implementation of DBSCAN runing on top of Apache Spark☆182Jan 10, 2018Updated 8 years ago
- An API for Distributed Machine Learning☆156Sep 22, 2016Updated 9 years ago
- An example of building kubernetes operator (Flink) using Abstract operator's framework☆26Jul 12, 2019Updated 6 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Distributed Streaming Matrix Factorization implemented on Spark for Recommendation Systems☆109Mar 25, 2016Updated 10 years ago
- A simple project that trains an OpenNLP Named Entity Recognition model to identify ingredients in a recipe.☆14Oct 30, 2016Updated 9 years ago
- A distributed implementation of AdaBoost.MH and MP-Boost using Apache Spark☆18Jul 7, 2016Updated 9 years ago
- StanCon2018 Helsinki Tutorial☆30Sep 16, 2019Updated 6 years ago
- Splash Project for parallel stochastic learning☆93Jun 16, 2017Updated 9 years ago
- Use AlluxioBlockManager to intead TachyonBlockManager as spark's off_heap.☆14Nov 3, 2016Updated 9 years ago
- Elastic Search on Spark☆110Oct 21, 2014Updated 11 years ago