intel-spark/SparseML

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/intel-spark/SparseML)

intel-spark / SparseML

Spark MLlib code optimized to efficiently support sparse data

☆51

Alternatives and similar repositories for SparseML

Users that are interested in SparseML are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

intel-spark / StatisticsOnSpark
View on GitHub
Assembly of fundamental statistics implemented based on Apache Spark
☆31Feb 11, 2016Updated 10 years ago
intel-spark / TopicModeling
View on GitHub
Topic Modeling on Apache Spark
☆94Mar 1, 2019Updated 7 years ago
Open-Network-Insight / oni-ml
View on GitHub
The machine learning component of Open Network Insight: scalable analytics combining spark for big data and C / MPI for high performance …
☆13Nov 9, 2016Updated 9 years ago
intel-machine-learning / DistML
View on GitHub
DistML provide a supplement to mllib to support model-parallel on Spark
☆170Feb 6, 2017Updated 9 years ago
rjagerman / glint
View on GitHub
Glint: High performance scala parameter server
☆170Jul 20, 2018Updated 8 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
intel / BigDL-core
View on GitHub
Core HW bindings and optimizations for BigDL
☆37Nov 24, 2025Updated 7 months ago
fabiopetroni / libfm_with_BPR
View on GitHub
☆20Dec 1, 2016Updated 9 years ago
intel-spark / InformationExtraction
View on GitHub
☆21Oct 13, 2016Updated 9 years ago
benoitdancoisne / SparkMaxFlow
View on GitHub
Spark implementation of Ford-Fulkerson algorithm
☆14Feb 11, 2018Updated 8 years ago
maropu / hivemall-spark
View on GitHub
A Hivemall wrapper for Spark
☆31Apr 21, 2016Updated 10 years ago
memsql / streamliner-examples
View on GitHub
Example code for building your own MemSQL Streamliner Pipelines
☆23Apr 18, 2017Updated 9 years ago
collectivemedia / spark-ext
View on GitHub
Spark Extension : ML transformers, SQL aggregations, etc that are missing in Apache Spark
☆145Jan 26, 2016Updated 10 years ago
brkyvz / lazy-linalg
View on GitHub
A package full of linear algebra operators for Apache Spark MLlib's linalg package
☆10Sep 9, 2015Updated 10 years ago
amplab / keystone
View on GitHub
Simplifying robust end-to-end machine learning on Apache Spark.
☆473Apr 18, 2017Updated 9 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
vicpara / exploratory-data-analysis
View on GitHub
Spark library for doing exploratory data analysis in a scalable way
☆43Jan 17, 2016Updated 10 years ago
influxdata / influxdb-scala
View on GitHub
Scala client for InfluxDB
☆22Nov 15, 2022Updated 3 years ago
Angel-ML / sona
View on GitHub
Spark On Angel, arming Spark with a powerful Parameter Server, which enable Spark to train very big models
☆85Jan 2, 2023Updated 3 years ago
pomadchin / accumulo-spark
View on GitHub
Docker containers with Apache Accumulo and Apache Spark environment.
☆12Jan 22, 2016Updated 10 years ago
szilard / xgboost-adv-workshop-LA
View on GitHub
Advanced workshop on XGBoost with Tianqi Chen in Santa Monica, June 2, 2016
☆27Nov 21, 2016Updated 9 years ago
TrueCar / mleap
View on GitHub
MLeap allows for easily putting Spark ML pipelines into production
☆78Oct 27, 2016Updated 9 years ago
PasaLab / marlin
View on GitHub
A Distributed Matrix Operations Library Built on Top of Spark
☆110Dec 28, 2016Updated 9 years ago
kanyun-inc / ytk-mp4j
View on GitHub
Ytk-mp4j is a fast, user-friendly, cross-platform, multi-process, multi-thread collective message passing java library which includes gat…
☆112Jun 14, 2017Updated 9 years ago
linkedin / photon-ml
View on GitHub
A scalable machine learning library on Apache Spark
☆797Aug 30, 2021Updated 4 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
JuliaDatabases / Hive.jl
View on GitHub
Hive, Spark SQL, Impala client. Based on Thrift and HiveServer2 protocol.
☆15Oct 12, 2020Updated 5 years ago
kzhai / InfVocLDA
View on GitHub
Online Latent Dirichlet Allocation with Infinite Vocabulary using Variational Inference
☆74Sep 28, 2015Updated 10 years ago
seglo / exactly-once-streams
View on GitHub
An engineering report on using transactions in Kafka 0.11.0.0
☆19Feb 27, 2018Updated 8 years ago
adatao / tensorspark
View on GitHub
TensorFlow on Spark
☆295Oct 19, 2017Updated 8 years ago
avulanov / scala-blas
View on GitHub
Benchmarks of BLAS libraries with Scala interface
☆30Jan 21, 2016Updated 10 years ago
kotakanbe / go-pingscanner
View on GitHub
Scanning alive hosts of the given CIDR range in parallel.
☆10May 8, 2025Updated last year
cloudml / zen
View on GitHub
Zen aims to provide the largest scale and the most efficient machine learning platform on top of Spark, including but not limited to logi…
☆169Nov 17, 2018Updated 7 years ago
akopich / dplsa
View on GitHub
Distributed implementation of Robust PLSA using Spark
☆12Apr 29, 2021Updated 5 years ago
tresata / spark-columnar
View on GitHub
☆15Mar 4, 2015Updated 11 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
tapanalyticstoolkit / spark-tensorflow-connector
View on GitHub
Import and export TensorFlow records from/to Spark
☆18Jul 7, 2017Updated 9 years ago
irvingc / dbscan-on-spark
View on GitHub
An implementation of DBSCAN runing on top of Apache Spark
☆181Jan 10, 2018Updated 8 years ago
chimpler / blog-spark-food-recommendation
View on GitHub
Simple example on how to use recommenders in Spark / MLlib
☆70Oct 15, 2020Updated 5 years ago
lucidworks / hbase-indexer
View on GitHub
HBase Indexer - indexing HBase to Solr 5.x and higher
☆13Oct 27, 2017Updated 8 years ago
amplab / ml-matrix
View on GitHub
Distributed Matrix Library
☆73Jan 28, 2017Updated 9 years ago
brkyvz / streaming-matrix-factorization
View on GitHub
Distributed Streaming Matrix Factorization implemented on Spark for Recommendation Systems
☆109Mar 25, 2016Updated 10 years ago
lightbend / flink-k8s-operator
View on GitHub
An example of building kubernetes operator (Flink) using Abstract operator's framework
☆26Jul 12, 2019Updated 7 years ago