endymecy/AlgorithmsOnSpark

endymecy / AlgorithmsOnSpark

Some popular algorithms (DBSCAN, KNN, FM, etc.) on Spark

☆32

Alternatives and similar repositories for AlgorithmsOnSpark

Users that are interested in AlgorithmsOnSpark are comparing it to the libraries listed below.

AtlasPilotPuppy / SparkAlgorithms
Additional useful algorithms that can be used with Spark.
☆24 · Dec 24, 2014 · Updated 11 years ago
KDDtest / SCNN
☆12 · Apr 19, 2024 · Updated 2 years ago
linux-devil / spark-dbscan
DBSCAN clustering algorithm implemented in Apache Spark (MapReduce Framework).
☆13 · May 5, 2016 · Updated 10 years ago
irvingc / dbscan-on-spark
An implementation of DBSCAN running on top of Apache Spark
☆181 · Jan 10, 2018 · Updated 8 years ago
picnicml / doddle-model-examples
doddle-model code examples
☆19 · Sep 23, 2019 · Updated 6 years ago
alexander-n-thomas / nlp.spark.annotate
notebooks for nlp-on-spark
☆13 · Jan 27, 2017 · Updated 9 years ago
qf6101 / multinomial-factorization-machines
Multinomial Factorization Machines
☆21 · Oct 17, 2016 · Updated 9 years ago
jdinkla / location-based-nearest-neighbours
Using k-d trees with Apache Spark and Scala
☆11 · Jun 23, 2026 · Updated 3 weeks ago
xubo245 / SparkLearning_NoData
SparkLearning_NoData, including code, pom, and so on
☆13 · Mar 21, 2017 · Updated 9 years ago
hibayesian / spark-fm
A parallel implementation of factorization machines based on Spark
☆75 · Jun 28, 2020 · Updated 6 years ago
Puyodead1 / udemy-dl-go
A WIP Udemy downloader written in Go
☆11 · Mar 20, 2022 · Updated 4 years ago
tapanalyticstoolkit / spark-tk
Python and Scala APIs for enhanced Spark analytics
☆12 · Mar 15, 2017 · Updated 9 years ago
dleung / spark-streaming-kafka-example
An example project using Spark Streaming with Kafka message and Avro serialization
☆12 · Aug 21, 2015 · Updated 10 years ago
xmrec / xmrec.github.io
☆23 · Dec 16, 2022 · Updated 3 years ago
jianzhu / dl-rerank
☆11 · May 8, 2020 · Updated 6 years ago
myd620 / boolindexer
A library for retrieving boolean expressions
☆10 · May 22, 2018 · Updated 8 years ago
kaist-dmlab / RP-DBSCAN
☆59 · Jan 28, 2020 · Updated 6 years ago
Angel-ML / sona
Spark On Angel, arming Spark with a powerful Parameter Server, which enables Spark to train very big models
☆85 · Jan 2, 2023 · Updated 3 years ago
Gschiavon / Kafka-SparkStreaming-HDFS
☆14 · Nov 3, 2016 · Updated 9 years ago
vinayprabhu / Favorite_PyPi_2020
Collection of my favorite Python packages from 2020
☆11 · Jan 12, 2021 · Updated 5 years ago
seanpquig / confluent-platform-spark-streaming
Working example of consuming Avro data from Kafka with Spark Streaming
☆12 · Feb 21, 2016 · Updated 10 years ago
MLnick / glint-fm
Factorization Machines on Spark and Glint
☆25 · Nov 7, 2016 · Updated 9 years ago
shilinlee / blog
Blog
☆16 · Sep 17, 2025 · Updated 10 months ago
bomeng / Heracles
High performance HBase / Spark SQL engine
☆28 · Jul 7, 2022 · Updated 4 years ago
manasbundele / big-data-projects
These are a select few projects related to Big Data Analytics and Management. The projects listed are a combination of both small and big…
☆11 · Oct 11, 2019 · Updated 6 years ago
viirya / SparkAffinityPropagation
Affinity Propagation on Spark
☆20 · May 31, 2021 · Updated 5 years ago
JerryLead / SparkFaultBench
A Spark Reliability Testing Suite
☆13 · Jan 10, 2017 · Updated 9 years ago
endymecy / spark-config-and-tuning
Summary of Spark performance tuning (spark config and tuning)
☆118 · Mar 9, 2018 · Updated 8 years ago
vineetpandey / HackerRank---The-Linux-Shell-Problems_Solutions
Problems can be found at https://www.hackerrank.com/domains/shell/bash/
☆13 · Jan 20, 2015 · Updated 11 years ago
YCG09 / xgbspark-text-classification
XGBoost on Spark for Chinese Text Classification
☆46 · May 31, 2018 · Updated 8 years ago
yjfiejd / Sales_prediction
Rossmann Store Sales: https://www.kaggle.com/c/rossmann-store-sales
☆10 · May 13, 2018 · Updated 8 years ago
BigDataScholar / FlinkECUserBehaviorAnalysis
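A repository started after watching Atguigu's hands-on Flink videos, recording the source code and some of the required data files; everyone is welcome to join the discussion
☆17 · Mar 1, 2021 · Updated 5 years ago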
kunguang / SelectFeature
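Mainly addresses three problems in CTR prediction engineering: feature selection, feature numbering (feature discretization), and single-feature AUC and logloss.
☆20 · Mar 30, 2017 · Updated 9 years ago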
spoddutur / spark-streaming-monitoring-with-lightning
Plot live stats as a graph from an Apache Spark application using Lightning-viz
☆18 · Jul 3, 2017 · Updated 9 years ago
ZiYu0427 / spark
☆24 · Mar 11, 2016 · Updated 10 years ago
anilmuppalla / hpdc-scalding-spark
Code for Springer Book: High Performance Distributed Computing: Case Studies with Hadoop, Scalding and Spark
☆15 · Oct 6, 2017 · Updated 8 years ago
obackhoff / spark-clustream
☆13 · Nov 2, 2017 · Updated 8 years ago
rmetzger / flink-streaming-etl
A demo repository for "streaming etl" with Apache Flink
☆44 · Jun 8, 2016 · Updated 10 years ago
netease-bigdata / ne-spark-courseware
NetEase Spark Courses
☆15 · Sep 4, 2018 · Updated 7 years ago