CSE601-DataMining / ClusteringLinks

Implement three clustering algorithms to find clusters of genes that exhibit similar expression profiles: K-means, Hierarchical Agglomerative clustering with Single Link (Min), and one from (density-based, mixture model, spectral). Set up a single-node Hadoop cluster on your machine and implement MapReduce K-means. Compare with non-parallel…

☆12

Alternatives and similar repositories for Clustering

Users that are interested in Clustering are comparing it to the libraries listed below

Sorting:

Quincy1994 / MachineLearning
☆11Updated 8 years ago
mpk001 / Doc_Word2vecTest
这是Word2vec和Doc2vec的一个应用示例：用Word2vec计算词的相似度和用doc2vec计算句子的相似度。
☆26Updated 7 years ago
SnailWhb / elasticsearch_pulgine_fast-vector-distance
计算两个特征向量的相似度
☆26Updated 6 years ago
wyc941012 / MachineLearningPlatform
基于Spark和Kubernetes的机器学习平台
☆30Updated 7 years ago
WangYang-wy / YangDocker
自助搭建的 hadoop + spark + kafka + zookeeper + storm + hbase + hive + flume 集群，一主两从。
☆30Updated 6 years ago
mayanhui / hbase-secondary-index
Several implementation for building hbase secondary index.
☆39Updated 9 years ago
Skycrab / model-deploy
Spark PMML 模型离线部署
☆12Updated 2 years ago
dylan-fan / spark_graphx_community_detection
☆12Updated 8 years ago
vbay / MusicRecommender
基于Spark MLlib ALS的音乐推荐系统
☆29Updated 9 years ago
Chihuataneo / Spark_Personas
Spark中实现用户画像系统价值度、忠诚度、流失预警、活跃度等模型
☆66Updated 8 years ago
smartloli / Kafka-Eagle-Wiki
Kafka Eagle used to describe the use of Wiki
☆11Updated 5 years ago
gjhkael / cipher
基于hdfs spark的视频非结构化数据计算
☆35Updated 9 years ago
gjhkael / deployDoc
关于Spark的源码分析，以及平时工作的一些总结
☆31Updated 9 years ago
wengbenjue / spark_recomend
使用Spark的MLlib、Hbase作为模型、Hive作数据清洗的核心推荐引擎,在Spark on Yarn测试通过
☆30Updated 8 years ago
ameyamk / spark-streaming-direct-kafka
High Performance Spark Streaming with Direct Kafka in Java
☆39Updated 8 years ago
skypeaceLL / sky-flink-phase2-java
Flink phase2 java code
☆12Updated 2 years ago
endymecy / spark-programming-guide-zh-cn
Spark 编程指南简体中文版
☆33Updated 8 years ago
xxrznj / flink-kafka-sql
kafka传数据到Flink存储到mysql之Flink使用SQL语句聚合数据流（设置时间窗口，EventTime）
☆32Updated 7 years ago
hernad / easyrec
easyrec
☆52Updated 12 years ago
linzhouzhi / recommend
R 语言实现的常用的推荐算法itemCF,UserCF ,Tags,SVD,Apriori
☆18Updated 8 years ago
fansy1990 / movie_recommend
☆21Updated 8 years ago
STHSF / NaturalLanguageProcessing_Spark
Natural Language Processing algorithm including TextClassification, sentiment analysis, TextRank, LDA and so on
☆11Updated 8 years ago
RickyHuo / elasticsearch-analysis-pinyin
This Pinyin Analysis plugin is used to do conversion between Chinese characters and Pinyin.
☆10Updated 6 years ago
shirdrn / libsvm-dp
Refactored version for https://github.com/shirdrn/document-processor.git
☆15Updated 8 years ago
ml-distribution / semantic-similarity
语义、情感、相似度分析。
☆58Updated 9 years ago
mvalleavila / Kafka-Storm-ElasticSearch
☆24Updated 9 years ago
ml-distribution / negative-sentiment
基于词典的负面舆情信息评分算法。
☆26Updated 10 years ago
Java-Edge / Spark-MLlib-Tutorial
大数据框架 Spark MLlib 机器学习库基础算法全面讲解,附带齐全的测试文件
☆40Updated last year
xiaohei-info / spark-demo
spark高级数据分析
☆20Updated 9 years ago
x-shadow-x / TextCluster
常用文本聚类算法java实现
☆15Updated 10 years ago