CSE601-DataMining / ClusteringLinks
Implement three clustering algorithms to find clusters of genes that exhibit similar expression profiles: K-means, Hierarchical Agglomerative clustering with Single Link (Min), and one from (density-based, mixture model, spectral). Set up a single-node Hadoop cluster on your machine and implement MapReduce K-means. Compare with non-parallel…
☆12Updated 10 years ago
Alternatives and similar repositories for Clustering
Users that are interested in Clustering are comparing it to the libraries listed below
Sorting:
- ☆11Updated 8 years ago
- 这是Word2vec和Doc2vec的一个应用示例:用Word2vec计算词的相似度和用doc2vec计算句子的相似度。☆26Updated 7 years ago
- 计算两个特征向量的相似度☆26Updated 6 years ago
- 基于Spark和Kubernetes的机器学习平台☆30Updated 7 years ago
- 自助搭建的 hadoop + spark + kafka + zookeeper + storm + hbase + hive + flume 集群,一主两从。☆30Updated 6 years ago
- Several implementation for building hbase secondary index.☆39Updated 9 years ago
- Spark PMML 模型离线部署☆12Updated 2 years ago
- ☆12Updated 8 years ago
- 基于Spark MLlib ALS的音乐推荐系统☆29Updated 9 years ago
- Spark中实现用户画像系统价值度、忠诚度、流失预警、活跃度等模型☆66Updated 8 years ago
- Kafka Eagle used to describe the use of Wiki☆11Updated 5 years ago
- 基于hdfs spark的视频非结构化数据计算☆35Updated 9 years ago
- 关于Spark的源码分析,以及平时工作的一些总结☆31Updated 9 years ago
- 使用Spark的MLlib、Hbase作为模型、Hive作数据清洗的核心推荐引擎,在Spark on Yarn测试通过☆30Updated 8 years ago
- High Performance Spark Streaming with Direct Kafka in Java☆39Updated 8 years ago
- Flink phase2 java code☆12Updated 2 years ago
- Spark 编程指南简体中文版☆33Updated 8 years ago
- kafka传数据到Flink存储到mysql之Flink使用SQL语句聚合数据流(设置时间窗 口,EventTime)☆32Updated 7 years ago
- easyrec☆52Updated 12 years ago
- R 语言实现的常用的推荐算法itemCF,UserCF ,Tags,SVD,Apriori☆18Updated 8 years ago
- ☆21Updated 8 years ago
- Natural Language Processing algorithm including TextClassification, sentiment analysis, TextRank, LDA and so on☆11Updated 8 years ago
- This Pinyin Analysis plugin is used to do conversion between Chinese characters and Pinyin.☆10Updated 6 years ago
- Refactored version for https://github.com/shirdrn/document-processor.git☆15Updated 8 years ago
- 语义、情感、相似度分析。☆58Updated 9 years ago
- ☆24Updated 9 years ago
- 基于词典的负面舆情信息评分算法。☆26Updated 10 years ago
- 大数据框架 Spark MLlib 机器学习库基础算法全面讲解,附带齐全的测试 文件☆40Updated last year
- spark高级数据分析☆20Updated 9 years ago
- 常用文本聚类算法java实现☆15Updated 10 years ago