This repository contains Machine-Learning MapReduce codes for Hadoop which are written from scratch (without using any package or library). E.g. Prediction (Linear and Logistic Regression), Clustering (K-Means), Classification (KNN) etc.
☆58Jan 29, 2026Updated last month
Alternatives and similar repositories for MLHadoop
Users that are interested in MLHadoop are comparing it to the libraries listed below
Sorting:
- (java) K nearest neighbour implementation for Hadoop MapReduce☆24Jul 29, 2015Updated 10 years ago
- K-Means Clustering using MapReduce☆74May 20, 2022Updated 3 years ago
- C4.5 is a commonly used in decision tree algorithm in data mining for classification. The existing C4.5 algorithm implementation is runni…☆14May 6, 2014Updated 11 years ago
- 超实用的hive表数据、分区,hdfs文件的自动化清理工具☆20Jun 21, 2022Updated 3 years ago
- Library to run in process Kafka broker☆16Nov 20, 2018Updated 7 years ago
- 专注大数据 Spark ML 机器学习:监督学习、无监督学习,主要有:分类算法、回归算法、聚类算法、推荐算法、频繁模式挖掘算法☆17Nov 6, 2020Updated 5 years ago
- KNN算法基于Hadoop平台的MapReduce实现☆12Jun 28, 2020Updated 5 years ago
- A tool that USES a template engine to generate data☆16Feb 5, 2024Updated 2 years ago
- ☆13Oct 16, 2025Updated 5 months ago
- Examples of Recommendations powered by MapReduce and mrjob☆56Aug 24, 2012Updated 13 years ago
- 个性化推荐算法的通用处理框架,基于Mahout和Lucene☆18May 25, 2015Updated 10 years ago
- Kairos, combines a focused crawler and an information extraction engine, to convert a list of conference websites into a index filled wit…☆18Feb 20, 2011Updated 15 years ago
- gulp plugin to convert html file to txt.☆10May 1, 2020Updated 5 years ago
- ☆22Oct 12, 2020Updated 5 years ago
- This project is a simple example of integration of Swagger with Spring 3 application (with Jersey 2).☆21Apr 6, 2015Updated 10 years ago
- ambari2.6集成flink1.9.1,网上其它项目支持的是standalone模式,此项目支持flink on yarn模式☆13Jul 23, 2020Updated 5 years ago
- ☆14Feb 25, 2020Updated 6 years ago
- Aho Corasick Multiple String search algorithm, based on Danny Yoo's implementation.☆23Feb 1, 2011Updated 15 years ago
- Onsite Analysis Infrastructure☆16Jun 23, 2020Updated 5 years ago
- ambari2.7.4,hdp3.1.4集成hue4.6.0,均是最新版☆19Jul 23, 2020Updated 5 years ago
- 思科vpn客户端☆12Nov 24, 2016Updated 9 years ago
- Implementation of text clustering algorithms including K-means, MBSAS, DBSCAN.☆44Nov 30, 2017Updated 8 years ago
- Implement Latent Aspect Rating Analysis in Python☆11Feb 4, 2017Updated 9 years ago
- ☆13Jul 19, 2018Updated 7 years ago
- node2vec implemented with Java☆12Nov 7, 2018Updated 7 years ago
- ☆12Feb 18, 2021Updated 5 years ago
- This is a Java-based benchmark for matrix approximation based collaborative filtering☆11Jul 12, 2025Updated 8 months ago
- In-graph collections for the Neo4j graph database.☆49Dec 16, 2023Updated 2 years ago
- 基于Apache-bahir-kudu-connector的flink-connector-kudu,支持Flink1.11.x DynamicTableSource/Sink,支持Range分区等☆45May 30, 2023Updated 2 years ago
- Neural recommender system implementation in TensorFlow.☆15Mar 24, 2023Updated 2 years ago
- ☆14Mar 11, 2014Updated 12 years ago
- Python Streaming Pipelines with Beam on Flink - Demo☆14Dec 8, 2022Updated 3 years ago
- NCG acceleration of ALS computing low rank matrix factorizations for Collaborative Filtering☆14Feb 15, 2016Updated 10 years ago
- Extensions for the Orson Charts library to support JavaFX.☆17May 22, 2025Updated 9 months ago
- 中文文本挖掘|舆情分析|Hadoop|Java|MapReduce☆23Dec 25, 2017Updated 8 years ago
- ☆11Sep 8, 2016Updated 9 years ago
- Package implements decision tree and isolation forest☆12Jun 9, 2017Updated 8 years ago
- API functions for Malware Research☆35Jul 9, 2019Updated 6 years ago
- 量化投资单因子分析☆22Aug 29, 2019Updated 6 years ago