punit-naik / MLHadoop
This repository contains Machine-Learning MapReduce codes for Hadoop which are written from scratch (without using any package or library). E.g. Prediction (Linear and Logistic Regression), Clustering (K-Means), Classification (KNN) etc.
☆57Updated 2 years ago
Alternatives and similar repositories for MLHadoop
Users that are interested in MLHadoop are comparing it to the libraries listed below
Sorting:
- Machine Learning with Spark - Second Edition, by Packt☆115Updated 4 years ago
- Deep Learning Pipelines for Apache Spark☆58Updated 7 years ago
- Item and User-based KNN recommendation algorithms using PySpark☆126Updated 7 years ago
- K-Means Clustering using MapReduce☆75Updated 2 years ago
- Some popular algorithms(dbscan,knn,fm etc.) on spark☆32Updated 6 years ago
- A parallel distributed implementation of DBSCAN on Spark using Python☆75Updated 6 years ago
- Java library and command-line application for converting TensorFlow models to PMML☆75Updated 7 years ago
- 机器学习项目☆38Updated 8 years ago
- Spark SQL UDF examples☆56Updated 7 years ago
- A java implementation of LightGBM predicting part☆84Updated last year
- Distributed Streaming Matrix Factorization implemented on Spark for Recommendation Systems☆106Updated 9 years ago
- spark mllib example☆28Updated 9 years ago
- graphx example☆24Updated 9 years ago
- News recommendation system based on spark.☆47Updated 8 years ago
- ☆41Updated 8 years ago
- ☆28Updated 6 years ago
- Criteo/Kaggle Competition of CTR prediction☆130Updated 10 years ago
- Spark MLlib Learning☆71Updated 8 years ago
- Spark algorithms for building k-nn graphs☆42Updated 6 years ago
- Item-Based Collaborative Filtering Spark Job (use cosin similarity)☆37Updated 8 years ago
- use xgboost and lr model for text classification. xgboost is used to be a feature transform for LR☆44Updated 7 years ago
- Create scalable machine learning applications to power a modern data-driven business using Spark☆60Updated 2 years ago
- Assembly of fundamental statistics implemented based on Apache Spark☆31Updated 9 years ago
- Hybrid model of Gradient Boosting Trees and Logistic Regression (GBDT+LR) on Spark☆88Updated 6 years ago
- Recommendation engine based on contextual word embeddings☆136Updated 8 years ago
- Simple examle for Spark Streaming over Kafka topic☆106Updated 4 years ago
- movie recommendation demo using collaborative filtering and lfm(spark mllib ALS)☆95Updated 8 years ago
- An implementation of GBDT+FM☆24Updated 8 years ago
- An implementation of DBSCAN runing on top of Apache Spark☆183Updated 7 years ago
- Build a News Recommendation Engine Using Apache Mahout and the Google News Personalization Paper☆23Updated 12 years ago