punit-naik / MLHadoop
This repository contains Machine-Learning MapReduce codes for Hadoop which are written from scratch (without using any package or library). E.g. Prediction (Linear and Logistic Regression), Clustering (K-Means), Classification (KNN) etc.
☆57Updated last year
Alternatives and similar repositories for MLHadoop:
Users that are interested in MLHadoop are comparing it to the libraries listed below
- ☆19Updated 6 years ago
- K-Means Clustering using MapReduce☆76Updated 2 years ago
- Item and User-based KNN recommendation algorithms using PySpark☆126Updated 7 years ago
- A java implementation of LightGBM predicting part☆84Updated last year
- Deep Learning Pipelines for Apache Spark☆58Updated 7 years ago
- movie recommendation demo using collaborative filtering and lfm(spark mllib ALS)☆95Updated 8 years ago
- An example Python ALS recommender system☆24Updated 9 years ago
- A parallel distributed implementation of DBSCAN on Spark using Python☆75Updated 6 years ago
- News recommendation system based on spark.☆47Updated 8 years ago
- Distributed Streaming Matrix Factorization implemented on Spark for Recommendation Systems☆106Updated 8 years ago
- Examples of Invoking TensorFlow from Java☆74Updated 6 years ago
- Some popular algorithms(dbscan,knn,fm etc.) on spark☆32Updated 6 years ago
- spark mllib example☆28Updated 9 years ago
- 这是一个最大熵的简明Java实现,提供提供训练与预测接口。训练算法采用GIS训练算法,附带示例训练集和一个天气预测的Demo。☆47Updated 10 years ago
- LSTM and GRU in JAVA☆113Updated 5 years ago
- ☆15Updated 5 years ago
- code exercise: dbscan(ballTree improve) | ctr(ftrl) | text classification(bayes..) | kmeans | general LR |..☆26Updated 9 years ago
- field-aware factorization machine implemented by java with an experiment using criteo data set.☆39Updated 9 years ago
- graphx example☆24Updated 9 years ago
- Machine Learning with Spark - Second Edition, by Packt☆115Updated 4 years ago
- Spark GraphX - Pregel, PageRank and Dijkstra on a social graph☆23Updated 6 years ago
- ☆14Updated 9 years ago
- word2vec的Java并行实现☆126Updated 8 years ago
- 主要解决ctr预估工程中的特征选择,特征编号(特征离散),单特征auc和logloss这3个问题.☆20Updated 7 years ago
- Java interface for fastText☆231Updated last year
- JPMML-SparkML plugin for converting XGBoost4J-Spark models to PMML☆36Updated 4 years ago
- Java library and command-line application for converting TensorFlow models to PMML☆75Updated 6 years ago
- Spark algorithms for building k-nn graphs☆42Updated 6 years ago
- someCode with tianchi☆24Updated 9 years ago
- JARs for XGBoost built on Linux, OS X and Windows☆52Updated 4 years ago