abij / hadoop-wiki-pageranking
Calculate the PageRank of the pages in the wikipedia dump.
☆52Updated 2 years ago
Alternatives and similar repositories for hadoop-wiki-pageranking:
Users that are interested in hadoop-wiki-pageranking are comparing it to the libraries listed below
- Using latent Dirichlet allocation (LDA) in Apache Lucene☆58Updated 12 years ago
- Trident-ML : A realtime online machine learning library☆381Updated last year
- Implementation of the Apriori algorithm using Spark.☆38Updated 10 years ago
- Code for KDD 2014 paper "Mining Topics in Documents: Standing on the Shoulders of Big Data"☆21Updated 9 years ago
- Scalable recommendation system written in Scala using the Apache Spark framework☆105Updated 10 years ago
- Simple Spark Application☆76Updated last year
- Vector-free L-BFGS implementation on Spark☆9Updated 9 years ago
- A SimRank algorithm implementation using Spark☆49Updated 11 years ago
- This repository contains code files specifically IPython notebooks for the assignments in the course "Scalable Machine Learning" by UC Be…☆30Updated 9 years ago
- ☆23Updated 9 years ago
- Naive K-Means clustering with MapReduce☆21Updated 3 years ago
- Code for the ACL-2015 paper "Accurate Linear-Time Chinese Word Segmentation via Embedding Matching"☆38Updated 9 years ago
- GraphChi's Java version☆238Updated last year
- Former GraphX development repository. GraphX has been merged into Apache Spark; please submit pull requests there.☆360Updated 2 years ago
- Entity level sentiment analysis for product reviews using deep learning☆54Updated 8 years ago
- ☆21Updated 9 years ago
- pairwise learning to rank with logistic regression☆19Updated 8 years ago
- Semantic Preserving Embeddings for Generalized Graphs☆31Updated 6 years ago
- Online LDA based on Spark☆17Updated 9 years ago
- An implementation of the multi-class/multi-label classifier, of which the training is carried out using AdaBoost.MH on Apache Spark.☆107Updated 10 years ago
- Open-domain question answering system from UNC Charlotte☆61Updated 9 years ago
- General Vectorization Lib for Machine Learning Tools☆31Updated 8 years ago
- Scalable Topic Modeling using Variational Inference in MapReduce☆150Updated 9 years ago
- ☆55Updated 10 years ago
- A Distributed Matrix Operations Library Built on Top of Spark☆106Updated 8 years ago
- My own comments and modifications to word2vec by Mikolov et al.☆16Updated 9 years ago
- CMU-OAQA LiveQA system☆19Updated 8 years ago
- Solution to Facebook's link prediction contest on Kaggle.☆204Updated 12 years ago
- Criteo/Kaggle Competition of CTR prediction☆130Updated 10 years ago
- ☆18Updated 8 years ago