abij / hadoop-wiki-pageranking
Calculate the PageRank of the pages in the wikipedia dump.
☆53Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for hadoop-wiki-pageranking
- Large-scale ML & graph analytics on Giraph☆79Updated 8 years ago
- Code for KDD 2014 paper "Mining Topics in Documents: Standing on the Shoulders of Big Data"☆21Updated 9 years ago
- Using latent Dirichlet allocation (LDA) in Apache Lucene☆58Updated 11 years ago
- The S-Space repsitory, from the AIrhead-Research group☆205Updated 4 years ago
- An implementation of the multi-class/multi-label classifier, of which the training is carried out using AdaBoost.MH on Apache Spark.☆107Updated 10 years ago
- ADMM based large scale logistic regression☆335Updated 10 months ago
- Course homepages for courses that I've taught at the University of Maryland☆53Updated 8 years ago
- ☆11Updated 8 years ago
- Random Walk (Personalized PageRank) Algorithms for Large Graphs☆73Updated 8 years ago
- Implementation of the Apriori algorithm using Spark.☆38Updated 10 years ago
- A Distributed Matrix Operations Library Built on Top of Spark☆105Updated 7 years ago
- This is a fork of the Stanford Named Entity Recognizer with added support for deploying in Java servlet mode. See github.com/dat/pyner fo…☆90Updated 11 years ago
- Me testing our tensorflow and mnist in java☆25Updated 6 years ago
- Solution to Facebook's link prediction contest on Kaggle.☆204Updated 12 years ago
- A text classifier based on Decision Trees ID3, Naive Bayes and KNN algorithm in C++ and JAVA.☆39Updated 6 years ago
- Recommender Systems in Depth: An introduction to Recommender Systems using Python and Crab☆44Updated 11 years ago
- Analytic UIMA pipelines using Spark☆23Updated 8 years ago
- Repository of code that analysis data from the Yelp Academic Dataset Challenge☆30Updated 5 years ago
- DBpedia.org RDF to CSV for import into Neo4j☆51Updated 9 years ago
- Yahoo!'s topic modelling framework using Latent Dirichlet Allocation☆98Updated 13 years ago
- Trident-ML : A realtime online machine learning library☆382Updated 10 months ago
- Locality Sensitive Hashing for Apache Spark☆196Updated 8 years ago
- The Deep Learning training framework on Spark☆220Updated 9 years ago
- Zen aims to provide the largest scale and the most efficient machine learning platform on top of Spark, including but not limited to logi…☆170Updated 5 years ago
- Lintools: tools by @lintool☆22Updated 5 years ago
- ☆53Updated 7 years ago
- GraphChi's Java version☆238Updated 11 months ago
- Mazerunner extends a Neo4j graph database to run scheduled big data graph compute algorithms at scale with HDFS and Apache Spark.☆381Updated last year