asarraf / Algorithm-Implementation-Using-Map-Reduce
Page Rank, Inverted Index and Matrix Multiplication
☆9 Updated 7 years ago
Related projects
Alternatives and complementary repositories for Algorithm-Implementation-Using-Map-Reduce
- Merck challenge at Kaggle ☆10 Updated 10 years ago
- Programming assignments for Introduction to Recommendation Systems course on Coursera.org ☆16 Updated 3 years ago
- Assembly of fundamental statistics implemented based on Apache Spark ☆31 Updated 8 years ago
- Notes from Stanford NLP class ☆24 Updated 11 years ago
- Script to perform dictionary-based n-gram text tagging efficiently in Apache Spark ☆11 Updated 8 years ago
- k-means + a linear model = good results ☆55 Updated 10 years ago
- This project contains the code to translate between Apache Spark and SFrame. ☆21 Updated 8 years ago
- 4th Place Solution for The Hunt for Prohibited Content Competition on Kaggle (http://www.kaggle.com/c/avito-prohibited-content) ☆29 Updated 10 years ago
- A demo of how to use PageRank with Hadoop and SociaLite to identify anomalies in Healthcare Data ☆47 Updated 8 years ago
- Programs with word vectors, RNN, NLP stuff, etc. ☆18 Updated 7 years ago
- A Chef cookbook for deploying Spark ☆30 Updated 11 years ago
- Content-based Recommender System which implements sentiment analysis (Naive Bayes, SVMs) on Amazon product reviews. Built in Python(Beautif… ☆10 Updated 9 years ago
- Real-time log event processing using Spark, Kafka & Cassandra ☆13 Updated 9 years ago
- Named Entity Extraction on Twitter Stream using Apache Spark Streaming and Stanford CoreNLP ☆15 Updated 8 years ago
- The programming assignments of Natural Language Processing by Michael Collins on Coursera ☆14 Updated 11 years ago
- Stanford machine learning class on Coursera. Taught by Andrew Ng. Implemented the assignments with Matlab. ☆24 Updated 7 years ago
- Predicting closed questions on Stack Overflow ☆46 Updated 6 years ago
- Clustering documents based on LSH ☆14 Updated 8 years ago
- An API for Distributed Machine Learning ☆154 Updated 8 years ago
- Benchmarks for Kaggle's Predict Closed Questions on Stack Overflow competition ☆56 Updated 8 years ago
- Process large amount of Twitter data using Spark SQL (and its JSON support). Answers questions like "What are the most popular languages?… ☆9 Updated 9 years ago
- Movielens collaborative filtering with Solr streaming expression ☆11 Updated 8 years ago
- Predicting job salaries from ads - a Kaggle competition ☆55 Updated 10 years ago