cjdd3b / pairwise-mapreduce
Implementation of a pairwise document similarity algorithm using MapReduce.
☆15Updated 13 years ago
Alternatives and similar repositories for pairwise-mapreduce:
Users that are interested in pairwise-mapreduce are comparing it to the libraries listed below
- The notes and slides from my PyCon Ireland 2016 PyData talk an introduction to gradient boosting☆18Updated 8 years ago
- Different approaches to computing document similarity☆28Updated 8 years ago
- Some examples of Yhat☆23Updated 10 years ago
- Dato/Turi DS Conf talk on NLP and Elasticsearch analysis of reviews, plus JS implementation☆42Updated 8 years ago
- Textual Analysis of speeches using Google's Word2Vec Model☆31Updated 4 years ago
- R files containing the code used to predict rugby world cup matches☆10Updated 9 years ago
- Advanced workshop on XGBoost with Tianqi Chen in Santa Monica, June 2, 2016☆26Updated 8 years ago
- Tutorial on "Modern Optimization Methods in Python"☆18Updated 8 years ago
- Scikit-learn quickstart tutorial for Webstep☆18Updated 7 years ago
- kaggle walmart-recruiting-sales-in-stormy-weather☆47Updated 9 years ago
- A set of methods that predict the future values of popularity indices for news posts using a variety of features.☆33Updated 7 years ago
- ☆26Updated 9 years ago
- Source code for exploring MLlib blog post☆11Updated 9 years ago
- ☆28Updated 8 years ago
- Cython implementation of DeepWalk☆54Updated last year
- Understanding Probabilistic Topic Models with Simulation in Python☆64Updated 7 years ago
- ☆13Updated 5 years ago
- Python (PyMC) adaptation of the R code from "Doing Bayesian Data Analysis"☆65Updated 7 years ago
- A tool that evolves small brains capable of scanning and classifying an image.☆13Updated 8 years ago
- Module 7: Introduction to D3.js☆21Updated 8 years ago
- Repo for Working with Open Data (Spring 2014 edition), a course at the School of Information, UC Berkeley☆34Updated 9 years ago
- Code for PyData Talk on "Classifying Products Based on Images and Text using Keras"☆30Updated 7 years ago
- Public Machine Learning and Data Competition Repo☆54Updated 9 years ago
- Healthcare Twitter Analysis☆26Updated 8 years ago
- Datasets and notebooks☆13Updated 8 years ago
- 4th Place Solution for The Hunt for Prohibited Content Competition on Kaggle (http://www.kaggle.com/c/avito-prohibited-content)☆28Updated 10 years ago
- Kaggle competition☆23Updated 9 years ago
- ggplot2-inspired d3 app to make instant interactive visualizations☆55Updated 12 years ago
- Code and data for "The Geometry of Classifiers"☆26Updated 4 years ago
- Word2Vec models with Twitter data using Spark. Blog:☆65Updated 6 years ago