cjdd3b / pairwise-mapreduce
Implementation of a pairwise document similarity algorithm using MapReduce.
☆15Updated 12 years ago
Related projects ⓘ
Alternatives and complementary repositories for pairwise-mapreduce
- Public Machine Learning and Data Competition Repo☆54Updated 8 years ago
- Stability analysis for topic models☆50Updated 8 years ago
- Dato/Turi DS Conf talk on NLP and Elasticsearch analysis of reviews, plus JS implementation☆43Updated 8 years ago
- [hibernating] Dynamic topic models☆39Updated 9 years ago
- Library for Geo-Inferencing in Twitter Data☆28Updated 8 years ago
- Understanding Probabilistic Topic Models with Simulation in Python☆64Updated 7 years ago
- Visualization of text sentiment using deep learning☆44Updated 8 years ago
- Different approaches to computing document similarity☆28Updated 7 years ago
- Jupyter notebook containing code from text preprocessing blog post☆10Updated 7 years ago
- Code for PyData Talk on "Classifying Products Based on Images and Text using Keras"☆30Updated 7 years ago
- Some examples of Yhat☆23Updated 10 years ago
- Scripts and codes for replicating experiments published in Exploring Topic Coherence over many models and many topics☆82Updated 2 years ago
- Theano implementation of GloVe for graphs☆46Updated 9 years ago
- An in depth tutorial on sklearn's Pipeline and FeatureUnion classes.☆16Updated 7 years ago
- My capstone project for Galvanize (Zipfian Academy)☆38Updated 5 years ago
- Word2Vec models with Twitter data using Spark. Blog:☆65Updated 5 years ago
- In-database parallel grid-search for XGBoost on Greenplum☆15Updated 6 years ago
- Machine learning prediction of movies genres using Gensim's Doc2Vec and PyMongo - (Python, MongoDB)☆36Updated last year
- ggplot2-inspired d3 app to make instant interactive visualizations☆55Updated 12 years ago
- Tensor-based recommender system that incorporates categorical contextual information into collaborative filtering workflow.☆26Updated 8 years ago
- The notes and slides from my PyCon Ireland 2016 PyData talk an introduction to gradient boosting☆18Updated 8 years ago
- Collection of dask example notebooks☆57Updated 6 years ago
- These are the IPython notebook files for the CSC 432 Spring '13 course.☆23Updated 9 years ago
- Articles on Data Science, Jupyter, and Pandas☆18Updated 8 years ago
- Repo for Working with Open Data (Spring 2014 edition), a course at the School of Information, UC Berkeley☆34Updated 8 years ago
- Algorithms for "schema matching"☆25Updated 8 years ago
- Tutorial on "Modern Optimization Methods in Python"☆18Updated 8 years ago
- Material and slides for Boston NLP meetup May 23rd 2016☆17Updated 8 years ago
- Recommender systems in Python☆50Updated 9 years ago
- Turning news into events since 2014.☆50Updated 7 years ago