black-tea / data-projects
A compendium of data projects and associated blog posts
☆10Updated 5 years ago
Alternatives and similar repositories for data-projects
Users that are interested in data-projects are comparing it to the libraries listed below
Sorting:
- classify a job description (or noisy job title) into a ONET job title☆19Updated 8 years ago
- Topic Modelling for Humans☆22Updated 7 years ago
- DJIA index prices of 10 years and NYtimes news articles headline has been used to predict the DJIA index prices☆17Updated 7 years ago
- Tutorials on session-based recommender systems☆11Updated 8 years ago
- Automatic Text Summarization with Machine Learning☆16Updated 7 years ago
- ☆10Updated 4 years ago
- store my personal project☆22Updated 4 years ago
- Quora Kaggle Competition : Natural Language Processing using word2vec embeddings, scikit-learn and xgboost for training☆18Updated 6 years ago
- Spark NLP for Streamlit☆15Updated 3 years ago
- ngram graphs library☆12Updated 3 years ago
- Movie recommendations based on user written passages about preferred movies.☆16Updated 5 years ago
- KDD Hands-On Tutorial (2018)☆29Updated 2 years ago
- ☆16Updated 4 years ago
- Tutorial code and data for the entity resolution workshops.☆45Updated 9 years ago
- Using NLP to cluster reddit user comments by topics☆13Updated 7 years ago
- Build intelligent data-driven applications with minimal effort. Sentence Clustering, Topics Extraction, Text Similarity, Opinion Summariz…☆40Updated 5 years ago
- ☆36Updated 8 years ago
- Follow the Lumiata Tech Blog on Medium!☆21Updated 2 years ago
- Predict the poverty of households in Costa Rica using automated feature engineering.☆23Updated 4 years ago
- Large-scale Graph Mining with Spark☆40Updated 6 years ago
- Teaching material and other info associated with the Information Extraction using Topic Models tutorial at SciPy US 2018.☆19Updated 6 years ago
- Slides, code and more for my class: Data Analytics and Machine Learning on Big Data☆8Updated 7 years ago
- Propensity models make true predictions about a customer’s future behavior. With propensity models you can truly anticipate a customer's …☆17Updated 5 years ago
- Advanced Python visualization library for Association Rules☆8Updated 3 years ago
- An analysis of traffic accident data for the UK in 2014, using data from the UK Data Service. (Sourced from Kaggle with original data com…☆12Updated 7 years ago
- Notebooks configured to be run with Binder, usually found on my blog.☆42Updated 2 years ago
- Pyspark in Google Colab: A simple machine learning (Linear Regression) model☆36Updated 6 years ago
- Customer life time analysis (CLV analysis). We are using Gamma-Gamma model to estimate average transaction value for each customer.☆46Updated 7 years ago
- Embeddings for all geonames populated locations with population greater than 0☆13Updated 8 years ago
- An active annotation tool based on brat(https://github.com/nlplab/brat)☆19Updated 7 years ago