black-tea / data-projects
A compendium of data projects and associated blog posts
☆10 Updated 5 years ago
Alternatives and similar repositories for data-projects:
Users who are interested in data-projects are comparing it to the repositories listed below
- Spark NLP for Streamlit ☆15 Updated 3 years ago
- ☆16 Updated 4 years ago
- classify a job description (or noisy job title) into a ONET job title ☆19 Updated 8 years ago
- Skill Representations in Vector Space ☆34 Updated last year
- An analysis of traffic accident data for the UK in 2014, using data from the UK Data Service. (Sourced from Kaggle with original data com… ☆12 Updated 7 years ago
- Predict the poverty of households in Costa Rica using automated feature engineering. ☆23 Updated 4 years ago
- Tutorial code and data for the entity resolution workshops. ☆45 Updated 9 years ago
- Propensity models make true predictions about a customer’s future behavior. With propensity models you can truly anticipate a customer's … ☆17 Updated 5 years ago
- In-Session Personalization Workshop for eCommerce, April 2021, and the MICES Workshop in June 2021. ☆22 Updated 3 years ago
- Materials for Machine Learning with H2O Open Platform at ODSC Masterclass Summit 2017 ☆12 Updated 8 years ago
- Advanced Python visualization library for Association Rules ☆8 Updated 3 years ago
- Predict whether a student will correctly answer a problem based on past performance using automated feature engineering ☆32 Updated 4 years ago
- Topic Modelling for Humans ☆22 Updated 7 years ago
- Pyspark in Google Colab: A simple machine learning (Linear Regression) model ☆36 Updated 5 years ago
- MLinProduction SageMaker workshop hosted in April 2020 ☆15 Updated 4 years ago
- Embed categorical variables via neural networks. ☆59 Updated 2 years ago
- ☆23 Updated last year
- Code for my presentation: Using PySpark to Process Boat Loads of Data ☆20 Updated 7 years ago
- Predicting the Likelihood to Purchase a Financial Product Following a Direct Marketing Campaign ☆27 Updated 2 years ago
- Using Luigi to create a Machine Learning Pipeline using the Rossman Sales data from Kaggle ☆33 Updated 8 years ago
- ☆10 Updated 4 years ago
- Work for Mastering Large Datasets with Python ☆18 Updated 2 years ago
- Watson OpenScale tutorials including sample models, notebooks and applications ☆22 Updated 2 years ago
- Probabilistic/machine-learning algorithms for medical record linkage [Critical Juncture] ☆14 Updated 7 years ago
- Extract LinkedIn comments from any post and export them to an Excel file ☆23 Updated 6 years ago
- store my personal project ☆22 Updated 4 years ago
- Jupyter Notebooks for Bussiness2Vector ☆9 Updated 6 years ago
- A curated list of references for MLOps ☆13 Updated 4 years ago
- Examples of how Python can speed up tasks that are cumbersome in Excel ☆13 Updated 8 years ago
- Distributed, large-scale, benchmarking framework for rigorous assessment of automatic machine learning repositories, projects, and librar… ☆30 Updated 2 years ago