lp-dataninja / SparkML
Detailed notes and code to learn machine learning with Apache Spark.
☆13Updated 6 years ago
Alternatives and similar repositories for SparkML:
Users that are interested in SparkML are comparing it to the libraries listed below
- A machine learning algorithm written to predict severity of insurance claim☆19Updated 8 years ago
- A project on machine learning techniques dealing with imbalanced classification (Python)☆11Updated 7 years ago
- The code to generate a top 20 score in the amazon classification challenge using Driverless AI's predictions and feature engineering : In…☆18Updated 7 years ago
- Variational deep autoencoder to predict churn customer☆28Updated 6 years ago
- (117th place - Top 26%) Deep learning using Keras and Spark for the "Store Item Demand Forecasting" Kaggle competition.☆26Updated 5 years ago
- Demo on the capability of Yandex CatBoost gradient boosting classifier on a fictitious IBM HR dataset obtained from Kaggle. Data explorat…☆30Updated 5 years ago
- Follow the Lumiata Tech Blog on Medium!☆21Updated last year
- Recency, Frequency, and Monetary are three behavioral attributes and are quite simple, in that they can be easily computed for any databa…☆15Updated last year
- ☆9Updated 5 years ago
- Pyspark in Google Colab: A simple machine learning (Linear Regression) model☆36Updated 5 years ago
- Hands on Unsupervised Learning with Python [Video], Published by Packt☆29Updated 2 years ago
- My presentation at ODSC India 2018 about Deep Learning with Apache Spark☆27Updated 6 years ago
- This repository contains Time series Analysis and Forecasting tutorial from Analytics Vidhya☆22Updated 6 years ago
- ☆11Updated 4 years ago
- ☆22Updated last year
- Deep Learning with Apache Spark and Deep Cognition☆59Updated 6 years ago
- ☆19Updated 3 years ago
- Resources for Data Science Kick Starter Workshop at ODSC India 2019☆20Updated 4 years ago
- Creating a Gradio user interface to predict the sentiment of a tweet☆12Updated 3 years ago
- Insurance Claim Prediction using Machine Learning - Udacity Nanodegree Capstone Project☆16Updated 8 years ago
- Program assignments for the Deep Learning Specialization at Coursera by Andrew Ng☆51Updated 7 years ago
- Repository for medium article☆22Updated last year
- ☆18Updated 3 years ago
- Forecasting Uber demand in NYC neighborhoods☆34Updated 6 years ago
- Yellowbrick is an open source, Python project that extends the scikit-learn API with visual analysis and diagnostic tools☆13Updated 5 years ago
- Bayesian AB Tests Examples☆22Updated 2 years ago
- end-to-end Machine Learning model with MLlib in pySpark, For a Binary Classification problem with Imbalanced Classes☆8Updated 5 years ago
- Time Series Forecasting for the M5 Competition☆41Updated 3 years ago
- "Building a Recommender System from Scratch" Workshop Material for PyDataDC 2018☆24Updated 6 years ago
- Small example on how you can detect multicollinearity☆13Updated 3 years ago