angang-li / sparkify
Predict churn with Apache Spark
☆12Updated 5 years ago
Alternatives and similar repositories for sparkify:
Users that are interested in sparkify are comparing it to the libraries listed below
- Keep learning something new☆21Updated 3 years ago
- Mastering Big Data Analytics with PySpark, Published by Packt☆158Updated 5 months ago
- Jupyter notebooks for pyspark tutorials given at University☆107Updated last month
- Data Engineering Capstone Project: ETL Pipelines and Data Warehouse Development☆21Updated 5 years ago
- My Udacity Data Engineer Nano Degree Projects aka Udacity DEND☆16Updated 4 years ago
- Getting start with PySpark and MLlib☆297Updated 6 years ago
- Notes on Apache Spark (pyspark)☆296Updated 5 years ago
- Live Training: Market Basket Analysis in Python☆43Updated 4 years ago
- Project work for Udacity's AB Testing Course☆82Updated 7 years ago
- Code examples on Apache Spark using python☆106Updated 2 years ago
- Because its never late to start taking notes and 'public' it...☆60Updated 2 months ago
- Lecture notes, lab notes, and links to helpful resources to pass Google Certification Exam for Professional Data Engineer.☆17Updated 2 years ago
- Course on Udemy by Jose Portilla☆97Updated 7 years ago
- Portofolio repository for Udacity Data Scientist Nanodegree☆40Updated 4 years ago
- Classwork projects and home works done through Udacity data engineering nano degree☆74Updated last year
- Machine Learning and Data Analysis Case Studies using Spark.☆72Updated 3 years ago
- AB Testing☆13Updated 5 years ago
- Projects done in the Data Engineering Nanodegree by Udacity.com☆270Updated 5 years ago
- PySpark functions and utilities with examples. Assists ETL process of data modeling☆99Updated 4 years ago
- Big Data Engineering practice project, including ETL with Airflow and Spark using AWS S3 and EMR☆80Updated 5 years ago
- ☆148Updated 6 years ago
- This repository contains Spark, MLlib, PySpark and Dataframes projects☆43Updated 7 years ago
- Production repo to accompany Deep Learning with Structured Data book from Manning: https://www.manning.com/books/deep-learning-with-struc…☆72Updated 3 years ago
- This is repository of my YouTube Course on End to End Apache Spark in AIEngineering YouTube Channel☆189Updated 3 years ago
- Udacity Data Science Nanodegree Capstone☆35Updated 4 years ago
- Pytest for Data Science Beginners☆58Updated 6 years ago
- Live Training Session: Cleaning Data with Pyspark☆15Updated 4 years ago
- notebooks produced throughout the Udacity's Nanodegree Data Engineering Course☆73Updated 4 years ago
- ☆19Updated 6 years ago
- A curated list of repositories for my book Machine Learning Solutions.☆78Updated 6 years ago