MLWhiz / Spark_Projects
Spark Projects for the Berkeley Data Science Course
☆11Updated 9 years ago
Related projects ⓘ
Alternatives and complementary repositories for Spark_Projects
- ☆14Updated 6 years ago
- Contains code and presentation for my interactive hack session, 'Effective Feature Engineering: A Structured Approach to Building Better …☆29Updated 3 years ago
- Code for my presentation: Using PySpark to Process Boat Loads of Data☆20Updated 7 years ago
- ☆11Updated 6 years ago
- Follow the Lumiata Tech Blog on Medium!☆21Updated last year
- A Scalable Data Cleaning Library for PySpark.☆26Updated 5 years ago
- Data Science and Machine Learning with Python - Hands On from Udemy☆14Updated 7 years ago
- A tutorial to create python based prediction web app☆30Updated 4 years ago
- ☆39Updated 7 years ago
- ☆19Updated 3 years ago
- Pyspark in Google Colab: A simple machine learning (Linear Regression) model☆36Updated 5 years ago
- ☆26Updated 10 months ago
- Work for Mastering Large Datasets with Python☆18Updated last year
- Brian Farris' Talk on Reinforcement Learning and Multi-Armed Bandits for the Data Incubator☆30Updated 6 years ago
- Spark and Python (PySpark) Examples☆39Updated 3 years ago
- A code-based tutorial for production level data streaming with PySpark plus Optimus for data cleaning, Confluent Kafka, & Apache Drill u…☆26Updated 5 years ago
- E-Commerce Website A/B testing: Recommend which of two landing pages to keep based on A/B testing☆24Updated 6 years ago
- Tips for Advanced Feature Engineering☆52Updated 4 years ago
- helpful resources for (big) data science☆33Updated 3 years ago
- Contains source files used in the Spark with Python course☆18Updated 5 years ago
- My presentation at ODSC India 2018 about Deep Learning with Apache Spark☆27Updated 6 years ago
- Applying automated feature engineering to the Kaggle Home Credit Default Risk Competition☆18Updated 6 years ago
- Examples of how Python can speed up tasks that are cumbersome in Excel☆13Updated 8 years ago
- Bare minimum End-to-End ML application with Flask REST API Prediction Service☆12Updated 4 years ago
- Data models, build data warehouses and data lakes, automate data pipelines, and worked with massive datasets.☆13Updated 5 years ago
- Few tutorials on pandas, matplotlib and seaborn☆26Updated 8 years ago
- How to do data science with Optimus, Spark and Python.☆18Updated 5 years ago