rdempsey / pyspark-for-data-processing
Code for my presentation: Using PySpark to Process Boat Loads of Data
☆20 Updated 7 years ago
Alternatives and similar repositories for pyspark-for-data-processing
Users who are interested in pyspark-for-data-processing are comparing it to the repositories listed below
- Few tutorials on pandas, matplotlib and seaborn ☆27 Updated 9 years ago
- Extracting LinkedIn comments from any post and export it to Excel file ☆23 Updated 6 years ago
- A tutorial to create python based prediction web app ☆30 Updated 5 years ago
- These are the slides and code for my tutorial "Computer Vision: an (Un?)Expected Journey" at PyData London 2018 ☆30 Updated 7 years ago
- ☆39 Updated 8 years ago
- Collection of presentation of my work on various platforms and meetups ☆22 Updated 6 years ago
- Ingest tweets with Kafka. Use Spark to track popular hashtags and trendsetters for each hashtag ☆29 Updated 9 years ago
- Code that goes along with https://humansofdata.atlan.com/2018/06/apache-airflow-disease-outbreaks-india/ ☆24 Updated 2 years ago
- pyspark sample scripts ☆17 Updated 6 years ago
- Using Luigi to create a Machine Learning Pipeline using the Rossman Sales data from Kaggle ☆33 Updated 9 years ago
- Pyspark in Google Colab: A simple machine learning (Linear Regression) model ☆38 Updated 6 years ago
- Notes for Data Science 350 Class ☆24 Updated 8 years ago
- Python Machine Learning (ML) project that demonstrates the archetypal ML workflow within a Jupyter notebook, with automated model deploym… ☆64 Updated 2 years ago
- My presentation at ODSC India 2018 about Deep Learning with Apache Spark ☆27 Updated 7 years ago
- Springboard - Data Science Intensive course ☆13 Updated 8 years ago
- Codes related to Knocktober 2016 ☆23 Updated 8 years ago
- Slides and materials for most of my talks by year ☆92 Updated 2 years ago
- ML Nanodegree Capstone Project - Predicting NYC Taxi Trip Duration ☆12 Updated 7 years ago
- A machine learning algorithm written to predict severity of insurance claim ☆19 Updated 8 years ago
- ☆26 Updated last year
- Workshop: Python for Data Science ☆62 Updated 10 years ago
- Tutorial repo for the article "ML in Production" ☆30 Updated 2 years ago
- ☆33 Updated last year
- This is a machine learning challenge conducted by C&D Labs and Future Group in association with HackerEarth. ☆10 Updated 7 years ago
- Code supporting Data Science articles at The Marketing Technologist, Floryn Tech Blog, and Pythom.nl ☆71 Updated 2 years ago
- Repository for the PyData DC 2016 tutorial ☆29 Updated 8 years ago
- Deep Learning with Apache Spark and Deep Cognition ☆59 Updated 7 years ago
- Workshop and lesson on Exploratory Data Analysis ☆12 Updated 5 months ago
- Jupyter notebooks for learning Python and Data Science, companion to Data Science Solutions book. ☆37 Updated 5 years ago
- Analyzing NBA data using Spark 2.1 ☆46 Updated 8 years ago