rdempsey / pyspark-for-data-processing
Code for my presentation: Using PySpark to Process Boat Loads of Data
☆20Updated 7 years ago
Alternatives and similar repositories for pyspark-for-data-processing:
Users that are interested in pyspark-for-data-processing are comparing it to the libraries listed below
- pyspark sample scripts☆17Updated 6 years ago
- ☆40Updated 7 years ago
- Slides, code and more for my class: Data Analytics and Machine Learning on Big Data☆8Updated 7 years ago
- Data Science and Machine Learning with Python - Hands On from Udemy☆14Updated 7 years ago
- Codes related to Knocktober 2016☆23Updated 8 years ago
- ☆19Updated 4 years ago
- A couple projects using scikit-learn illustrating project decision making.☆15Updated 8 years ago
- Pyspark in Google Colab: A simple machine learning (Linear Regression) model☆36Updated 5 years ago
- ☆26Updated last year
- Collection of presentation of my work on various platforms and meetups☆22Updated 6 years ago
- Code that goes along with https://humansofdata.atlan.com/2018/06/apache-airflow-disease-outbreaks-india/☆24Updated last year
- Ingest tweets with Kafka. Use Spark to track popular hashtags and trendsetters for each hashtag☆29Updated 8 years ago
- Material for UW Extension Data Science 350☆19Updated 7 years ago
- Sample Notebooks for PipelineAI☆44Updated 2 years ago
- Notes for Data Science 350 Class☆24Updated 8 years ago
- Work for Mastering Large Datasets with Python☆18Updated 2 years ago
- Repository for the PyData DC 2016 tutorial☆29Updated 8 years ago
- These are the slides and code for my tutorial "Computer Vision: an (Un?)Expected Journey" at PyData London 2018☆29Updated 6 years ago
- This library is a wrapper for sklearn and works with data stored using Pandas module.☆17Updated 9 years ago
- ☆11Updated 6 years ago
- Examples of how Python can speed up tasks that are cumbersome in Excel☆13Updated 8 years ago
- How to do data science with Optimus, Spark and Python.☆19Updated 5 years ago
- Code example to predict prices of Airbnb vacation rentals, using scikit-learn on Spark with spark-sklearn, on MapR.☆44Updated 8 years ago
- Blog post on ETL pipelines with Airflow☆23Updated 4 years ago
- JupyterCon Missing Data Talk 2018☆23Updated 6 years ago
- Brian Farris' Talk on Reinforcement Learning and Multi-Armed Bandits for the Data Incubator☆30Updated 6 years ago
- Live Twitter sentiment analysis using Python, Apache Spark Streaming, Kafka, NLTK, SocketIO☆20Updated 7 years ago
- Jupyter notebooks and code for Intro to DL talk at Genesys☆14Updated 8 years ago
- Project template for highly effective data science workflows☆29Updated 11 months ago
- Contains source files used in the Spark with Python course☆18Updated 5 years ago