rdempsey / pyspark-for-data-processingLinks
Code for my presentation: Using PySpark to Process Boat Loads of Data
☆20Updated 7 years ago
Alternatives and similar repositories for pyspark-for-data-processing
Users that are interested in pyspark-for-data-processing are comparing it to the libraries listed below
Sorting:
- pyspark sample scripts☆17Updated 6 years ago
- Pyspark in Google Colab: A simple machine learning (Linear Regression) model☆37Updated 6 years ago
- Slides, code and more for my class: Data Analytics and Machine Learning on Big Data☆8Updated 7 years ago
- ☆40Updated 8 years ago
- These are the slides and code for my tutorial "Computer Vision: an (Un?)Expected Journey" at PyData London 2018☆29Updated 7 years ago
- Collection of presentation of my work on various platforms and meetups☆22Updated 6 years ago
- Few tutorials on pandas, matplotlib and seaborn☆27Updated 9 years ago
- ☆11Updated 6 years ago
- Notes for Data Science 350 Class☆24Updated 8 years ago
- Project template for highly effective data science workflows☆29Updated last year
- Ingest tweets with Kafka. Use Spark to track popular hashtags and trendsetters for each hashtag☆29Updated 9 years ago
- A machine learning algorithm written to predict severity of insurance claim☆20Updated 8 years ago
- A tutorial to create python based prediction web app☆30Updated 5 years ago
- Repository for the PyData DC 2016 tutorial☆29Updated 8 years ago
- This is a machine learning challenge conducted by C&D Labs and Future Group in association with HackerEarth.☆10Updated 7 years ago
- Churn Prediction with PySpark using MLlib and ML Packages☆57Updated 9 years ago
- Slides and code examples for H2O tutorials at various events☆56Updated 7 years ago
- ☆10Updated 6 years ago
- ☆19Updated 4 years ago
- Codes related to Knocktober 2016☆23Updated 8 years ago
- Tutorial repo for the article "ML in Production"☆30Updated 2 years ago
- Python Machine Learning (ML) project that demonstrates the archetypal ML workflow within a Jupyter notebook, with automated model deploym…☆61Updated 2 years ago
- Sample Notebooks for PipelineAI☆44Updated 2 years ago
- Contains code and presentation for my interactive hack session, 'Effective Feature Engineering: A Structured Approach to Building Better …☆30Updated 4 years ago
- A simple Spark TDD example☆26Updated 7 years ago
- Code that goes along with https://humansofdata.atlan.com/2018/06/apache-airflow-disease-outbreaks-india/☆24Updated 2 years ago
- A couple projects using scikit-learn illustrating project decision making.☆15Updated 8 years ago
- ☆26Updated last year
- Contains code for understanding TensorFlow workflow and basics☆51Updated 7 years ago
- Code supporting Data Science articles at The Marketing Technologist, Floryn Tech Blog, and Pythom.nl☆71Updated 2 years ago