vsmolyakov / pyspark
spark (scala and python)
☆18Updated 5 years ago
Alternatives and similar repositories for pyspark
Users that are interested in pyspark are comparing it to the libraries listed below
Sorting:
- pyspark sample scripts☆17Updated 6 years ago
- ☆26Updated last year
- Code for my presentation: Using PySpark to Process Boat Loads of Data☆20Updated 7 years ago
- Using Luigi to create a Machine Learning Pipeline using the Rossman Sales data from Kaggle☆33Updated 8 years ago
- Slides and notebooks for my tutorial at PyData London 2018☆21Updated 6 years ago
- This is all my random garbage.☆26Updated last year
- ☆19Updated 4 years ago
- Visualization ideas for data science☆20Updated 7 years ago
- Slides and materials for most of my talks by year☆92Updated last year
- Bayesian statistics seminars☆30Updated 8 years ago
- Sample Notebooks for PipelineAI☆44Updated 2 years ago
- ☆16Updated 4 years ago
- Data Science and Machine Learning with Python - Hands On from Udemy☆14Updated 8 years ago
- ☆13Updated 7 years ago
- ☆15Updated 2 years ago
- Material and slides for Boston NLP meetup May 23rd 2016☆17Updated 8 years ago
- A tutorial to create python based prediction web app☆30Updated 5 years ago
- feng - feature engineering for machine-learning champions☆27Updated 8 years ago
- Materials for Machine Learning with H2O Open Platform at ODSC Masterclass Summit 2017☆12Updated 8 years ago
- DeepLearningfromScratch2018☆20Updated 6 years ago
- This repository contains code files specifically IPython notebooks for the assignments in the course "Scalable Machine Learning" by UC Be…☆30Updated 9 years ago
- A Scalable Data Cleaning Library for PySpark.☆27Updated 6 years ago
- In-class exercises for Deep Learning course at NYC Data Science Academy☆32Updated 7 years ago
- Codes related to Knocktober 2016☆23Updated 8 years ago
- Algorithms and Codes for my machine learning blog☆30Updated last year
- notebooks for nlp-on-spark☆13Updated 8 years ago
- ☆14Updated 10 years ago
- A Python Package for data processing and building ML models, primarily based on pandas and sklearn libraries.☆17Updated 5 years ago
- Repo for PyData 2019 Tutorial - New Trends in Estimation and Inference☆25Updated 5 years ago
- Introduction to structured prediction with Python and pystruct☆18Updated 6 years ago