piotrszul / spark-tutorial
Tutorial and examples for using Apache Spark
☆18Updated 7 years ago
Alternatives and similar repositories for spark-tutorial
Users who are interested in spark-tutorial are comparing it to the repositories listed below
- ☆18Updated 7 years ago
- Notebooks for the ValleyML Bootcamp (Aug 2019) "Statistical methods for data science"☆10Updated 5 years ago
- Python Machine Learning (ML) project that demonstrates the archetypal ML workflow within a Jupyter notebook, with automated model deploym…☆62Updated 2 years ago
- Exploratory Data Analysis with Pandas and Python 3.x, published by Packt☆44Updated 2 years ago
- Apache Spark in 7 Days [Video], by Packt Publishing☆18Updated 2 years ago
- Work for Mastering Large Datasets with Python☆19Updated 2 years ago
- Explore tips and tricks to deploy machine learning models with Docker.☆13Updated last year
- Book Projects☆24Updated 4 years ago
- Code repo for Packt course I developed, "Beginning Data Wrangling with Python"☆30Updated 5 years ago
- Data Analysis and Exploration with Pandas, published by Packt☆17Updated 4 years ago
- Black Friday Sales Prediction & Thanksgiving App☆9Updated 4 years ago
- Jupyter Notebooks and other material from tutorial sessions on Machine Learning, Data Science, and related☆56Updated 3 years ago
- Data analysis using numpy, pandas, matplotlib, seaborn, sqlite3, data wrangling☆31Updated 5 years ago
- Production repo to accompany Deep Learning with Structured Data book from Manning: https://www.manning.com/books/deep-learning-with-struc…☆73Updated 3 years ago
- A series of Jupyter notebooks that walk you through Machine Learning with Apache Spark ecosystem using Spark MLlib, PyTorch and TensorFlo…☆82Updated last year
- Talks about vaex☆36Updated 2 years ago
- ☆15Updated 2 years ago
- Basic TensorFlow mechanics, operations, class definitions, and neural networks building. Examples from deeplearning.ai Tensorflow course …☆35Updated 6 years ago
- Powerful rapid automatic EDA and feature engineering library with a very easy to use API 🌟☆53Updated 3 years ago
- Recency, Frequency, and Monetary are three behavioral attributes and are quite simple, in that they can be easily computed for any databa…☆15Updated last year
- PySpark Cheatsheet☆36Updated 2 years ago
- A tutorial that helps Big Data Engineers ramp up faster by getting familiar with PySpark dataframes and functions. It also covers topics …☆20Updated 3 years ago
- Source Code for 'Applied Data Science Using PySpark' by Ramcharan Kakarla, Sundar Krishnan, and Sridhar Alla☆46Updated 4 years ago
- PyConDE & PyData Berlin 2019 Airflow Workshop: Airflow for machine learning pipelines.☆47Updated last year
- ☆21Updated last year
- spark (scala and python)☆18Updated 5 years ago
- Hands-On Big Data Analytics with PySpark, Published by Packt☆35Updated 2 years ago
- ☆19Updated 4 years ago
- datascienv is a package that helps you set up your environment in a single line of code with all dependencies, and it also includes pyforest…☆58Updated 3 years ago
- Spark NLP for Streamlit☆15Updated 3 years ago