piotrszul / spark-tutorial
Tutorial and examples for using Apache Spark
☆18Updated 7 years ago
Alternatives and similar repositories for spark-tutorial:
Users that are interested in spark-tutorial are comparing it to the libraries listed below
- ☆19Updated 6 years ago
- ☆19Updated 3 years ago
- Notebooks for the ValleyML Bootcamp (Aug 2019) "Statistical methods for data science"☆10Updated 5 years ago
- Implementation of Spark code in Jupyter notebook. Topics include: RDDs and DataFrame, exploratory data analysis (EDA), handling multiple …☆29Updated 4 years ago
- ☆17Updated 4 years ago
- Accelerate Deep Learning Workloads with Amazon SageMaker, published by Packt☆17Updated last year
- Blog post on ETL pipelines with Airflow☆23Updated 4 years ago
- This is the code repo for the O'Reilly book "Data Science: The Hard Parts"☆13Updated 8 months ago
- Best practices for engineering ML pipelines.☆37Updated 2 years ago
- datascienv is package that helps you to setup your environment in single line of code with all dependency and it is also include pyforest…☆58Updated 3 years ago
- Deploy A/B testing infrastructure in a containerized microservice architecture for Machine Learning applications.☆40Updated last month
- Scheduling Big Data Workloads and Data Pipelines in the Cloud with pyDag☆24Updated 2 years ago
- This is the repository containing machine learning and deep learning projects, as well as some presentation slides on these topics.☆13Updated 8 months ago
- Python Machine Learning (ML) project that demonstrates the archetypal ML workflow within a Jupyter notebook, with automated model deploym…☆61Updated 2 years ago
- ☆21Updated last year
- Code repo for Packt course I developed, "Beginning Data Wrangling with Python"☆29Updated 4 years ago
- Public Repo of my machine learning project to predict home prices☆12Updated 4 years ago
- Basic TensorFlow mechanics, operations, class definitions, and neural networks building. Examples from deeplearning.ai Tensorflow course …☆36Updated 5 years ago
- My fastai blog☆21Updated last year
- Awesome list and projects of Time Series☆27Updated last year
- Black Friday Sales Prediction & Thanksgiving App☆9Updated 4 years ago
- A tutorial that helps Big Data Engineers ramp up faster by getting familiar with PySpark dataframes and functions. It also covers topics …☆20Updated 3 years ago
- Pyspark in Google Colab: A simple machine learning (Linear Regression) model☆36Updated 5 years ago
- Hands on Unsupervised Learning with Python [Video], Published by Packt☆29Updated 2 years ago
- A simple app to classify dogs using fastai and streamlit.☆17Updated 4 years ago
- PyConDE & PyData Berlin 2019 Airflow Workshop: Airflow for machine learning pipelines.☆46Updated last year
- Code repository for Python for Beginners: Learn Python from Scratch, published by Packt☆13Updated last year
- The goal of this project is to offer an AWS EMR template using Spot Fleet and On-Demand Instances that you can use quickly. Just focus on…☆26Updated 2 years ago
- ☆11Updated last year
- Modeling and Simulation in Python and MATLAB/Octave☆12Updated 3 years ago