piotrszul / spark-tutorial
Tutorial and examples for using Apache Spark
☆18 Updated 7 years ago
Alternatives and similar repositories for spark-tutorial:
Users who are interested in spark-tutorial are comparing it to the repositories listed below.
- ☆18 Updated 6 years ago
- PyConDE & PyData Berlin 2019 Airflow Workshop: Airflow for machine learning pipelines. ☆47 Updated last year
- Python Machine Learning (ML) project that demonstrates the archetypal ML workflow within a Jupyter notebook, with automated model deploym… ☆62 Updated 2 years ago
- Apache Spark in 7 Days [Video], by Packt Publishing ☆18 Updated 2 years ago
- This is the code repo for the O'Reilly book "Data Science: The Hard Parts" ☆13 Updated 10 months ago
- Accelerate Deep Learning Workloads with Amazon SageMaker, published by Packt ☆17 Updated last year
- CentOS based Docker container for Time Series Analysis and Modeling. ☆21 Updated 5 years ago
- Deploy A/B testing infrastructure in a containerized microservice architecture for Machine Learning applications. ☆40 Updated 3 months ago
- Black Friday Sales Prediction & Thanksgiving App ☆9 Updated 4 years ago
- Data from the state of data science survey released by Anaconda each year. ☆17 Updated 8 months ago
- Partly lecture and partly a hands-on tutorial and workshop, this is a three part series on how to get started with MLflow. In this four p… ☆39 Updated 4 years ago
- Recency, Frequency, and Monetary are three behavioral attributes and are quite simple, in that they can be easily computed for any databa… ☆15 Updated last year
- An example MLFlow project ☆48 Updated 3 months ago
- Notebooks for the ValleyML Bootcamp (Aug 2019) "Statistical methods for data science" ☆10 Updated 5 years ago
- ☆65 Updated 2 months ago
- Repository for medium article ☆22 Updated last year
- Jupyter notebooks for pyspark tutorials given at University ☆107 Updated 4 months ago
- Partly lecture and partly a hands-on tutorial and workshop, this is a three part series on how to get started with MLflow. In this four p… ☆35 Updated 4 years ago
- pycaret-demo-mlflow ☆30 Updated 4 years ago
- Talks about vaex ☆36 Updated 2 years ago
- Work for Mastering Large Datasets with Python ☆19 Updated 2 years ago
- A simple app to classify dogs using fastai and streamlit. ☆17 Updated 4 years ago
- ☆19 Updated 4 years ago
- Best practices for engineering ML pipelines. ☆35 Updated 2 years ago
- "Building a Recommender System from Scratch" Workshop Material for PyDataDC 2018 ☆24 Updated 6 years ago
- Blog post on ETL pipelines with Airflow ☆23 Updated 4 years ago
- Demo on how to use Prefect with Docker ☆25 Updated 2 years ago
- Sample projects using Ploomber. ☆86 Updated last year
- Implementation of Spark code in Jupyter notebook. Topics include: RDDs and DataFrame, exploratory data analysis (EDA), handling multiple … ☆30 Updated 4 years ago
- PySpark Tutorial for Beginners on Google Colab: Hands-On Guide ☆16 Updated 4 years ago