piotrszul / spark-tutorial
Tutorial and examples for using Apache Spark
β16Updated 7 years ago
Related projects β
Alternatives and complementary repositories for spark-tutorial
- This is the code repo for the O'Reilly book "Data Science: The Hard Parts"β10Updated 5 months ago
- Powerful rapid automatic EDA and feature engineering library with a very easy to use API πβ53Updated 2 years ago
- Work for Mastering Large Datasets with Pythonβ18Updated last year
- Pyspark in Google Colab: A simple machine learning (Linear Regression) modelβ36Updated 5 years ago
- Python Machine Learning (ML) project that demonstrates the archetypal ML workflow within a Jupyter notebook, with automated model deploymβ¦β60Updated last year
- β19Updated 6 years ago
- datascienv is package that helps you to setup your environment in single line of code with all dependency and it is also include pyforestβ¦β58Updated 3 years ago
- Jupyter Notebooks and other material from tutorial sessions on Machine Learning, Data Science, and relatedβ56Updated 3 years ago
- Predict the number of deaths due to covid19 in the next two weeksβ11Updated 2 years ago
- Explore tips and tricks to deploy machine learning models with Docker.β13Updated last year
- Notebooks for the ValleyML Bootcamp (Aug 2019) "Statistical methods for data science"β10Updated 5 years ago
- A collection of Python scriptsβ13Updated 4 years ago
- Accelerate Deep Learning Workloads with Amazon SageMaker, published by Packtβ16Updated last year
- β17Updated 3 years ago
- Building simple ML apps with Streamlitβ25Updated 3 years ago
- Recurrent Neural Networks for Timeseriesβ24Updated 5 years ago
- Companion Notebooks and Data for Data Science with Python and Dask from Manning Publicationsβ52Updated 4 years ago
- Introduction to MLflow with a demo locally and how to set it on AWSβ42Updated 3 years ago
- Code repo for Packt course I developed, "Beginning Data Wrangling with Python"β28Updated 4 years ago
- Instant search for and access to many datasets in Pyspark.β34Updated 2 years ago
- Data mining algorithms with Pythonβ10Updated 5 years ago
- β29Updated 5 years ago
- β11Updated 5 years ago
- Recency, Frequency, and Monetary are three behavioral attributes and are quite simple, in that they can be easily computed for any databaβ¦β15Updated last year
- Exploratory Data Analysis with Pandas and Python 3.x, published by Packtβ44Updated last year
- Cookiecutter template for testing Python scikit-learn classifiers.β32Updated 10 months ago
- Book Projectsβ24Updated 3 years ago
- Production repo to accompany Deep Learning with Structured Data book from Manning: https://www.manning.com/books/deep-learning-with-strucβ¦β72Updated 2 years ago
- Iowa House Prices Kaggle (top 5%)β13Updated 5 months ago
- Predicting the Likelihood to Purchase a Financial Product Following a Direct Marketing Campaignβ28Updated last year