roshankoirala / pySpark_tutorialLinks
Implementation of Spark code in Jupyter notebook. Topics include: RDDs and DataFrame, exploratory data analysis (EDA), handling multiple DataFrames, visualization, Machine Learning
☆30Updated 5 years ago
Alternatives and similar repositories for pySpark_tutorial
Users that are interested in pySpark_tutorial are comparing it to the libraries listed below
Sorting:
- ☆18Updated 7 years ago
- ☆18Updated 4 years ago
- A packaged and flexible version of the CRAFT text detector and Keras CRNN recognition model.☆11Updated 5 years ago
- Building simple ML apps with Streamlit☆24Updated 4 years ago
- Mastering Big Data Analytics with PySpark, Published by Packt☆161Updated last year
- Tutorials for using Tensorflow and Keras for NLP☆14Updated 7 years ago
- A simple example to showcase machine learning model deployment with an API☆10Updated 3 years ago
- ☆13Updated 4 years ago
- This repository is to host template for calculating ROI on Artificial Intelligence projects☆45Updated 6 years ago
- PySpark Tutorial for Beginners on Google Colab: Hands-On Guide☆17Updated 5 years ago
- A tutorial that helps Big Data Engineers ramp up faster by getting familiar with PySpark dataframes and functions. It also covers topics …☆20Updated 3 years ago
- Example MLOps using BentoML & mlFlow☆38Updated 4 years ago
- ☆11Updated 4 years ago
- Data analysis using numpy, pandas, matplotlib, seaborn, sqlite3, data wrangling☆31Updated 5 years ago
- Content related to Mastering Postgresql along with videos.☆18Updated 4 years ago
- This repository hosts the code/projects/demos/slides for Big Data technologies under Apache Hadoop and Apache Spark umbrella.☆42Updated 3 years ago
- Spark Databricks Notebooks☆14Updated 4 years ago
- ☆63Updated 7 years ago
- Building Recommendation Systems with Python [Video], by Packt Publishing☆90Updated 2 years ago
- Predict the number of deaths due to covid19 in the next two weeks☆11Updated 3 years ago
- code, labs and lectures for the course☆48Updated 2 years ago
- Pyspark in Google Colab: A simple machine learning (Linear Regression) model☆38Updated 6 years ago
- This is code depository for my upcoming session. Will update details post the session☆40Updated 2 years ago
- Deploy A/B testing infrastructure in a containerized microservice architecture for Machine Learning applications.☆40Updated 9 months ago
- Laptop Prices Predictor is an end-to-end data science project that accurately predicts laptop prices using machine learning algorithms. T…☆14Updated last year
- This is a guided certification project, as a part of Data Science for Social Good initiative☆17Updated 5 years ago
- Course on Udemy by Jose Portilla☆98Updated 7 years ago
- DeepSpeech is an open source speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU …☆10Updated 5 years ago
- Simple template showing how to set up docker for reproducible data science with Jupyter notebooks.☆23Updated last year
- PySpark functions and utilities with examples. Assists ETL process of data modeling☆104Updated 4 years ago