roshankoirala / pySpark_tutorial
Implementation of Spark code in Jupyter notebooks. Topics include RDDs and DataFrames, exploratory data analysis (EDA), handling multiple DataFrames, visualization, and machine learning.
☆29Updated 4 years ago
Alternatives and similar repositories for pySpark_tutorial:
Users who are interested in pySpark_tutorial are comparing it to the repositories listed below:
- ☆19Updated 6 years ago
- A simple example to showcase machine learning model deployment with an API☆10Updated 2 years ago
- Kubeflow installation on Windows 10/11☆16Updated 2 years ago
- Book Projects☆25Updated 3 years ago
- Contains relevant notebooks for the hands-on NLP workshop for the GIDS AIML Conference - 2020 Edition☆23Updated 3 years ago
- ☆11Updated 4 years ago
- The repository of the book: Deep Learning with Python by Francois Chollet☆16Updated 5 years ago
- A small example of how to detect multicollinearity☆13Updated 3 years ago
- ☆18Updated 3 years ago
- Building simple ML apps with Streamlit☆24Updated 3 years ago
- Learning from multiple companies in Silicon Valley: Netflix, Facebook, Google, and startups☆16Updated 6 years ago
- This repository hosts a template for calculating ROI on artificial intelligence projects☆44Updated 5 years ago
- Customer-base segmentation over e-commerce sales data☆25Updated 4 years ago
- This is the repository containing machine learning and deep learning projects, as well as some presentation slides on these topics.☆13Updated 8 months ago
- A tutorial that helps Big Data Engineers ramp up faster by getting familiar with PySpark dataframes and functions. It also covers topics …☆20Updated 3 years ago
- Contains slides and hands-on tutorials for understanding and implementing Transformers in Natural Language Processing. Uses the HuggingFa…☆27Updated 4 years ago
- This workshop was done as a part of the 1729 conference organized by Fractal Analytics and Analytics Vidhya. Key content covered was hand…☆21Updated 2 years ago
- Using extractive summarization to summarize Medium posts☆11Updated 5 years ago
- The repository for the course on Udemy☆17Updated 5 years ago
- Natural Language Processing☆28Updated 11 months ago
- In this machine learning pricing project, we implement a retail price optimization algorithm using regression trees. This is one of the fi…☆18Updated 4 years ago
- This repository consists of a 50-day program. All the statistics required for the complete understanding of data science will be uploaded …☆27Updated 3 years ago
- This is the code repository for my upcoming session. Details will be updated after the session☆40Updated 2 years ago
- A packaged and flexible version of the CRAFT text detector and Keras CRNN recognition model.☆11Updated 4 years ago
- The complete machine learning process is discussed and implemented with PySpark.☆18Updated 4 years ago
- My study collection: data science courses, articles, etc.☆30Updated 5 years ago
- Notebooks for the ValleyML Bootcamp (Aug 2019) "Statistical methods for data science"☆10Updated 5 years ago
- Course on Udemy by Jose Portilla☆97Updated 7 years ago
- ☆11Updated 2 years ago
- Detailed TensorFlow 2 object detection tutorial, explained step by step☆23Updated 4 years ago