Rakesh-Nagaraju / Twitter-Data-Analysis-on-COVID19-using-Hadoop-Flume-Hive-and-Spark.
This project aims to use the Hadoop framework to analyze unstructured data that we obtain from Twitter and perform sentiment and trend analysis using Hive on MapReduce and Spark on keyword “COVID19”. We then compare the Hive and Spark approaches to determine the best performance.
☆15Updated 4 years ago
Related projects ⓘ
Alternatives and complementary repositories for Twitter-Data-Analysis-on-COVID19-using-Hadoop-Flume-Hive-and-Spark.
- A tutorial that helps Big Data Engineers ramp up faster by getting familiar with PySpark dataframes and functions. It also covers topics …☆18Updated 3 years ago
- Laptop Prices Predictor is an end-to-end data science project that accurately predicts laptop prices using machine learning algorithms. T…☆14Updated 3 months ago
- Streamlit example showing Scikit Learn & Pyspark ML over Healthcare data ! Its simple !!☆30Updated 3 years ago
- Slides and notebook for the workshop on serving bert models in production☆24Updated 2 years ago
- Example project with a CNN to train a Pokémon type classifier, adapted for DTC workshop☆34Updated 11 months ago
- Sentiment Analysis of COVID-19 Vaccine-related Twitter Data☆10Updated 3 years ago
- TensorFlow Serving + Streamlit!☆22Updated 3 years ago
- A modern, enterprise-ready business intelligence web application☆32Updated last year
- This is a solution that demonstrates how to train and deploy a pre-trained Huggingface model on AWS SageMaker and publish an AWS QuickSig…☆11Updated 2 years ago
- Deploy A/B testing infrastructure in a containerized microservice architecture for Machine Learning applications.☆39Updated last year
- The official repository of the book Data Storytelling with Python Altair and Generative AI☆15Updated 2 weeks ago
- Create Interactive Dashboards With Streamlit in Python☆15Updated 4 years ago
- A collection of my blogs on Data Science and Machine learning.☆84Updated 5 months ago
- Repository for GH public projects☆17Updated 8 months ago
- The repository for the course in Udemy☆17Updated 5 years ago
- Data visualization in Python with Matplotlib & Seaborn☆41Updated 2 years ago
- Build Deep Neural Network model in Keras and deploy a REST API to production with Flask on Google App Engine☆34Updated last year
- Book Projects☆24Updated 3 years ago
- An end-to-end tutorial to forecast the M5 dataset using feature engineering pipelines and gradient boosting.☆14Updated last year
- Awesome MLOps Course Outline☆32Updated last year
- A hands-on case study for demonstrating the stages involved in a machine learning project, from EDA to production.☆37Updated last year
- ☆7Updated 5 years ago
- ☆11Updated 3 years ago
- Kubeflow installation on windows 10/11☆16Updated last year
- Continuous Machine Learning with Kubeflow, published by BPB Publications☆14Updated 2 years ago
- ☆26Updated 5 years ago
- ☆17Updated 5 years ago
- A downloadable pdf containing summary of frequently used pandas operations.☆10Updated 4 years ago
- A simple app to classify dogs using fastai and streamlit.☆17Updated 3 years ago
- A New Interactive Approach to Learning Data Analysis☆66Updated last year