Rakesh-Nagaraju / Twitter-Data-Analysis-on-COVID19-using-Hadoop-Flume-Hive-and-Spark.Links
This project aims to use the Hadoop framework to analyze unstructured data that we obtain from Twitter and perform sentiment and trend analysis using Hive on MapReduce and Spark on keyword “COVID19”. We then compare the Hive and Spark approaches to determine the best performance.
☆16Updated 5 years ago
Alternatives and similar repositories for Twitter-Data-Analysis-on-COVID19-using-Hadoop-Flume-Hive-and-Spark.
Users that are interested in Twitter-Data-Analysis-on-COVID19-using-Hadoop-Flume-Hive-and-Spark. are comparing it to the libraries listed below
Sorting:
- Build Deep Neural Network model in Keras and deploy a REST API to production with Flask on Google App Engine☆33Updated 2 years ago
- Production-Ready Applied Deep Learning☆91Updated last week
- A collection of my blogs on Data Science and Machine learning.☆86Updated 11 months ago
- A repository containing data and files for my stories on Medium.com.☆58Updated 9 months ago
- Machine Learning Engineering Camp 2022☆39Updated 2 years ago
- Example project with a CNN to train a Pokémon type classifier, adapted for DTC workshop☆36Updated last year
- Coursera Advanced Machine Learning Specialization by National Research University Higher School of Economics☆36Updated 2 years ago
- Machine Learning Model Serving Patterns and Best Practices☆35Updated last week
- Public notebooks and datasets to accompany the Data Analysis with Polars course on Udemy☆45Updated 2 years ago
- ☆66Updated 6 months ago
- Cleaning Data for Effective Data Science, published by Packt☆100Updated last week
- Comet for Data Science, published by Packt☆42Updated last week
- Collection of Open Source projects in 2020☆65Updated last year
- A pipeline to detect data drift and retrain the model when there is drift☆24Updated 2 years ago
- Machine Learning for Streaming Data with Python, published by Packt☆73Updated last week
- A repository to keep track of all the code that I end up writing for my blog posts.☆251Updated 2 years ago
- Streamlit example showing Scikit Learn & Pyspark ML over Healthcare data ! Its simple !!☆32Updated 4 years ago
- A set of jupyter notebooks☆24Updated 11 months ago
- A Graduate Level Three Week Bootcamp on AWS☆57Updated 10 months ago
- Hands-On Statistics for Data Science, published by Packt☆34Updated last week
- ☆12Updated 2 years ago
- Machine Learning Engineering on AWS, published by Packt☆71Updated last week
- Source Code for 'Applied Data Science Using PySpark' by Ramcharan Kakarla, Sundar Krishnan, and Sridhar Alla☆48Updated 4 years ago
- ☆87Updated last year
- Automated Machine Learning on AWS, published by Packt☆45Updated last year
- This is code depository for my upcoming session. Will update details post the session☆40Updated 2 years ago
- Slides and notebook for the workshop on serving bert models in production☆25Updated 3 years ago
- An end-to-end project on customer segmentation☆83Updated 2 years ago
- ☆31Updated 2 years ago
- Slides for "Feature engineering for time series forecasting" talk☆62Updated 3 years ago