hanee-shousha / TwitterStreaming
Processing tweets using Spark Streaming and identifying top trending hashtags using a real-time simple dashboard
☆41Updated 2 years ago
Alternatives and similar repositories for TwitterStreaming:
Users that are interested in TwitterStreaming are comparing it to the libraries listed below
- Twitter Sentiment Analysis using Spark and Kafka☆115Updated 5 years ago
- Counting Tweets Per User in Real-Time☆42Updated 7 years ago
- Code to build a simple analytics data pipeline with Python☆102Updated 8 years ago
- ☆148Updated 6 years ago
- Sentiment Analysis of a Twitter Topic with Spark Structured Streaming☆55Updated 6 years ago
- Here's how to get DataQuest's Data Engineering Track missions' content to work on your localhost. Using data from my Valenbisi ARIMA mode…☆15Updated 6 years ago
- Repository used for Spark Trainings☆53Updated last year
- 🚨 Simple, self-contained fraud detection system built with Apache Kafka and Python☆86Updated 5 years ago
- Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validatio…☆53Updated last year
- Just a boilerplate for PySpark and Flask☆35Updated 6 years ago
- Demonstration of using Apache Spark to build robust ETL pipelines while taking advantage of open source, general purpose cluster computin…☆24Updated last year
- My solutions for the Udacity Data Engineering Nanodegree☆33Updated 5 years ago
- Use Kafka and Apache Spark streaming to perform click stream analytics☆76Updated 5 years ago
- PySpark Code for Hands-on Learners☆116Updated 5 years ago
- Apche Spark Structured Streaming with Kafka using Python(PySpark)☆41Updated 5 years ago
- Ingest tweets with Kafka. Use Spark to track popular hashtags and trendsetters for each hashtag☆29Updated 8 years ago
- Contains source files used in the Spark with Python course☆18Updated 5 years ago
- A code-based tutorial for production level data streaming with PySpark plus Optimus for data cleaning, Confluent Kafka, & Apache Drill u…☆26Updated 5 years ago
- Real-time Machine Learning with Apache Spark on Twitter Public Stream☆68Updated 7 years ago
- Frank Kane's Taming Big Data with Apache Spark and Python, published by Packt☆122Updated 2 years ago
- Blog post on ETL pipelines with Airflow☆23Updated 4 years ago
- ETL pipeline using pyspark (Spark - Python)☆113Updated 4 years ago
- Spark Application for analysis of Apache Access logs and detect anamolies! Along with Medium Article.☆20Updated 6 years ago
- Building pipeline to process the real-time data using Spark and Mongodb.☆12Updated 5 years ago
- Analyze and visualize Twitter Sentiment on a world map using Spark MLlib☆138Updated 3 years ago
- Data Quest - Data Engineer Learning and Projects☆24Updated 5 years ago
- A way for home buyers to know about factors affecting a state☆47Updated 6 years ago
- ☆25Updated 6 years ago
- Data models, build data warehouses and data lakes, automate data pipelines, and worked with massive datasets.☆13Updated 5 years ago
- code, labs and lectures for the course☆46Updated last year