hanee-shousha / TwitterStreaming
Processing tweets with Spark Streaming and identifying the top trending hashtags on a simple real-time dashboard
☆41 · Updated 3 years ago
Alternatives and similar repositories for TwitterStreaming:
Users who are interested in TwitterStreaming are comparing it to the repositories listed below.
- Twitter Sentiment Analysis using Spark and Kafka ☆115 · Updated 6 years ago
- ☆150 · Updated 7 years ago
- Repository used for Spark Trainings ☆53 · Updated 2 years ago
- PySpark Code for Hands-on Learners ☆116 · Updated 5 years ago
- Counting Tweets Per User in Real-Time ☆42 · Updated 7 years ago
- Sentiment Analysis of a Twitter Topic with Spark Structured Streaming ☆55 · Updated 6 years ago
- Use Kafka and Apache Spark streaming to perform click stream analytics ☆76 · Updated 5 years ago
- 🚨 Simple, self-contained fraud detection system built with Apache Kafka and Python ☆86 · Updated 5 years ago
- Real-time Machine Learning with Apache Spark on Twitter Public Stream ☆68 · Updated 8 years ago
- ☆37 · Updated 8 years ago
- Code to build a simple analytics data pipeline with Python ☆102 · Updated 8 years ago
- Contains source files used in the Spark with Python course ☆18 · Updated 6 years ago
- Ingest tweets with Kafka. Use Spark to track popular hashtags and trendsetters for each hashtag ☆29 · Updated 9 years ago
- Apache Spark Structured Streaming with Kafka using Python (PySpark) ☆40 · Updated 5 years ago
- Churn Prediction with PySpark using MLlib and ML Packages ☆56 · Updated 9 years ago
- Code for my presentation: Using PySpark to Process Boat Loads of Data ☆20 · Updated 7 years ago
- Jupyter notebooks for pyspark tutorials given at University ☆107 · Updated 4 months ago
- Basic tutorial of using Apache Airflow ☆36 · Updated 6 years ago
- Updated repository ☆157 · Updated 3 years ago
- Databricks - Apache Spark™ - 2X Certified Developer ☆266 · Updated 4 years ago
- A way for home buyers to know about factors affecting a state ☆47 · Updated 6 years ago
- Building pipeline to process the real-time data using Spark and Mongodb. ☆12 · Updated 5 years ago
- Live Twitter sentiment analysis using Python, Apache Spark Streaming, Kafka, NLTK, SocketIO ☆20 · Updated 7 years ago
- Building data models, data warehouses, and data lakes; automating data pipelines; and working with massive datasets. ☆13 · Updated 5 years ago
- ETL pipeline using pyspark (Spark - Python) ☆115 · Updated 5 years ago
- Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validatio… ☆54 · Updated last year
- helpful resources for (big) data science ☆33 · Updated 3 years ago
- Simple alert system implemented in Kafka and Python ☆95 · Updated 6 years ago
- AWS Big Data Certification ☆25 · Updated 3 months ago
- Repo for all my code on the articles I post on medium ☆107 · Updated 2 years ago