RichardAfolabi / Realtime-Data-Analytics-Using-Spark
Realtime social media data analytics with Apache Spark, Python, Kafka, Pandas, etc
☆52Updated 8 years ago
Related projects ⓘ
Alternatives and complementary repositories for Realtime-Data-Analytics-Using-Spark
- ☆26Updated 10 months ago
- Apache Spark Interview Question and Answers☆21Updated 4 years ago
- Spark Application for analysis of Apache Access logs and detect anamolies! Along with Medium Article.☆20Updated 5 years ago
- 🚨 Simple, self-contained fraud detection system built with Apache Kafka and Python☆83Updated 5 years ago
- Create scalable machine learning applications to power a modern data-driven business using Spark☆60Updated last year
- Just a boilerplate for PySpark and Flask☆35Updated 6 years ago
- Slowly Changing Dimension type 2 using Hive query language using exclusive join technique with ORC Hive tables, partitioned and clustered…☆16Updated 5 years ago
- Demonstration of using Apache Spark to build robust ETL pipelines while taking advantage of open source, general purpose cluster computin…☆24Updated last year
- Data Science and Machine Learning with Python - Hands On from Udemy☆14Updated 7 years ago
- Sentiment Analysis of a Twitter Topic with Spark Structured Streaming☆55Updated 5 years ago
- Ingest tweets with Kafka. Use Spark to track popular hashtags and trendsetters for each hashtag☆29Updated 8 years ago
- A code-based tutorial for production level data streaming with PySpark plus Optimus for data cleaning, Confluent Kafka, & Apache Drill u…☆26Updated 5 years ago
- Real-time report dashboard with Apache Kafka, Apache Spark Streaming and Node.js☆49Updated last year
- Big Data Modeling, MapReduce, Spark, PySpark @ Santa Clara University☆154Updated last week
- This repository contains code files specifically IPython notebooks for the assignments in the course "Scalable Machine Learning" by UC Be…☆30Updated 9 years ago
- Final Project for IoT: Big Data Processing and Analytics class. Analyzing U.S nationwide temperature from IoT sensors in real-time☆68Updated 8 years ago
- Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validatio…☆53Updated last year
- ☆53Updated 2 years ago
- Real Time Twitter Sentiment Analysis Product☆21Updated 7 years ago
- Design/Implement stream/batch architecture on NYC taxi data | #DE☆26Updated 3 years ago
- Churn Prediction with PySpark using MLlib and ML Packages☆56Updated 8 years ago
- A project with examples of using few commonly used data manipulation/processing/transformation APIs in Apache Spark 2.0.0☆25Updated 3 years ago
- Build a flask app to server a machine learning model as a RESTful web service☆38Updated 7 years ago
- Learning from multiple companies in Silicon Valley. Netflix, Facebook, Google, Startups☆16Updated 6 years ago
- Data models, build data warehouses and data lakes, automate data pipelines, and worked with massive datasets.☆13Updated 5 years ago
- Real time stock data pipeline --play with Kafka, Cassandra, Spark, Redis, Node.js, Zookeeper☆81Updated 7 years ago
- Mastering Spark for Data Science, published by Packt☆46Updated last year
- Code example to predict prices of Airbnb vacation rentals, using scikit-learn on Spark with spark-sklearn, on MapR.☆44Updated 8 years ago
- Repository used for Spark Trainings☆53Updated last year
- PySpark Code for Hands-on Learners☆114Updated 5 years ago