RichardAfolabi / Realtime-Data-Analytics-Using-Spark
Realtime social media data analytics with Apache Spark, Python, Kafka, Pandas, etc.
☆51Updated 8 years ago
Alternatives and similar repositories for Realtime-Data-Analytics-Using-Spark:
Users who are interested in Realtime-Data-Analytics-Using-Spark are comparing it to the repositories listed below
- ☆26Updated last year
- Data Science and Machine Learning with Python - Hands On from Udemy☆14Updated 7 years ago
- Just a boilerplate for PySpark and Flask☆35Updated 6 years ago
- Sentiment Analysis of a Twitter Topic with Spark Structured Streaming☆55Updated 6 years ago
- Ingest tweets with Kafka. Use Spark to track popular hashtags and trendsetters for each hashtag☆29Updated 8 years ago
- Some class materials for a data processing course using PySpark☆52Updated 2 years ago
- Mastering Spark for Data Science, published by Packt☆47Updated 2 years ago
- Code repository for Large Scale Machine Learning with Spark by Packt☆20Updated 2 years ago
- Slowly Changing Dimension type 2 using Hive query language using exclusive join technique with ORC Hive tables, partitioned and clustered…☆16Updated 5 years ago
- Real-time Machine Learning with Apache Spark on Twitter Public Stream☆68Updated 7 years ago
- Apache Spark Interview Questions and Answers☆20Updated 4 years ago
- Code for my presentation: Using PySpark to Process Boat Loads of Data☆20Updated 7 years ago
- code, labs and lectures for the course☆46Updated last year
- Sharing interesting and noteworthy Data Engineering content☆67Updated 8 years ago
- Repository used for Spark Trainings☆53Updated last year
- Data modeling, building data warehouses and data lakes, automating data pipelines, and working with massive datasets.☆13Updated 5 years ago
- Demonstration of using Apache Spark to build robust ETL pipelines while taking advantage of open source, general purpose cluster computin…☆24Updated last year
- Churn Prediction with PySpark using MLlib and ML Packages☆56Updated 9 years ago
- PySpark Code for Hands-on Learners☆116Updated 5 years ago
- Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validatio…☆53Updated last year
- Some notebook examples related to Apache Spark, IPython / Jupyter, Zeppelin☆52Updated 8 years ago
- Deep Learning with Apache Spark and Deep Cognition☆59Updated 6 years ago
- Spark application for analyzing Apache access logs and detecting anomalies, with an accompanying Medium article.☆20Updated 6 years ago
- Twitter Sentiment Analysis using Spark and Kafka☆115Updated 5 years ago
- A code-based tutorial for production level data streaming with PySpark plus Optimus for data cleaning, Confluent Kafka, & Apache Drill u…☆26Updated 5 years ago
- Final Project for IoT: Big Data Processing and Analytics class. Analyzing U.S nationwide temperature from IoT sensors in real-time☆69Updated 8 years ago
- Code example to predict prices of Airbnb vacation rentals, using scikit-learn on Spark with spark-sklearn, on MapR.☆44Updated 8 years ago
- Use Kafka and Apache Spark streaming to perform click stream analytics☆76Updated 5 years ago
- Updated repository☆157Updated 3 years ago
- Simple demonstration of how to build a complex real time machine learning visualization tool.☆16Updated 8 years ago