mjhea0 / flask-spark-docker
Just a boilerplate for PySpark and Flask
☆35Updated 6 years ago
Alternatives and similar repositories for flask-spark-docker:
Users that are interested in flask-spark-docker are comparing it to the libraries listed below
- Repo for all my code on the articles I post on medium☆107Updated 2 years ago
- ☆16Updated 7 years ago
- Code for my presentation: Using PySpark to Process Boat Loads of Data☆20Updated 7 years ago
- Udacity Data Pipeline Exercises☆15Updated 4 years ago
- 🚨 Simple, self-contained fraud detection system built with Apache Kafka and Python☆86Updated 6 years ago
- Realtime social media data analytics with Apache Spark, Python, Kafka, Pandas, etc☆51Updated 8 years ago
- Ingest tweets with Kafka. Use Spark to track popular hashtags and trendsetters for each hashtag☆29Updated 9 years ago
- Repository used for Spark Trainings☆53Updated 2 years ago
- A simple introduction to using spark ml pipelines☆26Updated 7 years ago
- Code to build a simple analytics data pipeline with Python☆102Updated 8 years ago
- Workshop for Spark and Databricks☆54Updated 5 years ago
- Sentiment Analysis of a Twitter Topic with Spark Structured Streaming☆55Updated 6 years ago
- How to manage Slowly Changing Dimensions with Apache Hive☆55Updated 5 years ago
- ☆26Updated last year
- Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validatio…☆54Updated 2 years ago
- A repository for a PySpark Cookbook by Tomasz Drabas and Denny Lee☆59Updated 6 years ago
- Developing a Lambda Architecture pipeline using Apache Kafka, Spark Structured Streaming, Redshift, S3, Python☆23Updated 5 years ago
- Real-time Machine Learning with Apache Spark on Twitter Public Stream☆68Updated 8 years ago
- ☆16Updated 2 years ago
- sample code for tech blog post "Porting Flask to FastAPI for ML Model Serving"☆28Updated last year
- AWS Big Data Certification☆25Updated 3 months ago
- event-triggered plugins for airflow☆21Updated 5 years ago
- Docker compose files for various kafka stacks☆32Updated 7 years ago
- Using Luigi to create a Machine Learning Pipeline using the Rossman Sales data from Kaggle☆33Updated 8 years ago
- Simple demonstration of how to build a complex real time machine learning visualization tool.☆16Updated 9 years ago
- A simple Spark TDD example☆26Updated 7 years ago
- Helping you get Airflow running in production.☆9Updated 5 years ago
- Analyzing NBA data using Spark 2.1☆46Updated 8 years ago
- How to use Python to understand data and transform the data into a tidy format ready to be used for modelling and visualisation.☆37Updated 5 years ago
- Supporting content (slides and exercises) for the Addison-Wesley (Pearson) video series covering best practices for developing scalable S…☆67Updated 9 years ago