mjhea0 / flask-spark-docker
Just a boilerplate for PySpark and Flask
35 stars, updated 7 years ago
Alternatives and similar repositories for flask-spark-docker
Users who are interested in flask-spark-docker are comparing it to the repositories listed below.
- Simple, self-contained fraud detection system built with Apache Kafka and Python (89 stars, updated 6 years ago)
- Realtime social media data analytics with Apache Spark, Python, Kafka, Pandas, etc. (51 stars, updated 9 years ago)
- Sentiment Analysis of a Twitter Topic with Spark Structured Streaming (55 stars, updated 6 years ago)
- Repo for all my code on the articles I post on Medium (107 stars, updated 2 years ago)
- Udacity Data Pipeline Exercises (15 stars, updated 5 years ago)
- Simple alert system implemented in Kafka and Python (96 stars, updated 7 years ago)
- (16 stars, updated 7 years ago)
- Repository used for Spark Trainings (54 stars, updated 2 years ago)
- Code to build a simple analytics data pipeline with Python (102 stars, updated 8 years ago)
- Scaffold of Apache Airflow executing Docker containers (86 stars, updated 2 years ago)
- Code that goes along with https://humansofdata.atlan.com/2018/06/apache-airflow-disease-outbreaks-india/ (24 stars, updated 2 years ago)
- Sample code for tech blog post "Porting Flask to FastAPI for ML Model Serving" (28 stars, updated 2 years ago)
- Code supporting Data Science articles at The Marketing Technologist, Floryn Tech Blog, and Pythom.nl (71 stars, updated 2 years ago)
- Ingest tweets with Kafka. Use Spark to track popular hashtags and trendsetters for each hashtag (29 stars, updated 9 years ago)
- Basic tutorial of using Apache Airflow (36 stars, updated 7 years ago)
- Airflow tutorial for PyCon 2019 (85 stars, updated 2 years ago)
- Airflow workflow management platform Chef cookbook (71 stars, updated 6 years ago)
- Docker container for Kafka - Spark Streaming - Cassandra (98 stars, updated 6 years ago)
- PySpark Code for Hands-on Learners (116 stars, updated 5 years ago)
- Docker compose files for various Kafka stacks (32 stars, updated 7 years ago)
- Code for my presentation: Using PySpark to Process Boat Loads of Data (20 stars, updated 7 years ago)
- Interactive dashboard that shows a decision support system to help DYCD/DOE's award RFPs for the 2015 SONYC expansion (38 stars, updated 3 years ago)
- This repo demonstrates how to load a sample Parquet formatted file from an AWS S3 Bucket. A Python job will then be submitted to an Apach… (19 stars, updated 9 years ago)
- (48 stars, updated 3 years ago)
- Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validatio… (55 stars, updated 2 years ago)
- An example mini data warehouse for Python project stats, template for new projects (178 stars, updated 5 years ago)
- Using Luigi to create a Machine Learning Pipeline using the Rossman Sales data from Kaggle (33 stars, updated 9 years ago)
- PySpark Cookbook, published by Packt (93 stars, updated 2 years ago)
- This is a simple streaming application that utilises Kafka and Python (46 stars, updated 6 years ago)
- Real-time Machine Learning with Apache Spark on Twitter Public Stream (68 stars, updated 8 years ago)