mjhea0 / flask-spark-dockerLinks
Just a boilerplate for PySpark and Flask
☆35Updated 6 years ago
Alternatives and similar repositories for flask-spark-docker
Users that are interested in flask-spark-docker are comparing it to the libraries listed below
Sorting:
- ☆16Updated 7 years ago
- Code to build a simple analytics data pipeline with Python☆102Updated 8 years ago
- Code for my presentation: Using PySpark to Process Boat Loads of Data☆20Updated 7 years ago
- ☆16Updated 4 years ago
- Realtime social media data analytics with Apache Spark, Python, Kafka, Pandas, etc☆51Updated 8 years ago
- Udacity Data Pipeline Exercises☆15Updated 4 years ago
- Public source code for the Batch Processing with Apache Beam (Python) online course☆18Updated 4 years ago
- 🚨 Simple, self-contained fraud detection system built with Apache Kafka and Python☆87Updated 6 years ago
- Repo for all my code on the articles I post on medium☆107Updated 2 years ago
- A simple introduction to using spark ml pipelines☆26Updated 7 years ago
- Developing a Lambda Architecture pipeline using Apache Kafka, Spark Structured Streaming, Redshift, S3, Python☆23Updated 5 years ago
- Sentiment Analysis of a Twitter Topic with Spark Structured Streaming☆55Updated 6 years ago
- Data validation library for PySpark 3.0.0☆33Updated 2 years ago
- Use Airflow to move data from multiple MySQL databases to BigQuery☆100Updated 4 years ago
- Tutorial repo for the article "ML in Production"☆30Updated 2 years ago
- A simple Spark TDD example☆26Updated 7 years ago
- ☆16Updated 2 years ago
- ☆54Updated 6 years ago
- Using Luigi to create a Machine Learning Pipeline using the Rossman Sales data from Kaggle☆33Updated 8 years ago
- PyConDE & PyData Berlin 2019 Airflow Workshop: Airflow for machine learning pipelines.☆47Updated last year
- A Scalable Data Cleaning Library for PySpark.☆27Updated 6 years ago
- scaffold of Apache Airflow executing Docker containers☆85Updated 2 years ago
- Repository used for Spark Trainings☆53Updated 2 years ago
- Simple demonstration of how to build a complex real time machine learning visualization tool.☆16Updated 9 years ago
- A Getting Started Guide for developing and using Airflow Plugins☆93Updated 6 years ago
- ☆49Updated 3 years ago
- PySpark Cookbook, published by Packt☆92Updated 2 years ago
- An example mini data warehouse for python project stats, template for new projects☆179Updated 4 years ago
- Ingest tweets with Kafka. Use Spark to track popular hashtags and trendsetters for each hashtag☆29Updated 9 years ago
- A couple projects using scikit-learn illustrating project decision making.☆15Updated 8 years ago