pixelwrench / docker-luigiLinks
A data engineering pipeline for harvesting top author data from Medium
☆16Updated 7 years ago
Alternatives and similar repositories for docker-luigi
Users that are interested in docker-luigi are comparing it to the libraries listed below
Sorting:
- Common data science and data engineering utilities to help us perform analytics. Our toolbox for data scientists, licensed under Apache-2…☆30Updated 7 years ago
- Slides produced by Engineers and Data Scientists of Blue Yonder☆51Updated 6 years ago
- Slack notifications for the Luigi workflow manager☆46Updated 4 years ago
- Jupyter Notebook and Python business intelligence tools and techniques. [Raw upload]☆85Updated 2 years ago
- This script pulls the gasoline price time series (from the EIA), and performs unsupervised time series anomaly detection using a variety …☆12Updated 6 years ago
- Just a boilerplate for PySpark and Flask☆36Updated 7 years ago
- CLI tool to launch Spark jobs on AWS EMR☆67Updated 2 years ago
- ☆15Updated 3 years ago
- Server that simplifies connecting pandas to a realtime data feed, testing hypothesis and visualizing results in a web browser☆33Updated 2 years ago
- Read better test failures.☆116Updated last year
- Code and notebooks for a talk given at PyBay, 2018-08-19☆49Updated 4 years ago
- A fork of the cookiecutter-data-science leveraging Docker for local development.☆131Updated 6 years ago
- An example mini data warehouse for python project stats, template for new projects☆178Updated 5 years ago
- Realtime social media data analytics with Apache Spark, Python, Kafka, Pandas, etc☆53Updated 9 years ago
- A Jupyter notebook to accompany Jake VanderPlas's "Statistics for Hackers" talk from PyCon 2016.☆78Updated 7 years ago
- An in-depth introduction to Pandas' MultiIndexes and practical code snippets☆57Updated 7 years ago
- Exploratory code to see if we can learn about feature relationships in a DataFrame using machine learning☆55Updated 6 years ago
- Data analysis and reporting tool for quick access to custom charts and tables in Jupyter Notebooks and in the shell.☆123Updated 3 weeks ago
- ☆87Updated 7 years ago
- Repo for building docker based airflow image. Containers support multiple features like writing logs to local or S3 folder and Initializi…☆32Updated 6 years ago
- Lightweight configuration and access to multiple databases in a single project☆38Updated 2 years ago
- Example of an ETL Pipeline using Airflow☆38Updated 8 years ago
- ☆61Updated 9 years ago
- Airflow workflow management platform chef cookbook.☆70Updated 6 years ago
- A Getting Started Guide for developing and using Airflow Plugins☆93Updated 7 years ago
- Composable filesystem hooks and operators for Apache Airflow.☆17Updated 4 years ago
- ☆101Updated 7 years ago
- Code repository supporting the medium blog☆12Updated 5 years ago
- Code to build a simple analytics data pipeline with Python☆102Updated 8 years ago
- 🚨 Simple, self-contained fraud detection system built with Apache Kafka and Python☆89Updated 6 years ago