pixelwrench / docker-luigiLinks
A data engineering pipeline for harvesting top author data from Medium
☆16Updated 6 years ago
Alternatives and similar repositories for docker-luigi
Users that are interested in docker-luigi are comparing it to the libraries listed below
Sorting:
- ☆16Updated 5 years ago
- Slides produced by Engineers and Data Scientists of Blue Yonder☆50Updated 5 years ago
- T4 is now in production as Quilt 3☆64Updated 6 years ago
- Simple, light-weight data frames for Python☆26Updated 3 weeks ago
- Fast, resilient and reproducible data analysis with cached SQL queries☆30Updated last year
- Automated Exploratory Data Analysis. Simplifying Data Exploration☆36Updated 5 years ago
- 🐍💨 Airflow tutorial for PyCon 2019☆86Updated 2 years ago
- Common data science and data engineering utilities to help us perform analytics. Our toolbox for data scientists, licensed under Apache-2…☆30Updated 7 years ago
- A package for data science practitioners. This library implements a number of helpful, common data transformations with a scikit-learn fr…☆57Updated 4 years ago
- ☆15Updated 2 years ago
- Slack notifications for the Luigi workflow manager☆46Updated 3 years ago
- Utilities for creating ETL pipelines with mara☆36Updated 3 years ago
- ☆29Updated 8 years ago
- Quickly ingest messy CSV and XLS files. Export to clean pandas, SQL, parquet☆196Updated 2 years ago
- Code, slides, and documentation for the talks I have given.☆113Updated last month
- Derivatives models written with the Tributary data flow library☆23Updated last week
- Blog post on ETL pipelines with Airflow☆23Updated 5 years ago
- Exploratory code to see if we can learn about feature relationships in a DataFrame using machine learning☆55Updated 5 years ago
- Create matplotlib plots with the art style of Randall Munroe's xkcd☆86Updated 5 years ago
- An example mini data warehouse for python project stats, template for new projects☆179Updated 4 years ago
- A fork of the cookiecutter-data-science leveraging Docker for local development.☆131Updated 5 years ago
- Airflow workflow management platform chef cookbook.☆71Updated 6 years ago
- Accelerate data science☆116Updated 4 years ago
- scaffold of Apache Airflow executing Docker containers☆85Updated 2 years ago
- Server that simplifies connecting pandas to a realtime data feed, testing hypothesis and visualizing results in a web browser☆33Updated 2 years ago
- Jupyter Notebook and Python business intelligence tools and techniques. [Raw upload]☆85Updated 2 years ago
- All kinds of survival analysis distributions and methods to optimize how long to wait for them.☆39Updated 4 years ago
- ☆86Updated 7 years ago
- SQL on dataframes - pandas and dask☆64Updated 7 years ago
- Airflow plugin to transfer arbitrary files between operators☆78Updated 6 years ago