lucrussell / docker-luigi
A data engineering pipeline for harvesting top author data from Medium
☆16Updated 5 years ago
Related projects ⓘ
Alternatives and complementary repositories for docker-luigi
- ☆16Updated 4 years ago
- This is a simple experiment designed to uncover which technical indicators are the most important.☆13Updated 5 years ago
- Automated Exploratory Data Analysis. Simplifying Data Exploration☆34Updated 4 years ago
- AsyncIO serving for data science models☆24Updated last year
- A package for data science practitioners. This library implements a number of helpful, common data transformations with a scikit-learn fr…☆55Updated 3 years ago
- Pandas-SQLAlchemy integration☆28Updated 8 months ago
- A template for an AWS Lambda function that triggers Prefect Flow Runs☆20Updated 3 years ago
- A Parallel Process Queue for Python☆44Updated last year
- An in-depth introduction to Pandas' MultiIndexes and practical code snippets☆55Updated 6 years ago
- Fast, resilient and reproducible data analysis with cached SQL queries☆30Updated last year
- ☆24Updated 6 years ago
- Common data science and data engineering utilities to help us perform analytics. Our toolbox for data scientists, licensed under Apache-2…☆30Updated 6 years ago
- This script pulls the gasoline price time series (from the EIA), and performs unsupervised time series anomaly detection using a variety …☆12Updated 5 years ago
- Just a boilerplate for PySpark and Flask☆35Updated 6 years ago
- Derivatives models written with the Tributary data flow library☆22Updated 9 months ago
- Using Luigi to create a Machine Learning Pipeline using the Rossman Sales data from Kaggle☆33Updated 8 years ago
- All kinds of survival analysis distributions and methods to optimize how long to wait for them.☆39Updated 3 years ago
- ☆30Updated 6 years ago
- Server that simplifies connecting pandas to a realtime data feed, testing hypothesis and visualizing results in a web browser☆33Updated last year
- Code that goes along with https://humansofdata.atlan.com/2018/06/apache-airflow-disease-outbreaks-india/☆24Updated last year
- Jupyter Notebook and Python business intelligence tools and techniques. [Raw upload]☆84Updated last year
- Code repository supporting the medium blog☆13Updated 4 years ago
- Exploratory code to see if we can learn about feature relationships in a DataFrame using machine learning☆55Updated 5 years ago
- A pandas.DataFrame-based ORM.☆84Updated 2 years ago
- Slack notifications for the Luigi workflow manager☆46Updated 3 years ago