gavinln / airflow-local
An Ubuntu Vagrant Virtual Machine (VM) with Airflow, a data workflow management system from Airbnb
☆9Updated 4 years ago
Related projects ⓘ
Alternatives and complementary repositories for airflow-local
- Deploy sentiment analysis using Flask☆17Updated 5 years ago
- A python client library for the Stitch Import API☆42Updated 10 months ago
- Multidimensional data explorer and visualization tool.☆52Updated 7 years ago
- Airflow plugin to transfer arbitrary files between operators☆78Updated 6 years ago
- NLP text recommendation system built in Python using Gensim, spaCy, and Plotly Dash☆15Updated 6 years ago
- ☆14Updated 7 years ago
- Sample repo for luigi tasks & config☆36Updated 8 years ago
- Topic models (just LDA for now) on the Hacker News corpus☆22Updated 9 years ago
- Hierarchical Clustering Algorithms☆35Updated 2 years ago
- Tutorial code and data for the entity resolution workshops.☆45Updated 9 years ago
- Automated Exploratory Data Analysis. Simplifying Data Exploration☆34Updated 4 years ago
- Blog post on ETL pipelines with Airflow☆23Updated 4 years ago
- Set of iPython and Jupyter extensions to improve user experience☆50Updated 4 years ago
- Build your feature store with macros right within your dbt repository☆37Updated last year
- How to do data science with Optimus, Spark and Python.☆18Updated 5 years ago
- A simple command line interface to the datamade/dedupe library.☆42Updated last year
- Extend pandas to_sql function to perform multi-threaded, concurrent "insert or update" command in memory☆84Updated 7 months ago
- PyData London 2016 material☆37Updated 8 years ago
- A repository for the "Combining DBpedia and Topic Modeling" GSoC 2016 idea☆13Updated 8 years ago
- python package for performing deduplication using flexible text matching and cleaning in pandas dataframe☆25Updated 3 years ago
- Analysis pipeline for quick ML analyses.☆11Updated 6 years ago
- Code to 1) scrap wikipedia page view counts, and to 2) conduct time series analysis with GAM☆47Updated 7 years ago
- A Getting Started Guide for developing and using Airflow Plugins☆94Updated 6 years ago
- ☆10Updated 4 years ago
- dagster scikit-learn pipeline example.☆43Updated last year
- Predict age and gender from a first name☆60Updated 6 years ago
- Tough and flexible tools for data analysis, transformation, validation and movement.☆136Updated 9 months ago
- A Scalable Data Cleaning Library for PySpark.☆26Updated 5 years ago