kadnan / airflow-scraping
Using Apache Airflow to schedule web scrapers
☆42Updated 6 years ago
Alternatives and similar repositories for airflow-scraping
Users that are interested in airflow-scraping are comparing it to the libraries listed below
Sorting:
- Blog post on ETL pipelines with Airflow☆23Updated 4 years ago
- ☆110Updated 4 months ago
- Basic tutorial of using Apache Airflow☆36Updated 6 years ago
- Repo for building docker based airflow image. Containers support multiple features like writing logs to local or S3 folder and Initializi…☆32Updated 5 years ago
- 🐍💨 Airflow tutorial for PyCon 2019☆86Updated 2 years ago
- Code to build a simple analytics data pipeline with Python☆102Updated 8 years ago
- Example of an ETL Pipeline using Airflow☆34Updated 7 years ago
- Python client for the DSS public API☆41Updated this week
- A tutorial on streaming data from a Flask REST API and streaming the response into PostgreSQL☆39Updated 5 years ago
- A curated list of awesome customer analytics content☆97Updated 7 years ago
- scaffold of Apache Airflow executing Docker containers☆85Updated 2 years ago
- Runnable e-commerce mini data warehouse based on Python, PostgreSQL & Metabase, template for new projects☆29Updated 4 years ago
- ETL with Python - Taught at DWH course 2017 (TAU)☆103Updated 7 years ago
- Data lake, data warehouse on GCP☆56Updated 3 years ago
- Cloned by the `dbt init` task☆61Updated last year
- Apache Airflow in Docker Compose (for both versions 1.10.* and 2.*)☆186Updated last year
- Airflow training for the crunch conf☆105Updated 6 years ago
- Google Trends + Python + Google Sheets API + Tableau Public = Full Automation☆31Updated 7 years ago
- Python wrapper for Goodreads API☆29Updated 5 years ago
- ☆23Updated 6 years ago
- ☆46Updated 3 years ago
- Tutorial like code for how to deploy airflow using docker and how to use the DockerOperator.☆44Updated 5 years ago
- PyConDE & PyData Berlin 2019 Airflow Workshop: Airflow for machine learning pipelines.☆47Updated last year
- Use Airflow to move data from multiple MySQL databases to BigQuery☆100Updated 4 years ago
- Machine Learning Virtual Machine (provisioned with Vagrant) for building Spark Notebook applications☆54Updated last month
- ☆54Updated 6 years ago
- Learn to build a data pipeline with Airflow to automate wrangling data - An Udacity Data Engineer Nano Degree Project☆8Updated 5 years ago
- Common data science and data engineering utilities to help us perform analytics. Our toolbox for data scientists, licensed under Apache-2…☆30Updated 6 years ago
- Python Notes on IPython Notebook files.☆37Updated 4 years ago
- ☆27Updated 5 years ago