kadnan / airflow-scrapingLinks
Using Apache Airflow to schedule web scrapers
☆42Updated 6 years ago
Alternatives and similar repositories for airflow-scraping
Users that are interested in airflow-scraping are comparing it to the libraries listed below
Sorting:
- ETL with Python - Taught at DWH course 2017 (TAU)☆103Updated 7 years ago
- ☆111Updated 6 months ago
- Example of an ETL Pipeline using Airflow☆35Updated 7 years ago
- A tutorial on streaming data from a Flask REST API and streaming the response into PostgreSQL☆39Updated 5 years ago
- Blog post on ETL pipelines with Airflow☆23Updated 5 years ago
- Code to build a simple analytics data pipeline with Python☆102Updated 8 years ago
- (project & tutorial) dag pipeline tests + ci/cd setup☆88Updated 4 years ago
- A curated list of awesome customer analytics content☆98Updated 7 years ago
- Simple alert system implemented in Kafka and Python☆96Updated 7 years ago
- Apache Airflow in Docker Compose (for both versions 1.10.* and 2.*)☆186Updated last year
- ☆46Updated 3 years ago
- Example DAGs using hooks and operators from Airflow Plugins☆343Updated 6 years ago
- Jupyter notebooks for pyspark tutorials given at University☆108Updated this week
- Jupyter notebook for scraping and analysis of most in demand job technologies skills for data scientists.☆47Updated 5 years ago
- Learn to build a data pipeline with Airflow to automate wrangling data - An Udacity Data Engineer Nano Degree Project☆8Updated 5 years ago
- Web Scraping with Beautiful Soup and Selenium☆131Updated last year
- Data lake, data warehouse on GCP☆56Updated 3 years ago
- 🐳📊🤓Cookiecutter template to launch an awesome dockerized Data Science toolstack (incl. Jupyster, Superset, Postgres, Minio, AirFlow & …☆213Updated 2 years ago
- 🚨 Simple, self-contained fraud detection system built with Apache Kafka and Python☆88Updated 6 years ago
- Just a boilerplate for PySpark and Flask☆35Updated 6 years ago
- Airflow basics tutorial☆397Updated 3 years ago
- scaffold of Apache Airflow executing Docker containers☆85Updated 2 years ago
- Basic tutorial of using Apache Airflow☆36Updated 6 years ago
- How to use Python to understand data and transform the data into a tidy format ready to be used for modelling and visualisation.☆37Updated 6 years ago
- Analyzing and calculating key marketing metrics with SQL and Python☆14Updated 6 years ago
- A complete development environment setup for working with Airflow☆128Updated 2 years ago
- 🐍💨 Airflow tutorial for PyCon 2019☆86Updated 2 years ago
- Airflow ETL for Meetup API☆45Updated 6 years ago
- Explore 120 million taxi trips in real time with Dash and Vaex☆117Updated 4 years ago
- Python wrapper for Goodreads API☆30Updated 5 years ago