kadnan / airflow-scrapingLinks
Using Apache Airflow to schedule web scrapers
☆42Updated 6 years ago
Alternatives and similar repositories for airflow-scraping
Users that are interested in airflow-scraping are comparing it to the libraries listed below
Sorting:
- ETL with Python - Taught at DWH course 2017 (TAU)☆103Updated 8 years ago
- Code to build a simple analytics data pipeline with Python☆102Updated 8 years ago
- A curated list of awesome customer analytics content☆98Updated 7 years ago
- 🐍💨 Airflow tutorial for PyCon 2019☆85Updated 2 years ago
- Simple alert system implemented in Kafka and Python☆97Updated 7 years ago
- Blog post on ETL pipelines with Airflow☆23Updated 5 years ago
- scaffold of Apache Airflow executing Docker containers☆86Updated 2 years ago
- ☆112Updated 8 months ago
- Example DAGs using hooks and operators from Airflow Plugins☆346Updated 7 years ago
- Example of an ETL Pipeline using Airflow☆36Updated 8 years ago
- ☆46Updated 3 years ago
- (project & tutorial) dag pipeline tests + ci/cd setup☆88Updated 4 years ago
- Scraps jobs listings from Glassdoor☆33Updated 5 years ago
- 🐳📊🤓Cookiecutter template to launch an awesome dockerized Data Science toolstack (incl. Jupyster, Superset, Postgres, Minio, AirFlow & …☆214Updated 2 years ago
- Simple Python examples including data analysis, ETL, web scraping☆76Updated 2 years ago
- Code, slides, and documentation for the talks I have given.☆113Updated 2 months ago
- Jupyter notebook for scraping and analysis of most in demand job technologies skills for data scientists.☆47Updated 5 years ago
- 🚨 Simple, self-contained fraud detection system built with Apache Kafka and Python☆88Updated 6 years ago
- Data lake, data warehouse on GCP☆56Updated 3 years ago
- Analyzing and calculating key marketing metrics with SQL and Python☆14Updated 6 years ago
- [Book-2019] Pragmatic AI: An Introduction to Cloud-based Machine Learning☆137Updated 7 months ago
- Airflow basics tutorial☆397Updated 3 years ago
- Apache Airflow in Docker Compose (for both versions 1.10.* and 2.*)☆186Updated last year
- Airflow training for the crunch conf☆105Updated 6 years ago
- How to use Python to understand data and transform the data into a tidy format ready to be used for modelling and visualisation.☆36Updated 6 years ago
- ☆26Updated 5 years ago
- A tutorial on streaming data from a Flask REST API and streaming the response into PostgreSQL☆39Updated 5 years ago
- Just a boilerplate for PySpark and Flask☆35Updated 7 years ago
- A curated list of repositories for my book Machine Learning Solutions.☆79Updated 7 years ago
- Basic tutorial of using Apache Airflow☆36Updated 6 years ago