ammarchalifah / airflow-pipeline
Project repository of Apache Airflow, deployed on Docker in Amazon EC2 via GitLab.
☆15Updated 3 years ago
Alternatives and similar repositories for airflow-pipeline:
Users that are interested in airflow-pipeline are comparing it to the libraries listed below
- Schedule a data pipeline in Google Cloud using cloud function, BigQuery, cloud storage, cloud scheduler, stack trace, cloud build, and p…☆26Updated 5 years ago
- ☆23Updated 5 years ago
- A few end to end examples that use data-describe☆16Updated last year
- Demo on how to use Prefect with Docker☆25Updated 2 years ago
- Blog post on ETL pipelines with Airflow☆23Updated 4 years ago
- Awesome Orchest projects, both official and submitted by the community.☆25Updated last year
- Demo on how to use Prefect 2 in an ML project☆41Updated 2 years ago
- ☆15Updated last year
- Using Python and Flourish to visualize rank and revenue trends of the world’s largest companies☆13Updated 10 months ago
- Full stack data engineering tools and infrastructure set-up☆49Updated 4 years ago
- A tutorial that helps Big Data Engineers ramp up faster by getting familiar with PySpark dataframes and functions. It also covers topics …☆20Updated 3 years ago
- A fully-featured multi-source data pipeline for continuously extracting knowledge from COVID-19 data.☆21Updated 3 years ago
- Prefect integrations for working with OpenAI.☆36Updated 9 months ago
- Serverless Superset on Google Cloud☆22Updated 3 years ago
- A lightweight tool to fetch tables from BigQuery as pandas DataFrame very fast using BigQuery Storage API combined with multiprocessing☆27Updated last year
- Big Data Demystified meetup and blog examples☆31Updated 6 months ago
- This project demonstrates how to build and automate an ETL pipeline using DAGs in Airflow and load the transformed data to Bigquery. Ther…☆20Updated last week
- This repository contains an example of how to leverage Cloud Composer and Cloud Dataflow to move data from a Microsoft SQL Server to BigQ…☆18Updated 9 months ago
- A project for exploring how Great Expectations can be used to ensure data quality and validate batches within a data pipeline defined in …☆21Updated 2 years ago
- Orchestration of data science and earth observation models in Apache Airflow, scale-up with Celery Executor, experiment with jupyter note…☆35Updated 2 years ago
- Open Data Stack Projects: Examples of End to End Data Engineering Projects☆75Updated last year
- ☆13Updated last month
- Data lake, data warehouse on GCP☆55Updated 3 years ago
- Delta-Lake, ETL, Spark, Airflow☆46Updated 2 years ago
- Building 3D Trusted Data Pipelines With Dagster, Dbt, and Duckdb☆20Updated last year
- how to unit test your PySpark code☆28Updated 3 years ago
- ☆17Updated last year
- Run Apache Airflow on OpenShift☆14Updated 3 years ago
- TensorFlow Serving + Streamlit!☆22Updated 3 years ago