Productionalizing Data Pipelines with Apache Airflow
☆116Jun 18, 2022Updated 3 years ago
Alternatives and similar repositories for productionalizing-data-pipelines-airflow
Users that are interested in productionalizing-data-pipelines-airflow are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Data Observability for Data Engineering, published by Packt Publishing☆11Jan 24, 2025Updated last year
- Scripts to convert tables from SQL Server to Snowflake☆13Jun 27, 2019Updated 6 years ago
- This is where we put useful code for our daily job with data.☆28Mar 19, 2025Updated last year
- examples for a book by the same name☆29Jul 3, 2018Updated 7 years ago
- Code to help generate SQL for stakeholders. Code at https://www.startdataengineering.com/post/data-democratize-llm/☆13May 24, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆10May 5, 2022Updated 3 years ago
- Local AWS EMR - A local service that imitates AWS EMR☆27Jul 5, 2023Updated 2 years ago
- Datasets for Drug Discovery and Development☆10Aug 22, 2020Updated 5 years ago
- This project helps me to understand the core concepts of Apache Airflow. I have created custom operators to perform tasks such as staging…☆101Aug 11, 2019Updated 6 years ago
- Various useful data structures in Python☆39Nov 14, 2019Updated 6 years ago
- ☆23Nov 17, 2019Updated 6 years ago
- duckdb-etl-framework☆15Dec 20, 2024Updated last year
- Spark cluster in docker containers with sample training Jupyter notebooks☆26Feb 24, 2023Updated 3 years ago
- Glue VSCode devcontainer setup☆14Jan 31, 2023Updated 3 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Running a flask app and celery worker in the same docker container.☆31Feb 20, 2020Updated 6 years ago
- Airflow POC demo : 1) env set up 2) airflow DAG 3) Spark/ML pipeline | #DE☆11Dec 19, 2022Updated 3 years ago
- ☆10May 18, 2022Updated 3 years ago
- ☆11May 26, 2022Updated 3 years ago
- ☆17Nov 26, 2024Updated last year
- This is my Apache Airflow Local development setup on Windows 10 WSL2/Mac using docker-compose. It will also include some sample DAGs and …☆34Feb 9, 2024Updated 2 years ago
- Sample code supporting the `Generating REST APIs from data classes in Python` blog post☆11May 20, 2024Updated last year
- Check out the dash visualization at https://dash-drug-explorer.plot.ly/out☆12Dec 26, 2022Updated 3 years ago
- Docker compose and Google Colab demo to build a CDC with Delta Lake☆15Sep 7, 2022Updated 3 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Pet Match matches the user with a pet. When launched, this Alexa Skill will prompt the user for the information it needs to determine a m…☆23Dec 13, 2022Updated 3 years ago
- Profiles data in Snowflake tables and views including statistics, data classification and more.☆10Aug 21, 2025Updated 8 months ago
- An Open API that contains information about terpenes, the effects, and the cannabis varieties that contain them.☆13Mar 13, 2021Updated 5 years ago
- Demo that extends the FastUI example & adds database persistence☆16Jan 2, 2024Updated 2 years ago
- A demonstration of an ELT (Extract, Load, Transform) pipeline☆31Feb 19, 2024Updated 2 years ago
- ☆11Dec 12, 2019Updated 6 years ago
- ☆17Aug 29, 2018Updated 7 years ago
- NIFTY50 Data Analysis from scratch (Data Extraction & Visualization to Investment Insights)☆16May 20, 2023Updated 2 years ago
- Building a Parallelized NLP Data Pipeline with Metaflow☆10Sep 20, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A deep learning based bioinformatics project on epigenetics in Type 2 Diabetes.☆17Mar 25, 2023Updated 3 years ago
- Repo that will help you explore how to build a hybrid workflow using Apache Airflow and Amazon ECS Anywhere☆11Jul 12, 2022Updated 3 years ago
- Stream Processing Workshop☆23Jan 26, 2026Updated 3 months ago
- ☆10Jul 14, 2022Updated 3 years ago
- Proyecto de juguete para mostrar cómo realizar el setup de un proyecto de data science☆11Nov 24, 2022Updated 3 years ago
- fastapi-graphql☆13Sep 18, 2023Updated 2 years ago
- ☆12Oct 15, 2023Updated 2 years ago