KimaruThagna / ml-pipelines-airflowLinks
Demonstrating and Building ML pipelines in Airflow
☆11Updated 4 years ago
Alternatives and similar repositories for ml-pipelines-airflow
Users that are interested in ml-pipelines-airflow are comparing it to the libraries listed below
Sorting:
- Content for a talk on "The wonderful world of data quality tools in Python"☆18Updated 4 years ago
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou…☆113Updated last year
- Building 3D Trusted Data Pipelines With Dagster, Dbt, and Duckdb☆21Updated last year
- Demo assets for DAIS 2021 'Learn to use Databricks for the full ML lifecycle' Talk☆14Updated 3 years ago
- ☆58Updated 10 months ago
- ☆12Updated 3 years ago
- Analytics engineering with dbt - projects and developer environment☆18Updated 9 months ago
- Snowflake Guide: Building a Recommendation Engine Using Snowflake & Amazon SageMaker☆31Updated 4 years ago
- Scaling Machine Learning in Three Week course in a collaboration with O'Reilly following the guidance of Adi Polak's book - Scaling Machi…☆23Updated 2 years ago
- Full stack data engineering tools and infrastructure set-up☆53Updated 4 years ago
- Operations Research Algorithms☆17Updated last year
- Ingesting data with Pulumi, AWS lambdas and Snowflake in a scalable, fully replayable manner☆71Updated 3 years ago
- Cost Efficient Data Pipelines with DuckDB☆54Updated last month
- ☆18Updated 3 years ago
- ☆12Updated 3 years ago
- Code snippets for Data Engineering Design Patterns book☆119Updated 3 months ago
- Docker compose and Google Colab demo to build a CDC with Delta Lake☆15Updated 2 years ago
- ☆35Updated last month
- build dw with dbt☆46Updated 8 months ago
- ☆49Updated 3 years ago
- This repository contains recipes for Apache Pinot.☆30Updated 4 months ago
- The Python fake data producer for Apache Kafka® is a complete demo app allowing you to quickly produce JSON fake streaming datasets and …☆85Updated last year
- Build your feature store with macros right within your dbt repository☆38Updated 2 years ago
- Fake Pandas / PySpark DataFrame creator☆47Updated last year
- Example repo to kickstart integration with mlflow pipelines.☆76Updated 2 years ago
- ☆22Updated 3 months ago
- Public source code for the Batch Processing with Apache Beam (Python) online course☆18Updated 4 years ago
- A Python PySpark Projet with Poetry☆23Updated 9 months ago
- Powerful rapid automatic EDA and feature engineering library with a very easy to use API 🌟☆53Updated 3 years ago
- A Python package that creates fine-grained dbt tasks on Apache Airflow☆70Updated 9 months ago