KimaruThagna / ml-pipelines-airflowLinks
Demonstrating and Building ML pipelines in Airflow
☆11Updated 4 years ago
Alternatives and similar repositories for ml-pipelines-airflow
Users that are interested in ml-pipelines-airflow are comparing it to the libraries listed below
Sorting:
- ☆99Updated 8 months ago
- Ingesting data with Pulumi, AWS lambdas and Snowflake in a scalable, fully replayable manner☆71Updated 3 years ago
- Cost Efficient Data Pipelines with DuckDB☆57Updated 4 months ago
- ☆27Updated 3 years ago
- Repo for CDC with debezium blog post☆29Updated last year
- Support for generating modern platforms dynamically with services such as Kafka, Spark, Streamsets, HDFS, ....☆76Updated this week
- Data Engineering with Spark and Delta Lake☆104Updated 2 years ago
- Supporting content (slides and exercises) for the Pearson video series covering best practices for developing scalable applications with …☆52Updated 8 months ago
- Scaling Python Machine Learning☆50Updated 2 years ago
- A write-audit-publish implementation on a data lake without the JVM☆46Updated last year
- The go to demo for public and private dbt Learn☆80Updated 6 months ago
- A Series of Notebooks on how to start with Kafka and Python☆152Updated 7 months ago
- Read Delta tables without any Spark☆47Updated last year
- New generation opensource data stack☆73Updated 3 years ago
- Demos of Materialize, the operational data warehouse.☆51Updated 7 months ago
- Code snippets for Data Engineering Design Patterns book☆207Updated 6 months ago
- open source data lake☆24Updated 8 months ago
- Feast AWS guide using Redshift / Spectrum / DynamoDB to build a credit scoring model☆67Updated 3 years ago
- Sample project that use Dagster, dbt, DuckDB and Dash to visualize car and motorcycle Spanish market☆58Updated 2 years ago
- Utility functions for dbt projects running on Spark☆33Updated 7 months ago
- A guide for leading a data (engineering) team☆63Updated last year
- Cloned by the `dbt init` task☆62Updated last year
- Code for my "Efficient Data Processing in SQL" book.☆59Updated last year
- Developed a data pipeline to automate data warehouse ETL by building custom airflow operators that handle the extraction, transformation,…☆90Updated 3 years ago
- Official repo for the Materialize + Redpanda + dbt Hack Day 2022, including a sample project to get everyone started!☆60Updated 3 years ago
- ☆12Updated 3 years ago
- ☆27Updated 3 years ago
- ☆60Updated last year
- Execution of DBT models using Apache Airflow through Docker Compose☆120Updated 2 years ago
- Full stack data engineering tools and infrastructure set-up☆56Updated 4 years ago