KimaruThagna / ml-pipelines-airflow
Demonstrating and Building ML pipelines in Airflow
☆11Updated 3 years ago
Alternatives and similar repositories for ml-pipelines-airflow:
Users that are interested in ml-pipelines-airflow are comparing it to the libraries listed below
- Public source code for the Batch Processing with Apache Beam (Python) online course☆18Updated 4 years ago
- ☆26Updated 3 years ago
- Essential PySpark for Scalable Data Analytics, published by Packt☆44Updated 2 years ago
- Content for a talk on "The wonderful world of data quality tools in Python"☆18Updated 3 years ago
- ☆12Updated 3 years ago
- Scaling Python Machine Learning☆45Updated last year
- Big Data Demystified meetup and blog examples☆31Updated 8 months ago
- A Postgres data warehouse for processing synthetic data using IAC principles☆17Updated 2 years ago
- Sample project that use Dagster, dbt, DuckDB and Dash to visualize car and motorcycle Spanish market☆58Updated 2 years ago
- Scaling Machine Learning in Three Week course in a collaboration with O'Reilly following the guidance of Adi Polak's book - Scaling Machi…☆23Updated last year
- Streamlit application to explore Snowflake Tables☆39Updated last year
- An example MLFlow project☆48Updated 3 months ago
- A tutorial that helps Big Data Engineers ramp up faster by getting familiar with PySpark dataframes and functions. It also covers topics …☆20Updated 3 years ago
- The goal of this project is to offer an AWS EMR template using Spot Fleet and On-Demand Instances that you can use quickly. Just focus on…☆27Updated 2 years ago
- Code examples for the Introduction to Kubeflow course☆14Updated 4 years ago
- Operations Research Algorithms☆17Updated last year
- Demonstrating the capabilities of DuckDB as a transformation engine for data lakes☆24Updated 6 months ago
- Docker compose and Google Colab demo to build a CDC with Delta Lake☆15Updated 2 years ago
- Open Benchmarks for Evaluating the Performance of Feature Stores☆35Updated last year
- A Snowflake GPT Demo using SqlAlchemy☆23Updated last year
- Best practices for engineering ML pipelines.☆35Updated 2 years ago
- ☆36Updated 2 years ago
- PipeRider dbt workshop for DataTalksClub DE Zoomcamp☆17Updated last year
- DataHub on AWS demonstration resources☆10Updated 2 years ago
- dbt / Amazon Redshift Demonstration Project☆34Updated 2 years ago
- Code for my "Efficient Data Processing in SQL" book.☆56Updated 8 months ago
- Public repository for the Search Fundamentals course taught by Daniel Tunkelang and Grant Ingersoll. Available at https://corise.com/cour…☆42Updated last year
- Snowflake Guide: Building a Recommendation Engine Using Snowflake & Amazon SageMaker☆31Updated 3 years ago
- Sample Airflow DAGs to load data from the CovidTracking API to Snowflake via an AWS S3 intermediary.☆16Updated 4 years ago
- Repository containing various utils related to Snowflake migration at Faire.☆12Updated 2 years ago