astronomer / airflow-guide-passing-data-between-tasks
☆10Updated 3 years ago
Alternatives and similar repositories for airflow-guide-passing-data-between-tasks:
Users that are interested in airflow-guide-passing-data-between-tasks are comparing it to the libraries listed below
- Demo of Streamlit application with Databricks SQL Endpoint☆36Updated 2 years ago
- Machine Learning Ops Project☆29Updated 10 months ago
- Deploy A/B testing infrastructure in a containerized microservice architecture for Machine Learning applications.☆40Updated last month
- Deploying a Machine Learning model streaming application with Apache Kafka☆10Updated 2 years ago
- Data pipeline for extracting, transforming, and visualising Covid-19 data☆14Updated last year
- A series on learning Apache Spark (PySpark), with quick tips and workarounds for everyday problems☆46Updated last year
- A pipeline to detect data drift and retrain the model when there is drift☆23Updated last year
- This repo gives an introduction to setting up streaming analytics using open source technologies☆24Updated last year
- Create a local dashboard to visualize and filter your GitHub feed☆29Updated 2 years ago
- Using a feature store to connect the DataOps and MLOps workflows to enable collaborative teams to develop efficiently.☆55Updated 2 years ago
- Project for real-time anomaly detection using Kafka and Python☆59Updated 2 years ago
- Build a data warehouse with dbt☆36Updated 3 months ago
- This repo is meant to make it really easy to analyze the interplays between health and social media use.☆43Updated 2 years ago
- Template for data pipelines, ML workflows, API dev and monitoring☆45Updated last year
- Create streaming data, send it to Kafka, transform it with PySpark, and write it to Elasticsearch and MinIO☆59Updated last year
- A project from the ml_ops Zoomcamp (DataTalks) using Semiconductor data☆22Updated 2 years ago
- Get data from API, run a scheduled script with Airflow, send data to Kafka and consume with Spark, then write to Cassandra☆132Updated last year
- A batch processing data pipeline, using AWS resources (S3, EMR, Redshift, EC2, IAM), provisioned via Terraform, and orchestrated from loc…☆21Updated 2 years ago
- An overview of an MLOps architecture that includes both Airflow and MLflow running in separate Docker containers.☆21Updated 2 years ago
- Minimalistic text search engine that uses sklearn and pandas☆19Updated 2 months ago
- A real-time streaming ETL pipeline for streaming and performing sentiment analysis on Twitter data using Apache Kafka, Apache Spark and D…☆30Updated 4 years ago
- A quick reference guide to the most commonly used patterns and functions in PySpark SQL☆54Updated 3 years ago
- Code repository for my upcoming session. Details will be updated after the session.☆40Updated 2 years ago
- This repo contains "Databricks Certified Data Engineer Professional" Questions and related docs.☆58Updated 6 months ago
- A list of all my posts and personal projects☆69Updated 8 months ago
- This repository contains the code for a realtime election voting system. The system is built using Python, Kafka, Spark Streaming, Postgr…☆34Updated last year
- Projects done in the Data Engineer Nanodegree Program by Udacity.com☆107Updated 2 years ago