astronomer / airflow-llm-demo
☆13Updated last year
Alternatives and similar repositories for airflow-llm-demo:
Users that are interested in airflow-llm-demo are comparing it to the libraries listed below
- Docker envinroment to stream data from Kafka to Iceberg tables☆27Updated last year
- Analytics engineering with dbt - projects and developer environment☆18Updated 7 months ago
- Make simple storing test results and visualisation of these in a BI dashboard☆43Updated last month
- Debussy is an opinionated Data Architecture and Engineering framework, enabling data analysts and engineers to build better platforms and…☆28Updated 2 years ago
- ☆21Updated 3 years ago
- ☆12Updated 3 years ago
- Streamlit application to explore Snowflake Tables☆39Updated last year
- Predicting Car Prices with FastAPI, Streamlit, MLflow, Kafka, and Debezium: A Practical Demonstration☆19Updated 5 months ago
- A sample implementation of stream writes to an Iceberg table on GCS using Flink and reading it using Trino☆19Updated 2 years ago
- Solution Accelerators for Serverless Spark on GCP, the industry's first auto-scaling and serverless Spark as a service☆68Updated 11 months ago
- An LLM-powered chatbot with the added context of the dbt knowledge base.☆39Updated 4 months ago
- To provide a deeper understanding of how the modern, open-source data stack consisting of Iceberg, dbt, Trino, and Hive operates within a…☆32Updated last year
- Cost Efficient Data Pipelines with DuckDB☆52Updated 8 months ago
- Data validation library for PySpark 3.0.0☆33Updated 2 years ago
- Delta-Lake, ETL, Spark, Airflow☆47Updated 2 years ago
- This is a real-life, high throughput streaming ELT data pipeline for ecommerce☆13Updated last year
- A new Airflow Provider for Fivetran, maintained by Astronomer and Fivetran☆22Updated this week
- Building Data Lakehouse by open source technology. Support end to end data pipeline, from source data on AWS S3 to Lakehouse, visualize a…☆25Updated last year
- Infrastructure automation to deploy Hadoop,Hive,Spark,airflow nodes on a docker host☆20Updated 6 years ago
- Code snippets for Data Engineering Design Patterns book☆80Updated last month
- Trino dbt demo project to mix and load BigQuery data with and in a local PostgreSQL database☆74Updated 3 years ago
- 📆 Run, schedule, and manage your dbt jobs using Kubernetes.☆24Updated 6 years ago
- Dockerizing an Apache Spark Standalone Cluster☆43Updated 2 years ago
- Sample code to collect Apache Iceberg metrics for table monitoring☆26Updated 8 months ago
- Creates simple data models on Snowflake to report dbt source freshness and tests☆26Updated last year
- Delta reader for the Ray open-source toolkit for building ML applications☆45Updated last year
- Yet Another (Spark) ETL Framework☆21Updated last year
- A package to run DuckDB queries from Apache Airflow.☆19Updated 10 months ago
- A Snowflake GPT Demo using SqlAlchemy☆23Updated last year
- A portable Datamart and Business Intelligence suite built with Docker, Airflow, dbt, PostgreSQL and Superset☆41Updated 5 months ago