astronomer / airflow-llm-demo
☆13Updated 2 years ago
Alternatives and similar repositories for airflow-llm-demo
Users who are interested in airflow-llm-demo are comparing it to the libraries listed below
- Solution Accelerators for Serverless Spark on GCP, the industry's first auto-scaling and serverless Spark as a service☆74Updated last year
- Dockerizing an Apache Spark Standalone Cluster☆43Updated 3 years ago
- Cost Efficient Data Pipelines with DuckDB☆58Updated 5 months ago
- To provide a deeper understanding of how the modern, open-source data stack consisting of Iceberg, dbt, Trino, and Hive operates within a…☆41Updated last year
- Trino dbt demo project to mix and load BigQuery data with and in a local PostgreSQL database☆77Updated 4 years ago
- ☆12Updated 3 years ago
- Docker environment to stream data from Kafka to Iceberg tables☆30Updated last year
- Code snippets for Data Engineering Design Patterns book☆256Updated 7 months ago
- An end-to-end LLM reference implementation providing a Q&A interface for Airflow and Astronomer☆273Updated 3 months ago
- A sample implementation of stream writes to an Iceberg table on GCS using Flink and reading it using Trino☆22Updated 3 years ago
- Delta-Lake, ETL, Spark, Airflow☆48Updated 3 years ago
- ☆104Updated 9 months ago
- The Python fake data producer for Apache Kafka® is a complete demo app allowing you to quickly produce JSON fake streaming datasets and …☆85Updated last year
- Support for generating modern platforms dynamically with services such as Kafka, Spark, Streamsets, HDFS, ....☆77Updated this week
- Streamlit application to explore Snowflake Tables☆46Updated 2 years ago
- An LLM-powered chatbot with the added context of the dbt knowledge base.☆39Updated 11 months ago
- A repository of sample code to show data quality checking best practices using Airflow.☆78Updated 2 years ago
- Data Quality and Observability platform for the whole data lifecycle, from profiling new data sources to full automation with Data Observ…☆171Updated 3 weeks ago
- A portable Datamart and Business Intelligence suite built with Docker, Airflow, dbt, PostgreSQL and Superset☆46Updated 11 months ago
- Using LangChain's SQL Database Chain and Agent with various LLMs to perform Natural Language Queries (NLQ) of an Amazon RDS for PostgreSQ…☆48Updated 2 years ago
- Example for article Running Spark 3 with standalone Hive Metastore 3.0☆102Updated 2 years ago
- Feast AWS guide using Redshift / Spectrum / DynamoDB to build a credit scoring model☆67Updated 4 years ago
- A CLI tool to streamline getting started with Apache Airflow™ and managing multiple Airflow projects☆223Updated 6 months ago
- ☆23Updated 4 years ago
- Generative AI Language (PaLM2 + Langchain) Workshop sample codes☆77Updated last year
- Full stack data engineering tools and infrastructure set-up☆57Updated 4 years ago
- An end-to-end workflow for processing streaming data on Azure.☆16Updated last year
- Create streaming data, transfer it to Kafka, modify it with PySpark, and load it into ElasticSearch and MinIO☆64Updated 2 years ago
- Data pipeline with dbt, Airflow, Great Expectations☆164Updated 4 years ago
- Repo for CDC with debezium blog post☆29Updated last year