garystafford / datahub-on-aws-demo
DataHub on AWS demonstration resources
☆10 Updated 2 years ago
Alternatives and similar repositories for datahub-on-aws-demo
Users who are interested in datahub-on-aws-demo are comparing it to the libraries listed below.
- This repository has a collection of utilities for Glue Crawlers. These utilities come in the form of AWS CloudFormation templates or AWS … ☆19 Updated 3 years ago
- ☆11 Updated 6 months ago
- dbt / Amazon Redshift Demonstration Project ☆34 Updated 2 years ago
- dbt package for monitoring airflow DAGs and tasks ☆29 Updated 4 months ago
- 🐋 Docker image for AWS Glue Spark/Python ☆23 Updated last year
- Example Set up For DBT Cloud using Github Integrations ☆11 Updated 5 years ago
- ☆18 Updated last year
- ☆31 Updated last year
- dbt (data build tool) projects targeting AWS analytics services (redshift, glue, emr, athena) and open table formats ☆29 Updated 2 years ago
- Sample code to collect Apache Iceberg metrics for table monitoring ☆28 Updated 10 months ago
- A kind data platform on your local machine. 🤗 ☆10 Updated last week
- This is a real-life, high throughput streaming ELT data pipeline for ecommerce ☆13 Updated 2 years ago
- Showcases the AsyncIO Functionality within Apache Flink for Kinesis Data Analytics ☆10 Updated 5 months ago
- Debussy is an opinionated Data Architecture and Engineering framework, enabling data analysts and engineers to build better platforms and… ☆28 Updated 2 years ago
- Big Data Demystified meetup and blog examples ☆31 Updated 10 months ago
- This repo contains examples of high throughput ingestion using Apache Spark and Apache Iceberg. These examples cover IoT and CDC scenario… ☆26 Updated 7 months ago
- This construct builds some elements for you to quickly launch an EMR Serverless application. After submitting the Emr Serverless job, you… ☆11 Updated this week
- Fully unit tested utility functions for data engineering. Python 3 only. ☆17 Updated 10 months ago
- Sample code for building a Python application for Apache Flink on Kinesis Data Analytics. ☆14 Updated last year
- Streaming ETL job cases in AWS Glue to integrate Iceberg and creating an in-place updatable data lake on Amazon S3 ☆23 Updated 9 months ago
- IceRunner is an Apache Arrow Flight Server Implementation for Apache Iceberg Tables ☆9 Updated 2 months ago
- Skeleton project for Apache Airflow training participants to work on. ☆16 Updated 4 years ago
- ☆12 Updated 10 months ago
- Glue VSCode devcontainer setup ☆14 Updated 2 years ago
- Operational Data Processing Framework developed using AWS Glue and Apache Hudi. This framework is suitable for Data Lake and Modern Data … ☆22 Updated last year
- Using the Parquet file format with Python ☆15 Updated last year
- A VS Code Extension to make it easier to manage and develop Spark jobs on EMR ☆38 Updated 4 months ago
- 📆 Run, schedule, and manage your dbt jobs using Kubernetes. ☆24 Updated 6 years ago
- Chatlytics is a data query and visualization platform for chat! ☆13 Updated 8 years ago
- dlt-dagster-demo ☆11 Updated last year