ryandawsonuk / data-platforms-tools
Guide to data platforms and tools
☆32Updated 3 years ago
Alternatives and similar repositories for data-platforms-tools:
Users that are interested in data-platforms-tools are comparing it to the libraries listed below
- Example project using DBT, Databricks and AdventureWorks sample database☆11Updated 2 years ago
- A curated list of data engineering tools for software developers☆10Updated 6 years ago
- NiFi Processor for Apache Pulsar☆10Updated 4 months ago
- CICD pipeline that deploys a dbt image on a GKE cluster☆11Updated 3 years ago
- Curated list of resources about Apache Airflow☆19Updated 3 years ago
- ☆42Updated 4 years ago
- Data Mesh Architecture☆74Updated 8 months ago
- ☆96Updated last year
- Apiary provides modules which can be combined to create a federated cloud data lake☆36Updated last year
- A curated list of awesome Databricks resources, including Spark☆17Updated 9 months ago
- Terraform scripts for deploying Apiary Data Lake☆19Updated 2 weeks ago
- ☆11Updated last year
- Road to Azure Data Engineer Part-II: DP-201 - Designing an Azure Data Solution☆19Updated 4 years ago
- Hadoop/Hive/Spark container to perform CI tests☆11Updated 4 years ago
- This repo contains the LookML for the model and dashboards used with the FHIR healthcare dataset to showcase how Looker can add value to …☆11Updated 2 years ago
- Yet Another (Spark) ETL Framework☆20Updated last year
- Sample code to collect Apache Iceberg metrics for table monitoring☆25Updated 7 months ago
- ☆14Updated last year
- A list about Apache Kafka☆8Updated 4 years ago
- Sample configuration to deploy a modern data platform.☆88Updated 3 years ago
- Snowflake Database, Schema, and Warehouse provisioning with Access Roles & Generating and Provisioning of Functional Roles & Snowflake So…☆42Updated 4 months ago
- Jupyter Notebooks with Snowpark☆15Updated 3 years ago
- A Table format agnostic data sharing framework☆38Updated last year
- A Thinnest Viable Platform (TVP) as described in Team Topologies, using just a Wiki page for a data platform.☆16Updated 4 years ago
- In this repository, we show how to get started with data lineage on AWS using OpenLineage. This is an AWS Cloud Development Kit project (…☆12Updated 8 months ago
- AWS Quick Start Team☆18Updated 6 months ago
- Practical Model-Driven Enterprise Architecture, published by Packt☆37Updated 2 years ago
- 📆 Run, schedule, and manage your dbt jobs using Kubernetes.☆24Updated 6 years ago
- This is a basic Apache Pinot example for ingesting real-time MySQL change logs using Debezium☆27Updated 4 years ago
- A bunch of hacks developed around dbt☆48Updated 5 years ago