ryandawsonuk / data-platforms-tools
Guide to data platforms and tools
☆31 Updated 2 years ago
Alternatives and similar repositories for data-platforms-tools:
Users who are interested in data-platforms-tools are comparing it to the repositories listed below.
- Hadoop/Hive/Spark container to perform CI tests ☆11 Updated 4 years ago
- Data Mesh Architecture ☆74 Updated 7 months ago
- Data Profiler for AWS Glue Data Catalog application as described in the AWS Big Data Blog post "Build an automatic data profiling and rep… ☆19 Updated 4 years ago
- ☆42 Updated 4 years ago
- Example project using DBT, Databricks and AdventureWorks sample database ☆11 Updated 2 years ago
- A curated list of awesome Databricks resources, including Spark ☆17 Updated 8 months ago
- Sample code to collect Apache Iceberg metrics for table monitoring ☆24 Updated 6 months ago
- NiFi Processor for Apache Pulsar ☆10 Updated 3 months ago
- dbt / Amazon Redshift Demonstration Project ☆34 Updated 2 years ago
- M3D Engine is a Spark application for the development of scalable data transformations and ingestions in data lakes. ☆18 Updated 3 years ago
- MonitoFi: Health & Performance Monitor for your Apache NiFi ☆62 Updated last year
- Pipeline library for StreamSets Data Collector and Transformer ☆33 Updated 2 years ago
- Useful scripts, utilities, and tools for Snowflake ☆13 Updated 4 years ago
- CICD pipeline that deploys a dbt image on a GKE cluster ☆11 Updated 3 years ago
- A component which takes nifi flow xml file as input and converts it into terraform script for creating/updating a flow on nifi ☆28 Updated 3 years ago
- A VS Code Extension to make it easier to manage and develop Spark jobs on EMR ☆30 Updated 2 weeks ago
- Yet Another (Spark) ETL Framework ☆20 Updated last year
- A curated list of data engineering tools for software developers ☆10 Updated 6 years ago
- DataHub on AWS demonstration resources ☆10 Updated 2 years ago
- Data Mesh Manager (Community Edition) ☆32 Updated last month
- FLaNK AI Weekly covering Apache NiFi, Apache Flink, Apache Kafka, Apache Spark, Apache Iceberg, Apache Ozone, Apache Pulsar, and more... ☆19 Updated this week
- Intended for internal use: deploys all infrastructure required for Astronomer to run on GCP ☆10 Updated 7 months ago
- Herd-UI is a search and discovery tool for business and technical users. Everyone in your organization can use Herd-UI to browse and unde… ☆16 Updated 2 years ago
- ☆13 Updated last year
- ☆15 Updated last year
- Road to Azure Data Engineer Part-II: DP-201 - Designing an Azure Data Solution ☆19 Updated 4 years ago
- Sample configuration to deploy a modern data platform. ☆88 Updated 3 years ago
- ☆38 Updated 9 months ago
- Code Repository for GCP: Complete Google Data Engineer and Cloud Architect Guide(v), Published by Packt ☆16 Updated 2 years ago
- Events about the open source data stack ☆13 Updated 2 years ago