ryandawsonuk / data-platforms-toolsLinks
Guide to data platforms and tools
☆32Updated 3 years ago
Alternatives and similar repositories for data-platforms-tools
Users that are interested in data-platforms-tools are comparing it to the libraries listed below
Sorting:
- Receipes of publicly-available Jupyter images☆8Updated 3 months ago
- Sample code to collect Apache Iceberg metrics for table monitoring☆28Updated 10 months ago
- Terraform scripts for deploying Apiary Data Lake☆19Updated last week
- ☆11Updated last year
- Example project using DBT, Databricks and AdventureWorks sample database☆12Updated 2 years ago
- Hadoop/Hive/Spark container to perform CI tests☆11Updated 4 years ago
- AWS Quick Start Team☆19Updated 8 months ago
- Herd-UI is a search and discovery tool for business and technical users. Everyone in your organization can use Herd-UI to browse and unde…☆16Updated 2 years ago
- DataHub on AWS demonstration resources☆10Updated 2 years ago
- Data Profiler for AWS Glue Data Catalog application as described in the AWS Big Data Blog post "Build an automatic data profiling and rep…☆20Updated 5 years ago
- ☆14Updated 4 years ago
- Events about the open source data stack☆13Updated 3 years ago
- Yet Another (Spark) ETL Framework☆21Updated last year
- A CLI to manage and monitor permissions in AWS Lake Formation☆26Updated 2 years ago
- Operational Data Processing Framework developed using AWS Glue and Apache Hudi. This framework is suitable for Data Lake and Modern Data …☆22Updated last year
- Road to Azure Data Engineer Part-II: DP-201 - Designing an Azure Data Solution☆19Updated 4 years ago
- ☆18Updated last year
- Profiles the data, validates the schema and runs data quality checks and produces a report☆20Updated 6 years ago
- A curated list of awesome Databricks resources, including Spark☆20Updated last year
- NiFi Processor for Apache Pulsar☆10Updated 7 months ago
- A Thinnest Viable Platform (TVP) as described in Team Topologies, using just a Wiki page for a data platform.☆16Updated 4 years ago
- ⚠️ MAINTENANCE-ONLY MODE: Snowplow maintained SQL data models for working with Snowplow web and mobile behavioral data.☆42Updated 5 months ago
- ☆90Updated 5 months ago
- Data Mesh Architecture☆79Updated 11 months ago
- ☆96Updated last year
- A component which takes nifi flow xml file as input and converts it into terraform script for creating/updating a flow on nifi☆28Updated 3 years ago
- FLaNK AI Weekly covering Apache NiFi, Apache Flink, Apache Kafka, Apache Spark, Apache Iceberg, Apache Ozone, Apache Pulsar, and more...☆21Updated this week
- Curated list of resources about Apache Airflow☆19Updated 4 years ago
- Distributed Data Mesh 2.0 | DataMesh-as-a-Code on Cloud | Theory to Industrialization☆38Updated 2 years ago
- Script to retrieve the list of AWS Services and their one-line descriptions☆38Updated 4 years ago