dwarszawski / amundsen-atlas-types
Atlas custom type definitions
☆16 Updated 3 years ago
Alternatives and similar repositories for amundsen-atlas-types:
Users who are interested in amundsen-atlas-types are comparing it to the repositories listed below.
- Spark-Dashboard is a solution for monitoring Apache Spark jobs. This repository provides the tooling and configuration for deploying an A… ☆120 Updated last week
- A simple Spark-powered ETL framework that just works ☆181 Updated last month
- Waimak is an open-source framework that makes it easier to create complex data flows in Apache Spark. ☆75 Updated 11 months ago
- Spark on Kubernetes infrastructure Helm charts repo ☆198 Updated 2 years ago
- ACID Data Source for Apache Spark based on Hive ACID ☆97 Updated 3 years ago
- DataQuality for BigData ☆144 Updated last year
- ☆63 Updated 5 years ago
- Avro SerDe for Apache Spark structured APIs. ☆233 Updated 8 months ago
- The Workload Analyzer collects Presto® and Trino workload statistics, and analyzes them ☆135 Updated last year
- Data ingestion library for Amundsen to build graph and search index ☆205 Updated last year
- Collection of open-source Spark tools & frameworks that have made the data engineering and data science teams at Swoop highly productive ☆185 Updated 2 years ago
- Smart Automation Tool for building modern Data Lakes and Data Pipelines ☆120 Updated this week
- Schema Registry ☆15 Updated 9 months ago
- Amundsen library to place common code for Amundsen microservices to share ☆9 Updated 3 years ago
- Code and examples of how to write and deploy Apache Spark Plugins. Spark plugins allow running custom code on the executors as they are in… ☆88 Updated 11 months ago
- Snowflake Data Source for Apache Spark. ☆222 Updated 3 months ago
- The iterative broadcast join example code. ☆69 Updated 7 years ago
- Repository of helm charts for deploying DataHub on a Kubernetes cluster ☆178 Updated this week
- Multiple node presto cluster on docker container ☆124 Updated 2 years ago
- Storage connector for Trino ☆106 Updated this week
- Example for article Running Spark 3 with standalone Hive Metastore 3.0 ☆97 Updated 2 years ago
- Lighthouse is a library for data lakes built on top of Apache Spark. It provides high-level APIs in Scala to streamline data pipelines an… ☆61 Updated 6 months ago
- spark on kubernetes ☆105 Updated 2 years ago
- The AWS Glue Data Catalog is a fully managed, Apache Hive Metastore compatible, metadata repository. Customers can use the Data Catalog a… ☆212 Updated last week
- Hive federation service. Enables disparate tables to be concurrently accessed across multiple Hive deployments. ☆280 Updated this week
- ☆198 Updated last year
- Spline agent for Apache Spark ☆191 Updated last week
- An Airflow docker image preconfigured to work well with Spark and Hadoop/EMR ☆174 Updated last year
- The Internals of Spark on Kubernetes ☆70 Updated 2 years ago
- Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds. ☆88 Updated last year