amundsen-io / amundsendatabuilder
Data ingestion library for Amundsen to build graph and search index
☆205 Updated last year
Alternatives and similar repositories for amundsendatabuilder
Users who are interested in amundsendatabuilder are comparing it to the libraries listed below.
- Metadata service library for Amundsen ☆83 Updated 2 weeks ago
- Front-end service library for Amundsen ☆280 Updated 2 weeks ago
- Search service library for Amundsen ☆54 Updated 2 weeks ago
- Builds Airflow DAGs from configuration files. Powers all DAGs on the Etsy Data Platform ☆260 Updated 2 years ago
- ☆200 Updated last year
- Snowflake Data Source for Apache Spark. ☆226 Updated last month
- Helm Charts for the Astronomer Platform, Apache Airflow as a Service on Kubernetes ☆479 Updated last week
- Fast iterative local development and testing of Apache Airflow workflows ☆202 Updated 3 months ago
- A plugin for Apache Airflow that exposes rest end points for the Command Line Interfaces ☆326 Updated 4 years ago
- Generate and Visualize Data Lineage from query history ☆325 Updated last year
- Airflow support for Marquez ☆31 Updated 4 years ago
- Airflow Backfill UI based plugin for existing / new Airflow environment ☆65 Updated 4 years ago
- A simplified, lightweight ETL Framework based on Apache Spark ☆589 Updated last year
- Tool to automate data quality checks on data pipelines ☆255 Updated 2 years ago
- A Spark UI and Spark History Server alternative with CPU and Memory metrics! Delight is free, cross-platform, and open-source. ☆344 Updated last year
- Airflow Unit Tests and Integration Tests ☆260 Updated 2 years ago
- Airflow declarative DAGs via YAML ☆133 Updated last year
- DBND is an agile pipeline framework that helps data engineering teams track and orchestrate their data processes. ☆266 Updated 4 months ago
- ETLy is an add-on dashboard service on top of Apache Airflow. ☆69 Updated 2 years ago
- Astronomer Core Docker Images ☆107 Updated last year
- Performant Redshift data source for Apache Spark ☆142 Updated last month
- Data Lineage Tracking And Visualization Solution ☆638 Updated last week
- Create HTML profiling reports from Apache Spark DataFrames ☆196 Updated 5 years ago
- Superglue is a lineage-tracking tool built to help visualize the propagation of data through complex pipelines composed of tables, jobs … ☆158 Updated 2 years ago
- A guide to running Airflow on Kubernetes ☆173 Updated 6 years ago
- Pylint plugin for static code analysis on Airflow code ☆95 Updated 4 years ago
- dbt-spark contains all of the code enabling dbt to work with Apache Spark and Databricks ☆440 Updated 2 weeks ago
- Multiple node presto cluster on docker container ☆124 Updated 3 years ago
- A process that runs in unison with Apache Airflow to control the Scheduler process to ensure High Availability ☆235 Updated 2 years ago
- Spark package for checking data quality ☆221 Updated 5 years ago