amundsen-io / amundsendatabuilder
Data ingestion library for Amundsen to build graph and search index
☆206Updated 8 months ago
Related projects ⓘ
Alternatives and complementary repositories for amundsendatabuilder
- Metadata service library for Amundsen☆83Updated last year
- Search service library for Amundsen☆54Updated 6 months ago
- Front-end service library for Amundsen☆280Updated 5 months ago
- ☆196Updated last year
- Builds Airflow DAGs from configuration files. Powers all DAGs on the Etsy Data Platform☆262Updated last year
- Amundsen library to place common code for Amundsen microservices to share☆9Updated 3 years ago
- Airflow support for Marquez☆32Updated 3 years ago
- Snowflake Data Source for Apache Spark.☆218Updated this week
- Fast iterative local development and testing of Apache Airflow workflows☆193Updated 5 months ago
- Generate and Visualize Data Lineage from query history☆311Updated last year
- Astronomer Core Docker Images☆106Updated 5 months ago
- Tool to automate data quality checks on data pipelines☆249Updated 2 years ago
- Airflow Backfill UI based plugin for existing / new Airflow environment☆66Updated 3 years ago
- Helm Charts for the Astronomer Platform, Apache Airflow as a Service on Kubernetes☆465Updated this week
- Performant Redshift data source for Apache Spark☆136Updated 3 months ago
- [ARCHIVED] The Presto adapter plugin for dbt Core☆33Updated 11 months ago
- A continuous integration tool for Looker and LookML.☆217Updated this week
- A simplified, lightweight ETL Framework based on Apache Spark☆584Updated 9 months ago
- Visualize dependencies between Airflow DAGs☆49Updated 3 years ago
- Export Redshift data and convert to Parquet for use with Redshift Spectrum or other data warehouses.☆116Updated last year
- dbt-spark contains all of the code enabling dbt to work with Apache Spark and Databricks☆405Updated last week
- Great Expectations Airflow operator☆159Updated 3 weeks ago
- The Trino (https://trino.io/) adapter plugin for dbt (https://getdbt.com)☆217Updated this week
- Python API for Deequ☆41Updated 4 years ago
- The AWS Glue Data Catalog is a fully managed, Apache Hive Metastore compatible, metadata repository. Customers can use the Data Catalog a…☆205Updated 6 months ago
- Create HTML profiling reports from Apache Spark DataFrames☆195Updated 4 years ago
- Spark package for checking data quality☆221Updated 4 years ago
- Learn how to add data validation and documentation to a data pipeline built with dbt and Airflow.☆166Updated last year