amundsen-io / amundsensearchlibraryLinks
Search service library for Amundsen
☆54Updated 2 weeks ago
Alternatives and similar repositories for amundsensearchlibrary
Users that are interested in amundsensearchlibrary are comparing it to the libraries listed below
Sorting:
- Metadata service library for Amundsen☆83Updated 2 weeks ago
- Data ingestion library for Amundsen to build graph and search index☆205Updated last year
- Front-end service library for Amundsen☆280Updated 2 weeks ago
- Builds Airflow DAGs from configuration files. Powers all DAGs on the Etsy Data Platform☆260Updated 2 years ago
- ETLy is an add-on dashboard service on top of Apache Airflow.☆69Updated 2 years ago
- Snowflake Data Source for Apache Spark.☆226Updated last month
- Iceberg is a table format for large, slow-moving tabular data☆481Updated 2 years ago
- Performant Redshift data source for Apache Spark☆142Updated last month
- Fast iterative local development and testing of Apache Airflow workflows☆202Updated 3 months ago
- Airflow declarative DAGs via YAML☆133Updated last year
- DBeam exports SQL tables into Avro files using JDBC and Apache Beam☆195Updated 2 weeks ago
- DEPRECATED. PLEASE USE https://github.com/confluentinc/kafka-connect-bigquery. A Kafka Connect BigQuery sink connector☆151Updated last year
- Collection of open-source Spark tools & frameworks that have made the data engineering and data science teams at Swoop highly productive☆186Updated 2 years ago
- The AWS Glue Data Catalog is a fully managed, Apache Hive Metastore compatible, metadata repository. Customers can use the Data Catalog a…☆222Updated 4 months ago
- Spark package for checking data quality☆221Updated 5 years ago
- Airflow configuration for Telemetry☆192Updated this week
- ☆200Updated last year
- Kinesis Connector for Structured Streaming☆136Updated last year
- Airflow Unit Tests and Integration Tests☆260Updated 2 years ago
- Helm Charts for the Astronomer Platform, Apache Airflow as a Service on Kubernetes☆479Updated this week
- A Spark UI and Spark History Server alternative with CPU and Memory metrics! Delight is free, cross-platform, and open-source.☆344Updated last year
- ☆127Updated 5 years ago
- Airflow support for Marquez☆31Updated 4 years ago
- Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds.☆88Updated last year
- ☆80Updated 3 months ago
- A library that provides useful extensions to Apache Spark and PySpark.☆228Updated 2 weeks ago
- A guide to running Airflow on Kubernetes☆173Updated 6 years ago
- Rokku project. This project acts as a proxy on top of any S3 storage solution providing services like authentication, authorization, shor…☆69Updated 5 months ago
- The Workload Analyzer collects Presto® and Trino workload statistics, and analyzes them☆135Updated last year
- A plugin for Apache Airflow that exposes rest end points for the Command Line Interfaces☆326Updated 4 years ago