treeverse / lakeFS-hooks
a simple lakeFS webhook for pre-commit and pre-merge validation of data objects
☆12Updated last year
Alternatives and similar repositories for lakeFS-hooks:
Users that are interested in lakeFS-hooks are comparing it to the libraries listed below
- lakeFS airflow operator☆26Updated last year
- Boto S3 Router provides a Boto3-like client that routes requests between S3 clients according to the bucket and the key in the request.☆18Updated 3 years ago
- lakeview is a visibility tool for S3 based data lakes☆29Updated last year
- Trino connectors for accessing APIs with an OpenAPI spec☆37Updated this week
- An open specification for data products in Data Mesh☆56Updated 5 months ago
- Hadoop/Hive/Spark container to perform CI tests☆11Updated 4 years ago
- A curated list of awesome open source tools and commercial products to catalog, version, and manage data 🚀☆32Updated 3 years ago
- lakefs-samples repository☆79Updated 3 weeks ago
- Sample configuration to deploy a modern data platform.☆88Updated 3 years ago
- Apache Hive Metastore in Standalone Mode With Docker☆12Updated 9 months ago
- GetInData Helm Charts repository☆12Updated 2 years ago
- Snowflake connector repository for the Apache Flink project☆37Updated 2 weeks ago
- Metamapper is a data discovery and documentation platform for improving how teams understand and interact with their data.☆79Updated last week
- 📆 Run, schedule, and manage your dbt jobs using Kubernetes.☆24Updated 6 years ago
- Best practices for data workflows, integrations with the Modern Data Stack (MDS), Infrastructure as Code (IaC), Cloud Provider Services☆25Updated last month
- Complete data engineering pipeline running on Minikube Kubernetes, Argo CD, Spark, Trino, S3, Delta lake, Postgres+ Debezium CDC, MySQL,…☆28Updated 3 months ago
- ☆16Updated 2 years ago
- Discover the simplicity and strength of Duckdb, dbt, and Iceberg in this project. Create an efficient, versatile data analytics solution …☆34Updated last year
- Unity Catalog UI☆40Updated 7 months ago
- This repo contains information about DuckDB extensions found on GitHub. Refreshed daily☆96Updated this week
- dbd is a database prototyping tool that enables data analysts and engineers to quickly load and transform data in SQL databases.☆57Updated 3 years ago
- The bridge to effortless multi-engine data applications, currently supports Snowflake ❄️ and DuckDB 🦆☆178Updated this week
- Example Set up For DBT Cloud using Github Integrations☆11Updated 5 years ago
- CLI for running Airbyte sources & destinations locally without Airbyte server☆32Updated this week
- Flowman is an ETL framework powered by Apache Spark. With its declarative approach, Flowman simplifies the development of complex data pi…☆94Updated this week
- Helm charts☆19Updated last week
- Test data management tool for any data source, batch or real-time. Generate, validate and clean up data all in one tool.☆52Updated 2 months ago
- dbt-core-interface is an MIT licensed high level wrapper for dbt-core that can be used to drive third party integrations such as servers,…☆32Updated last year
- Data Catalog for Databases and Data Warehouses☆34Updated last year
- Performance Observability for Apache Spark☆248Updated 2 weeks ago