treeverse / lakeview
lakeview is a visibility tool for S3-based data lakes
☆30 Updated last year
Related projects
Alternatives and complementary repositories for lakeview
- Boto S3 Router provides a Boto3-like client that routes requests between S3 clients according to the bucket and the key in the request. ☆18 Updated 2 years ago
- Helm charts ☆18 Updated last week
- lakeFS Airflow operator ☆26 Updated last year
- Data Catalog for Databases and Data Warehouses ☆31 Updated 9 months ago
- A CLI to manage and monitor permissions in AWS Lake Formation ☆25 Updated last year
- Delta reader for the Ray open-source toolkit for building ML applications ☆42 Updated 9 months ago
- ☆42 Updated last week
- The sane way of building a data layer in Airflow ☆24 Updated 4 years ago
- Sample code to collect Apache Iceberg metrics for table monitoring ☆19 Updated 2 months ago
- A simple lakeFS webhook for pre-commit and pre-merge validation of data objects ☆12 Updated last year
- Unity Catalog UI ☆39 Updated 2 months ago
- DuckDB Docker image ☆24 Updated last month
- Pythonic Iceberg REST Catalog ☆66 Updated 2 months ago
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake. ☆28 Updated last week
- Packaging DuckDB for Node.js Lambda functions. Example application: https://github.com/tobilg/serverless-duckdb ☆92 Updated last month
- Parquet file management in S3 for Athena / Spectrum / Presto partitioning ☆22 Updated 2 weeks ago
- A pytest plugin for dbt adapter test suites ☆19 Updated last year
- Alto is a versatile data integration tool that allows you to easily run Singer plugins, build and cache PEX files encapsulating those plu… ☆55 Updated last year
- Dashboard for operating Flink jobs and deployments. ☆25 Updated 3 weeks ago
- A command-line interface for packaging, deploying, and running your EMR Serverless Spark jobs ☆38 Updated 6 months ago
- Profiles the data, validates the schema, runs data quality checks, and produces a report ☆20 Updated 5 years ago
- ☆26 Updated last year
- DuckDB for streaming data ☆68 Updated 7 months ago
- PySpark for ETL jobs including lineage to Apache Atlas in one script via code inspection ☆18 Updated 7 years ago
- A write-audit-publish implementation on a data lake without the JVM ☆41 Updated 3 months ago
- A table-type dbt materialization for Snowflake to enable Time Travel ☆20 Updated 7 months ago
- Continuously synchronize directories from remote object store to local filesystem ☆97 Updated this week
- Query Snowflake tables locally with DuckDB, without any need for a running warehouse ☆95 Updated 2 weeks ago
- ☆26 Updated 2 months ago
- A component that takes a NiFi flow XML file as input and converts it into a Terraform script for creating or updating a flow on NiFi ☆27 Updated 2 years ago