treeverse / lakeviewLinks
lakeview is a visibility tool for S3 based data lakes
☆29Updated last week
Alternatives and similar repositories for lakeview
Users that are interested in lakeview are comparing it to the libraries listed below
Sorting:
- ☆52Updated 2 weeks ago
- IceRunner is an Apache Arrow Flight Server Implementation for Apache Iceberg Tables☆9Updated 3 months ago
- API Framework heavily relying on the power of DuckDB and DuckDB extensions. Ready to build performant and cost-efficient APIs on top of B…☆39Updated last month
- Python package for querying iceberg data through duckdb.☆70Updated last year
- The sane way of building a data layer in Airflow☆24Updated 5 years ago
- Parquet file management in S3 for Athena / Spectrum / Presto partitioning☆22Updated 5 months ago
- PySpark for ETL jobs including lineage to Apache Atlas in one script via code inspection☆18Updated 8 years ago
- Packaging DuckDB for Node.js Lambda functions. Example application: https://github.com/tobilg/serverless-duckdb☆122Updated last week
- An experimental Athena extension for DuckDB 🐤☆54Updated 6 months ago
- Lambda function to serverlessly repartition parquet files in S3☆36Updated 3 months ago
- Python stream processing for analytics☆40Updated 3 weeks ago
- This repo contains information about DuckDB extensions found on GitHub. Refreshed daily☆100Updated this week
- A serverless duckDB deployment at GCP☆40Updated 2 years ago
- Data pipelines from re-usable components☆108Updated 2 years ago
- An example of how to run DuckDB on AWS Lambda & API Gateway.☆158Updated last month
- [ARCHIVED] The Presto adapter plugin for dbt Core☆33Updated last year
- 💻 CLI for reporting events to Faros platform☆14Updated 2 months ago
- Data Catalog for Databases and Data Warehouses☆35Updated last year
- Unity Catalog UI☆41Updated 10 months ago
- A curated list to help you manage temporal data across many modalities 🚀.☆115Updated 2 years ago
- 🚀 GizmoSQL — High-Performance SQL Server for the Cloud☆124Updated this week
- Multi-hop declarative data pipelines☆117Updated this week
- A high-performance data streaming system using DuckDB and Apache Arrow Flight.☆86Updated 5 months ago
- DataHub on AWS demonstration resources☆10Updated 2 years ago
- ☆28Updated 10 months ago
- CLI for running Airbyte sources & destinations locally without Airbyte server☆32Updated 3 weeks ago
- Easily sync your Postgres database to a Snowflake, ClickHouse, or DuckDB warehouse.☆84Updated 8 months ago
- Automatically loads new partitions in AWS Athena☆19Updated 5 years ago
- Serverless multi-protocol + multi-destination event collection system.☆207Updated 8 months ago
- Rewrite BigQuery, Redshift, Snowflake and Databricks queries into DuckDB compatible SQL (with deep transformation of functions, data type…☆56Updated this week