IntegriChain1 / s3parq
Parquet file management in S3 for Athena / Spectrum / Presto partitioning
☆22 · Updated 4 months ago
Alternatives and similar repositories for s3parq
Users interested in s3parq are comparing it to the libraries listed below.
- A template for an AWS Lambda function that triggers Prefect Flow Runs ☆20 · Updated 3 years ago
- An experimental Athena extension for DuckDB 🐤 ☆54 · Updated 5 months ago
- a pytest plugin for dbt adapter test suites ☆19 · Updated last year
- The sane way of building a data layer in Airflow ☆24 · Updated 5 years ago
- DataHub on AWS demonstration resources ☆10 · Updated 2 years ago
- A serverless duckDB deployment at GCP ☆39 · Updated 2 years ago
- A tool to learn a JSON schema from a collection of documents and generate a CREATE TABLE statement for Redshift ☆21 · Updated 7 months ago
- A collection of python utility functions ☆11 · Updated 11 months ago
- Data Profiler for AWS Glue Data Catalog application as described in the AWS Big Data Blog post "Build an automatic data profiling and rep… ☆20 · Updated 5 years ago
- An infrastructure as code approach to deploying Snowflake using Terraform ☆25 · Updated 2 years ago
- Dask integration for Snowflake ☆30 · Updated 6 months ago
- ⚠️ MAINTENANCE-ONLY MODE: Snowplow maintained SQL data models for working with Snowplow web and mobile behavioral data. ☆41 · Updated 5 months ago
- Faker for Snowflake! ☆33 · Updated 2 years ago
- ☆52 · Updated this week
- The open source version of the Amazon Redshift Cluster Management Guide. ☆48 · Updated last year
- [ARCHIVED] The Presto adapter plugin for dbt Core ☆33 · Updated last year
- Personal Finance Project to automatically collect Swiss banking transactions into a DWH and visualise them ☆26 · Updated last year
- Run, schedule, and manage your dbt jobs using Kubernetes. ☆24 · Updated 6 years ago
- Example Set up For DBT Cloud using Github Integrations ☆11 · Updated 5 years ago
- CLI for data platform ☆19 · Updated last year
- A conda-smithy repository for python-duckdb. ☆13 · Updated last week
- Using the Parquet file format with Python ☆15 · Updated last year
- ☆17 · Updated last month
- Styles for dbt on the net ☆10 · Updated 6 months ago
- ☆36 · Updated 4 months ago
- Amundsen Gremlin ☆21 · Updated 2 years ago
- Sample code to accompany blog post showcasing Arrow Flight SQL running on DuckDB ☆33 · Updated 2 years ago
- This repo contains information about DuckDB extensions found on GitHub. Refreshed daily ☆97 · Updated this week
- Docker image for AWS Glue Spark/Python ☆23 · Updated last year
- BigQuery Schema Conversion Tool ☆23 · Updated 4 years ago