IntegriChain1 / s3parq
Parquet file management in S3 for Athena / Spectrum / Presto partitioning
☆22Updated 2 months ago
Alternatives and similar repositories for s3parq:
Users that are interested in s3parq are comparing it to the libraries listed below
- Data Catalog for Databases and Data Warehouses☆31Updated last year
- A template for an AWS Lambda function that triggers Prefect Flow Runs☆20Updated 3 years ago
- Amundsen Gremlin☆20Updated 2 years ago
- DataHub on AWS demonstration resources☆10Updated last year
- Using the Parquet file format with Python☆15Updated last year
- Data Profiler for AWS Glue Data Catalog application as described in the AWS Big Data Blog post "Build an automatic data profiling and rep…☆19Updated 4 years ago
- A tool to learn JSON schema from collection of documents and generate Create table statement for Redshift☆19Updated 3 months ago
- Personal Finance Project to automatically collect swiss banking transaction into a DWH and visualise it☆26Updated 10 months ago
- Convert JSON files to Parquet using PyArrow☆95Updated last year
- CLI for data platform☆19Updated last year
- The sane way of building a data layer in Airflow☆24Updated 5 years ago
- An experimental Athena extension for DuckDB 🐤☆51Updated 2 weeks ago
- Set-oriented Operations in Pandas☆24Updated 4 years ago
- A collection of python utility functions☆12Updated 6 months ago
- A CLI to manage and monitor permissions in AWS Lake Formation☆26Updated last year
- A conda-smithy repository for python-duckdb.☆13Updated 2 months ago
- Example Set up For DBT Cloud using Github Integrations☆11Updated 4 years ago
- 💻 CLI for reporting events to Faros platform☆14Updated 2 months ago
- CLI tool to launch Spark jobs on AWS EMR☆67Updated last year
- a pytest plugin for dbt adapter test suites☆19Updated last year
- Functional Airflow DAG definitions.☆38Updated 7 years ago
- lakeview is a visibility tool for S3 based data lakes☆30Updated last year
- Postgres utility package for dbt (getdbt.com)☆18Updated 3 years ago
- ☆21Updated 4 months ago
- An infrastructure as code approach to deploying Snowflake using Terraform☆24Updated last year
- ☆14Updated 3 years ago
- Utilities for creating ETL pipelines with mara☆36Updated 2 years ago
- Singer.io transformation component between Taps and Targets - PipelineWise compatible☆19Updated 4 months ago
- Automatically loads new partitions in AWS Athena☆18Updated 4 years ago