IntegriChain1 / s3parqLinks
Parquet file management in S3 for Athena / Spectrum / Presto partitioning
☆22Updated 6 months ago
Alternatives and similar repositories for s3parq
Users that are interested in s3parq are comparing it to the libraries listed below
Sorting:
- A collection of python utility functions☆11Updated last year
- Data Catalog for Databases and Data Warehouses☆35Updated last year
- A template for an AWS Lambda function that triggers Prefect Flow Runs☆20Updated 3 years ago
- A CLI to manage and monitor permissions in AWS Lake Formation☆26Updated 2 years ago
- The open source version of the Amazon Redshift Cluster Management Guide.☆48Updated 2 years ago
- ☆53Updated last week
- Continuously synchronize directories from remote object store to local filesystem☆106Updated 5 months ago
- Amundsen Gremlin☆21Updated 2 years ago
- Build and deploy a serverless data pipeline on AWS with no effort.☆111Updated 2 years ago
- Provide an easy way with Python to protect your data sources by searching its metadata. 🛡️☆17Updated 2 months ago
- Dask integration for Snowflake☆30Updated last week
- Convert JSON files to Parquet using PyArrow☆96Updated last year
- ☆15Updated 4 years ago
- The sane way of building a data layer in Airflow☆24Updated 5 years ago
- 🐋 Docker image for AWS Glue Spark/Python☆23Updated last year
- A small Python module containing quick utility functions for standard ETL processes.☆36Updated last week
- ☆12Updated last year
- The elegance of Airflow + the power of AWS☆50Updated last year
- CLI for data platform☆19Updated last year
- ☆73Updated last year
- Personal Finance Project to automatically collect swiss banking transaction into a DWH and visualise it☆26Updated last year
- a pytest plugin for dbt adapter test suites☆19Updated last year
- Build your feature store with macros right within your dbt repository☆39Updated 2 years ago
- ☆19Updated 5 years ago
- Data processing and modelling framework for automating tasks (incl. Python & SQL transformations).☆120Updated last month
- The best Python package for comparing two dataframes☆11Updated 3 years ago
- Dremio Flight connector. Access Dremio using Arrow flight☆40Updated 4 years ago
- ☆11Updated 8 months ago
- Faker for Snowflake!☆33Updated 2 years ago
- Soda SQL and Soda Spark have been deprecated and replaced by Soda Core. docs.soda.io/soda-core/overview.html☆61Updated 2 years ago