ktrueda / parquet-toolsLinks
easy install parquet-tools
☆182Updated last year
Alternatives and similar repositories for parquet-tools
Users that are interested in parquet-tools are comparing it to the libraries listed below
Sorting:
- Command line (CLI) tool to inspect Apache Parquet files on the go☆195Updated last year
- Write your dbt models using Ibis☆70Updated 5 months ago
- Distributed SQL Engine in Python using Dask☆407Updated last year
- Pylint plugin for static code analysis on Airflow code☆95Updated 4 years ago
- ☆70Updated 8 months ago
- ☆309Updated this week
- Python package for executing Malloy☆31Updated 7 months ago
- Schema modelling framework for decentralised domain-driven ownership of data.☆258Updated last year
- ☆53Updated this week
- [ARCHIVED] The Presto adapter plugin for dbt Core☆33Updated last year
- ☆34Updated 2 years ago
- Run, mock and test fake Snowflake databases locally.☆149Updated last week
- Turning PySpark Into a Universal DataFrame API☆426Updated this week
- Python bindings for sqlparser-rs☆193Updated 3 months ago
- ☆22Updated last month
- ☆59Updated 4 months ago
- Fast iterative local development and testing of Apache Airflow workflows☆201Updated 3 weeks ago
- The shared semantic layer definitions that dbt-core and MetricFlow use.☆83Updated this week
- Template for DuckDB extensions to help you develop, test and deploy a custom extension☆218Updated last week
- Dask integration for Snowflake☆30Updated last month
- Soda SQL and Soda Spark have been deprecated and replaced by Soda Core. docs.soda.io/soda-core/overview.html☆62Updated 2 years ago
- A library that provides useful extensions to Apache Spark and PySpark.☆229Updated last month
- Fake Pandas / PySpark DataFrame creator☆48Updated last year
- Enforce Best Practices for all your Airflow DAGs. ⭐☆104Updated last week
- dbt adapter for Athena☆38Updated last year
- ☆155Updated 3 months ago
- Making DAG construction easier☆272Updated last week
- CLI for DuckDB☆45Updated 3 years ago
- Apache Avro <-> pandas DataFrame☆138Updated 2 weeks ago
- A high-performance data streaming system using DuckDB and Apache Arrow Flight.☆87Updated 6 months ago