ktrueda / parquet-tools
easy install parquet-tools
☆183 Updated last year
Alternatives and similar repositories for parquet-tools
Users who are interested in parquet-tools are comparing it to the libraries listed below.
- Command line (CLI) tool to inspect Apache Parquet files on the go ☆198 Updated 2 years ago
- Write your dbt models using Ibis ☆74 Updated 10 months ago
- Pylint plugin for static code analysis on Airflow code ☆97 Updated 5 years ago
- Distributed SQL Engine in Python using Dask ☆409 Updated last year
- ☆372 Updated last week
- dbc is a command-line tool for installing and managing ADBC drivers ☆79 Updated this week
- Turning PySpark Into a Universal DataFrame API ☆477 Updated last week
- ☆34 Updated 2 years ago
- Enforce Best Practices for all your Airflow DAGs. ⭐ ☆108 Updated this week
- ☆64 Updated 8 months ago
- ☆70 Updated last year
- ☆58 Updated 3 weeks ago
- Schema modelling framework for decentralised domain-driven ownership of data. ☆261 Updated 2 years ago
- [ARCHIVED] The Presto adapter plugin for dbt Core ☆32 Updated 2 years ago
- This is the main repository for SDF documentation found at docs.sdf.com, as well as public schemas, benchmarks, and examples ☆126 Updated 11 months ago
- Run, mock and test fake Snowflake databases locally. ☆164 Updated 3 weeks ago
- A provider package for DuckDB ☆17 Updated 2 years ago
- dbt-prql allows writing PRQL in dbt models ☆107 Updated last week
- 🏃‍♀️ Minimalist SQL orchestrator ☆302 Updated this week
- A purely experimental DuckDB Deltalake extension ☆95 Updated this week
- The shared semantic layer definitions that dbt-core and MetricFlow use. ☆91 Updated last week
- A library that provides useful extensions to Apache Spark and PySpark. ☆232 Updated last week
- Integrates DuckDB with Google BigQuery, allowing direct querying and management of BigQuery datasets ☆150 Updated 2 weeks ago
- Python bindings for sqlparser-rs ☆200 Updated 8 months ago
- Proof-of-concept extension combining the delta extension with Unity Catalog ☆96 Updated this week
- ☆92 Updated last year
- Work with your web service, database, and streaming schemas in a single format. ☆350 Updated last month
- Command line tool for inspecting Parquet files ☆393 Updated last year
- This repo contains information about DuckDB extensions found on GitHub. Refreshed daily ☆107 Updated this week
- Possibly the fastest DataFrame-agnostic quality check library in town. ☆234 Updated 3 months ago