ktrueda / parquet-toolsLinks
easy install parquet-tools
☆183Updated last year
Alternatives and similar repositories for parquet-tools
Users that are interested in parquet-tools are comparing it to the libraries listed below
Sorting:
- Command line (CLI) tool to inspect Apache Parquet files on the go☆198Updated 2 years ago
- Write your dbt models using Ibis☆74Updated 9 months ago
- Distributed SQL Engine in Python using Dask☆409Updated last year
- Pylint plugin for static code analysis on Airflow code☆96Updated 5 years ago
- ☆357Updated last week
- ☆70Updated last year
- ☆34Updated 2 years ago
- Schema modelling framework for decentralised domain-driven ownership of data.☆260Updated 2 years ago
- ✨ A Pydantic to PySpark schema library☆116Updated this week
- Enforce Best Practices for all your Airflow DAGs. ⭐☆107Updated last week
- Turning PySpark Into a Universal DataFrame API☆470Updated this week
- Possibly the fastest DataFrame-agnostic quality check library in town.☆233Updated 2 months ago
- Making DAG construction easier☆283Updated 3 weeks ago
- A library that provides useful extensions to Apache Spark and PySpark.☆232Updated 3 weeks ago
- ☆58Updated this week
- Soda SQL and Soda Spark have been deprecated and replaced by Soda Core. docs.soda.io/soda-core/overview.html☆62Updated 3 years ago
- Database connectivity API standard and libraries for Apache Arrow☆526Updated this week
- Fast iterative local development and testing of Apache Airflow workflows☆202Updated 2 weeks ago
- A provider package for DuckDB☆17Updated 2 years ago
- The shared semantic layer definitions that dbt-core and MetricFlow use.☆90Updated last month
- Great Expectations Airflow operator☆169Updated last month
- 🏃♀️ Minimalist SQL orchestrator☆299Updated 2 weeks ago
- Command line tool for inspecting Parquet files☆387Updated last year
- Apache Avro <-> pandas DataFrame☆138Updated 4 months ago
- Proof-of-concept extension combining the delta extension with Unity Catalog☆95Updated 3 weeks ago
- [ARCHIVED] The Presto adapter plugin for dbt Core☆33Updated 2 years ago
- ☆330Updated last month
- Airflow Providers containing Deferrable Operators & Sensors from Astronomer☆149Updated 2 weeks ago
- The Amazon S3 Tables catalog is a client library that bridges control plane operations provided by S3 Tables to engines like Apache Spark…☆145Updated 4 months ago
- ☆63Updated 8 months ago