ktrueda / parquet-tools
easy install parquet-tools
☆163 Updated 4 months ago
Related projects
Alternatives and complementary repositories for parquet-tools
- Command line (CLI) tool to inspect Apache Parquet files on the go ☆175 Updated last year
- ☆160 Updated last month
- A library that provides useful extensions to Apache Spark and PySpark. ☆196 Updated 2 weeks ago
- Possibly the fastest DataFrame-agnostic quality check library in town. ☆174 Updated this week
- Schema modelling framework for decentralised domain-driven ownership of data. ☆248 Updated 11 months ago
- Fake Snowflake Connector for Python. Run, mock and test Snowflake DB locally. ☆109 Updated last week
- ☆242 Updated 2 months ago
- Pythonic Iceberg REST Catalog ☆67 Updated 2 months ago
- Great Expectations Airflow operator ☆159 Updated 3 weeks ago
- [ARCHIVED] The Presto adapter plugin for dbt Core ☆33 Updated 11 months ago
- Soda SQL and Soda Spark have been deprecated and replaced by Soda Core. docs.soda.io/soda-core/overview.html ☆60 Updated last year
- ☆67 Updated 2 weeks ago
- Pylint plugin for static code analysis on Airflow code ☆90 Updated 4 years ago
- Write your dbt models using Ibis ☆53 Updated last month
- A tool that makes it easy to run modular Trino environments locally. ☆33 Updated this week
- Fast iterative local development and testing of Apache Airflow workflows ☆193 Updated 5 months ago
- dbt-redshift contains all of the code enabling dbt to work with Amazon Redshift ☆101 Updated this week
- ✨ A Pydantic to PySpark schema library ☆57 Updated this week
- Airflow Backfill UI based plugin for existing / new Airflow environment ☆66 Updated 3 years ago
- Trino (f.k.a PrestoSQL) dialect for SQLAlchemy. ☆25 Updated 2 years ago
- A portable Pythonic Data Catalog API powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture t… ☆166 Updated 2 weeks ago
- DuckDB extension for Delta Lake ☆139 Updated this week
- A purely experimental DuckDB Deltalake extension ☆94 Updated 2 weeks ago
- Pipeline definitions for managing data flows to power analytics at MIT Open Learning ☆37 Updated this week
- Read Delta tables without any Spark ☆47 Updated 8 months ago
- Delta reader for the Ray open-source toolkit for building ML applications ☆43 Updated 9 months ago
- The Trino (https://trino.io/) adapter plugin for dbt (https://getdbt.com) ☆217 Updated this week
- Distributed SQL Engine in Python using Dask ☆397 Updated 2 months ago