ktrueda / parquet-tools
easy install parquet-tools
☆176Updated 9 months ago
Alternatives and similar repositories for parquet-tools:
Users who are interested in parquet-tools are comparing it to the repositories listed below
- Command line (CLI) tool to inspect Apache Parquet files on the go☆189Updated last year
- ☆254Updated this week
- Pylint plugin for static code analysis on Airflow code☆93Updated 4 years ago
- Schema modelling framework for decentralised domain-driven ownership of data.☆252Updated last year
- Fake Snowflake Connector for Python. Run, mock and test Snowflake DB locally.☆127Updated 2 weeks ago
- A library that provides useful extensions to Apache Spark and PySpark.☆223Updated last month
- Write your dbt models using Ibis☆64Updated last month
- ☆278Updated 2 weeks ago
- Soda SQL and Soda Spark have been deprecated and replaced by Soda Core. docs.soda.io/soda-core/overview.html☆61Updated 2 years ago
- Distributed SQL Engine in Python using Dask☆401Updated 7 months ago
- The Trino (https://trino.io/) adapter plugin for dbt (https://getdbt.com)☆232Updated 3 weeks ago
- ☆68Updated 3 months ago
- Great Expectations Airflow operator☆162Updated last week
- dbt-prql allows writing PRQL in dbt models☆104Updated 3 weeks ago
- ☆32Updated last year
- Template for DuckDB extensions to help you develop, test and deploy a custom extension☆185Updated this week
- Making DAG construction easier☆261Updated last month
- This is the main repository for SDF documentation found at docs.sdf.com, as well as public schemas, benchmarks, and examples☆118Updated 2 months ago
- A playground for running duckdb as a stateless query engine over a data lake☆197Updated last year
- The athena adapter plugin for dbt (https://getdbt.com)☆140Updated 2 years ago
- Command line tool for inspecting Parquet files☆325Updated 8 months ago
- CLI for DuckDB☆46Updated 2 years ago
- An integration for dbt and fzf that allows interactive selection and search of dbt models.☆71Updated last year
- CLI tool to bulk migrate tables from one catalog to another without a data copy☆77Updated last week
- DuckDB extension for Delta Lake☆176Updated 2 weeks ago
- Enforce Best Practices for all your Airflow DAGs. ⭐☆99Updated this week
- Database connectivity API standard and libraries for Apache Arrow☆432Updated this week
- Work with your web service, database, and streaming schemas in a single format.☆343Updated 2 weeks ago
- A Postgres Proxy Server in Python☆275Updated 4 months ago
- A provider package for DuckDB☆16Updated last year