chhantyal / parquet-cli
Command line (CLI) tool to inspect Apache Parquet files on the go
☆173Updated last year
Related projects ⓘ
Alternatives and complementary repositories for parquet-cli
- easy install parquet-tools☆162Updated 4 months ago
- Benchmark data warehouses under Fivetran-like conditions☆164Updated last year
- A library that provides useful extensions to Apache Spark and PySpark.☆195Updated this week
- Soda Spark is a PySpark library that helps you with testing your data in Spark Dataframes☆63Updated 2 years ago
- Apache Avro <-> pandas DataFrame☆134Updated 3 months ago
- Airflow Backfill UI based plugin for existing / new Airflow environment☆66Updated 3 years ago
- ☆67Updated this week
- [ARCHIVED] The Presto adapter plugin for dbt Core☆33Updated 10 months ago
- Astronomer Core Docker Images☆106Updated 5 months ago
- pytest plugin to run the tests with support of pyspark☆85Updated 8 months ago
- Fast iterative local development and testing of Apache Airflow workflows☆193Updated 4 months ago
- Performant Redshift data source for Apache Spark☆136Updated 3 months ago
- ☆150Updated 3 weeks ago
- ☆196Updated last year
- Pylint plugin for static code analysis on Airflow code☆90Updated 4 years ago
- A Python client for Apache Livy, enabling use of remote Apache Spark clusters.☆70Updated 2 years ago
- dbt-redshift contains all of the code enabling dbt to work with Amazon Redshift☆98Updated this week
- Fake Snowflake Connector for Python. Run, mock and test Snowflake DB locally.☆108Updated this week
- A Python Library to support running data quality rules while the spark job is running⚡☆162Updated this week
- Flowchart for debugging Spark applications☆101Updated last month
- The Internals of Delta Lake☆182Updated last month
- Soda SQL and Soda Spark have been deprecated and replaced by Soda Core. docs.soda.io/soda-core/overview.html☆61Updated last year
- Snowflake Data Source for Apache Spark.☆217Updated this week
- Great Expectations Airflow operator☆159Updated last week
- Builds Airflow DAGs from configuration files. Powers all DAGs on the Etsy Data Platform☆262Updated last year