mrpowers-io / quinnLinks
pyspark methods to enhance developer productivity π£ π― π
β672Updated 2 months ago
Alternatives and similar repositories for quinn
Users that are interested in quinn are comparing it to the libraries listed below
Sorting:
- PySpark test helper methods with beautiful error messagesβ696Updated last month
- Delta Lake helper methods in PySparkβ326Updated 8 months ago
- Spark style guideβ259Updated 8 months ago
- Python API for Deequβ771Updated 2 months ago
- dbt-spark contains all of the code enabling dbt to work with Apache Spark and Databricksβ431Updated 3 months ago
- Apache Airflow integration for dbtβ404Updated last year
- A Python Library to support running data quality rules while the spark job is runningβ‘β188Updated 3 weeks ago
- Essential Spark extensions and helper methods β¨π²β760Updated 7 months ago
- This is the development repository for sparkMeasure, a tool and library designed for efficient analysis and troubleshooting of Apache Spaβ¦β758Updated 3 weeks ago
- A simplified, lightweight ETL Framework based on Apache Sparkβ585Updated last year
- β199Updated last year
- Apache Spark testing helpers (dependency free & works with Scalatest, uTest, and MUnit)β447Updated last week
- Pythonic Programming Framework to orchestrate jobs in Databricks Workflowβ216Updated 3 weeks ago
- Delta Lake examplesβ224Updated 7 months ago
- Astro SDK allows rapid and clean development of {Extract, Load, Transform} workflows using Python and SQL, powered by Apache Airflow.β369Updated last week
- A library that provides useful extensions to Apache Spark and PySpark.β224Updated 2 months ago
- Great Expectations Airflow operatorβ164Updated this week
- Airflow Unit Tests and Integration Testsβ259Updated 2 years ago
- Snowflake Data Source for Apache Spark.β226Updated last month
- Learn how to add data validation and documentation to a data pipeline built with dbt and Airflow.β168Updated last year
- Performant Redshift data source for Apache Sparkβ140Updated last month
- Soda Spark is a PySpark library that helps you with testing your data in Spark Dataframesβ63Updated 2 years ago
- This is a guide to PySpark code style presenting common situations and the associated best practices based on the most frequent recurringβ¦β1,144Updated 8 months ago
- A collection of Airflow operators, hooks, and utilities to elevate dbt to a first-class citizen of Airflow.β199Updated 3 weeks ago
- Airflow Providers containing Deferrable Operators & Sensors from Astronomerβ148Updated this week
- Testing framework for Databricks notebooksβ300Updated last year
- Guides and docs to help you get up and running with Apache Airflow.β806Updated 2 years ago
- The easiest way to run Airflow locally, with linting & tests for valid DAGs and Plugins.β251Updated 3 years ago
- The Lakehouse Engine is a configuration driven Spark framework, written in Python, serving as a scalable and distributed engine for severβ¦β249Updated 3 months ago
- Create HTML profiling reports from Apache Spark DataFramesβ196Updated 5 years ago