mrpowers-io / quinn
pyspark methods to enhance developer productivity π£ π― π
β666Updated 2 weeks ago
Alternatives and similar repositories for quinn:
Users that are interested in quinn are comparing it to the libraries listed below
- PySpark test helper methods with beautiful error messagesβ674Updated 2 weeks ago
- Spark style guideβ258Updated 5 months ago
- Delta Lake helper methods in PySparkβ322Updated 6 months ago
- Python API for Deequβ754Updated 5 months ago
- Essential Spark extensions and helper methods β¨π²β758Updated 5 months ago
- dbt-spark contains all of the code enabling dbt to work with Apache Spark and Databricksβ422Updated last month
- A Python Library to support running data quality rules while the spark job is runningβ‘β180Updated this week
- Apache Airflow integration for dbtβ400Updated 10 months ago
- This is the development repository for sparkMeasure, a tool and library designed for efficient analysis and troubleshooting of Apache Spaβ¦β736Updated last month
- Pythonic Programming Framework to orchestrate jobs in Databricks Workflowβ212Updated 3 weeks ago
- BigQuery data source for Apache Spark: Read data from BigQuery into DataFrames, write DataFrames into BigQuery tables.β390Updated this week
- Apache Spark testing helpers (dependency free & works with Scalatest, uTest, and MUnit)β441Updated last week
- β198Updated last year
- The Lakehouse Engine is a configuration driven Spark framework, written in Python, serving as a scalable and distributed engine for severβ¦β238Updated last month
- A simplified, lightweight ETL Framework based on Apache Sparkβ584Updated last year
- A boilerplate for writing PySpark Jobsβ394Updated last year
- Astro SDK allows rapid and clean development of {Extract, Load, Transform} workflows using Python and SQL, powered by Apache Airflow.β367Updated last week
- Dynamically generate Apache Airflow DAGs from YAML configuration filesβ1,269Updated last week
- β43Updated 3 years ago
- Airflow Unit Tests and Integration Testsβ256Updated 2 years ago
- Testing framework for Databricks notebooksβ296Updated 11 months ago
- A library that provides useful extensions to Apache Spark and PySpark.β220Updated this week
- Great Expectations Airflow operatorβ161Updated this week
- Learn how to add data validation and documentation to a data pipeline built with dbt and Airflow.β167Updated last year
- The easiest way to run Airflow locally, with linting & tests for valid DAGs and Plugins.β247Updated 3 years ago
- Snowflake Data Source for Apache Spark.β222Updated 3 months ago
- Create HTML profiling reports from Apache Spark DataFramesβ195Updated 5 years ago
- A Spark UI and Spark History Server alternative with CPU and Memory metrics! Delight is free, cross-platform, and open-source.β345Updated 9 months ago
- Data ingestion library for Amundsen to build graph and search indexβ205Updated last year
- This is a guide to PySpark code style presenting common situations and the associated best practices based on the most frequent recurringβ¦β1,112Updated 6 months ago