MrPowers / pysparktestingexample
PySpark testing example project
☆13Updated 5 months ago
Related projects ⓘ
Alternatives and complementary repositories for pysparktestingexample
- pytest plugin to run the tests with support of pyspark☆85Updated 8 months ago
- A CLI to manage and monitor permissions in AWS Lake Formation☆25Updated last year
- Skeleton project for Apache Airflow training participants to work on.☆16Updated 4 years ago
- Demo for GitHub Universe 2022☆12Updated last year
- Visualize dependencies between Airflow DAGs☆49Updated 3 years ago
- Delta reader for the Ray open-source toolkit for building ML applications☆43Updated 9 months ago
- 🐋 Docker image for AWS Glue Spark/Python☆22Updated last year
- Dask integration for Snowflake☆30Updated last week
- The sane way of building a data layer in Airflow☆24Updated 4 years ago
- Composable filesystem hooks and operators for Apache Airflow.☆17Updated 3 years ago
- ☆20Updated 3 years ago
- This code demonstrates the architecture featured on the AWS Big Data blog (https://aws.amazon.com/blogs/big-data/ ) which creates a concu…☆76Updated 6 years ago
- Quickstart PySpark with Anaconda on AWS/EMR using Terraform☆47Updated 11 months ago
- dbt adapter for Athena☆39Updated 5 months ago
- ☆53Updated last year
- ☆42Updated 3 weeks ago
- Apache (Py)Spark type annotations (stub files).☆115Updated 2 years ago
- ☆18Updated last year
- A VS Code Extension to make it easier to manage and develop Spark jobs on EMR☆29Updated 2 months ago
- Soda Spark is a PySpark library that helps you with testing your data in Spark Dataframes☆63Updated 2 years ago
- Black for Databricks notebooks☆44Updated 3 months ago
- ☆24Updated 4 years ago
- [ARCHIVED] The Presto adapter plugin for dbt Core☆33Updated 11 months ago
- Read Delta tables without any Spark☆47Updated 8 months ago
- Spark runtime on AWS Lambda☆97Updated 2 months ago
- The open source version of the Amazon Redshift Cluster Management Guide.☆48Updated last year
- dbt (data build tool) projects targeting AWS analytics services (redshift, glue, emr, athena) and open table formats☆25Updated last year
- Pandas helper functions☆29Updated last year