pytest plugin to run the tests with support of pyspark
☆88May 21, 2025Updated last year
Alternatives and similar repositories for pytest-spark
Users that are interested in pytest-spark are comparing it to the libraries listed below.
- Soda Spark is a PySpark library that helps you with testing your data in Spark Dataframes☆64Mar 23, 2026Updated 2 months ago
- Demonstration of using Files in Repos with Databricks Delta Live Tables☆36Jul 9, 2024Updated last year
- Example unit tests for Apache Spark Python scripts using the py.test framework☆84Mar 22, 2016Updated 10 years ago
- DBAPI and SQLAlchemy dialect for Databricks Workspace and SQL Analytics clusters☆22Jan 6, 2022Updated 4 years ago
- PySpark test helper methods with beautiful error messages☆769May 20, 2026Updated 2 weeks ago
- Udacity Data Pipeline Exercises☆15Jun 6, 2020Updated 6 years ago
- Apache (Py)Spark type annotations (stub files).☆118Aug 17, 2022Updated 3 years ago
- A collection of “cookbook-style” scripts for simplifying data engineering and machine learning in Apache Spark.☆13Oct 27, 2021Updated 4 years ago
- A platform and cloud-based service for data sharing based on the Delta Sharing protocol.☆21Jun 12, 2024Updated last year
- A pyspark lib to validate data quality☆19Nov 11, 2022Updated 3 years ago
- Apache Spark testing helpers (dependency free & works with Scalatest, uTest, and MUnit)☆456Apr 2, 2026Updated 2 months ago
- pyspark methods to enhance developer productivity 📣 👯 🎉☆687Mar 6, 2025Updated last year
- ✨ A Pydantic to PySpark schema library☆126May 24, 2026Updated 2 weeks ago
- A lightweight data processing framework for Apache Spark☆16Dec 8, 2022Updated 3 years ago
- A content-filtering bypass system developed specifically to allow access to trans-related resources on public networks (libraries, school…☆27Nov 15, 2014Updated 11 years ago
- A Python object protocol for projects to interchange data frame-like data without forcing pandas.DataFrame as the intermediary☆15Apr 9, 2020Updated 6 years ago
- ☆13Jul 23, 2025Updated 10 months ago
- ☆21Aug 26, 2025Updated 9 months ago
- CLI tool to convert a python project's %-formatted strings to f-strings.☆17Oct 18, 2019Updated 6 years ago
- This is a collection of Amazon CDK projects to show how to directly ingest streaming data from Amazon Managed Service for Apache Kafka (M…☆16Sep 10, 2024Updated last year
- Building a Q&A app (powered by an LLM model) using AWS Bedrock, AWS Kendra, AWS S3 and Streamlit in just a couple of hours☆17Dec 7, 2023Updated 2 years ago
- rb_status_plugin : Data confidence tool for Airflow☆12Jan 7, 2023Updated 3 years ago
- This repository has moved into https://github.com/dbt-labs/dbt-adapters☆448Jul 16, 2025Updated 10 months ago
- ☆45Dec 9, 2025Updated 5 months ago
- A three-hour tutorial on property-based testing with https://hypothesis.works☆59Mar 3, 2024Updated 2 years ago
- ☆13Feb 19, 2025Updated last year
- Delta Lake helper methods in PySpark☆329Jan 19, 2026Updated 4 months ago
- The dbt-spark-livy adapter allows you to use dbt along with Apache Spark, by connecting via Apache Livy☆12Mar 30, 2023Updated 3 years ago
- Basic machine learning algorithm implementation☆18Mar 7, 2024Updated 2 years ago
- This repo provides the Kubernetes Helm chart for deploying Pyspark Notebook.☆17Nov 16, 2022Updated 3 years ago
- Building a GPT-4 Q&A app using Azure OpenAI, Pinecone and Streamlit in just a couple of hours☆23Jul 6, 2023Updated 2 years ago
- ☆14Dec 8, 2022Updated 3 years ago
- A sink to save Spark Structured Streaming DataFrame into Hive table☆23May 7, 2018Updated 8 years ago
- Sample code for building a Python application for Apache Flink on Kinesis Data Analytics.☆14Aug 30, 2023Updated 2 years ago
- Sample code for Gen1 to Gen2 migration patterns.☆11May 26, 2021Updated 5 years ago
- Pandas, Polars, Spark, and Snowpark DataFrame comparison for humans and more!☆647Updated this week
- gzip middleware for ASGI applications, extracted from Starlette☆12Apr 9, 2026Updated 2 months ago
- ☆26Mar 4, 2024Updated 2 years ago
- GraphFrames is a package for Apache Spark which provides DataFrame-based Graphs☆1,184May 20, 2026Updated 2 weeks ago