pytest plugin for running tests with PySpark support
☆88 · May 21, 2025 · Updated 10 months ago
Alternatives and similar repositories for pytest-spark
Users that are interested in pytest-spark are comparing it to the libraries listed below.
- Soda Spark is a PySpark library that helps you with testing your data in Spark Dataframes ☆63 · Updated this week
- Geohex v3.2 python implementation. ☆16 · Apr 29, 2021 · Updated 4 years ago
- Example unit tests for Apache Spark Python scripts using the py.test framework ☆84 · Mar 22, 2016 · Updated 10 years ago
- DBAPI and SQLAlchemy dialect for Databricks Workspace and SQL Analytics clusters ☆22 · Jan 6, 2022 · Updated 4 years ago
- PySpark test helper methods with beautiful error messages ☆756 · Updated this week
- control spark-shell from vim ☆11 · Oct 27, 2016 · Updated 9 years ago
- Udacity Data Pipeline Exercises ☆15 · Jun 6, 2020 · Updated 5 years ago
- Apache (Py)Spark type annotations (stub files). ☆118 · Aug 17, 2022 · Updated 3 years ago
- A curated list of awesome tools for Amazon EKS 🌊 ☆14 · May 30, 2020 · Updated 5 years ago
- Point Cloud Registration from Multiple Intel Realsense Frames ☆13 · Dec 19, 2019 · Updated 6 years ago
- A platform and cloud-based service for data sharing based on the Delta Sharing protocol. ☆21 · Jun 12, 2024 · Updated last year
- A pyspark lib to validate data quality ☆18 · Nov 11, 2022 · Updated 3 years ago
- Apache Spark testing helpers (dependency free & works with Scalatest, uTest, and MUnit) ☆455 · Feb 8, 2026 · Updated last month
- ✨ A Pydantic to PySpark schema library ☆121 · Updated this week
- Easily make interactive plots of player-tracking data ☆11 · Sep 20, 2021 · Updated 4 years ago
- Example repositories for my blog posts ☆12 · Jul 22, 2023 · Updated 2 years ago
- This is a collection of Amazon CDK projects to show how to directly ingest streaming data from Amazon Managed Service for Apache Kafka (M… ☆16 · Sep 10, 2024 · Updated last year
- rb_status_plugin : Data confidence tool for Airflow ☆12 · Jan 7, 2023 · Updated 3 years ago
- Building a Q&A app (powered by a LLM model) using AWS Bedrock, AWS Kendra, AWS S3 and Streamlit in just a couple of hours ☆17 · Dec 7, 2023 · Updated 2 years ago
- IPython magics to work with DBT ☆15 · Jul 22, 2022 · Updated 3 years ago
- This repository has moved into https://github.com/dbt-labs/dbt-adapters ☆444 · Jul 16, 2025 · Updated 8 months ago
- Base classes to use when writing tests with Spark ☆1,549 · Mar 23, 2026 · Updated last week
- ☆14 · Jun 13, 2024 · Updated last year
- ☆13 · Feb 19, 2025 · Updated last year
- Code for H. Narasimhan, "Learning with Complex Loss Functions and Constraints", AISTATS 2018 ☆11 · Mar 21, 2018 · Updated 8 years ago
- The dbt-spark-livy adapter allows you to use dbt along with Apache Spark, by connecting via Apache Livy ☆12 · Mar 30, 2023 · Updated 3 years ago
- Helm Chart for lyft/flinkk8soperator ☆11 · Mar 10, 2020 · Updated 6 years ago
- ☆21 · Nov 11, 2023 · Updated 2 years ago
- Basic machine learning algorithm implementation ☆18 · Mar 7, 2024 · Updated 2 years ago
- This application "listens" for a ticket creation event from Zendesk, analyses the ticket for negative sentiment, tags the ticket accordin… ☆14 · Mar 10, 2025 · Updated last year
- ☆14 · Jul 5, 2022 · Updated 3 years ago
- #DataPipeLine #ETL - A Facebook data extraction utility to extract the publicly available data on Facebook. Used Facebook Grap… ☆13 · Jun 27, 2018 · Updated 7 years ago
- A Helm Chart for Apache Airflow ☆14 · Oct 17, 2018 · Updated 7 years ago
- auto set cookie domain like google analytics ☆11 · Sep 9, 2023 · Updated 2 years ago
- A sink to save Spark Structured Streaming DataFrame into Hive table ☆23 · May 7, 2018 · Updated 7 years ago
- Pandas, Polars, Spark, and Snowpark DataFrame comparison for humans and more! ☆638 · Mar 13, 2026 · Updated 2 weeks ago
- Sample code for building a Python application for Apache Flink on Kinesis Data Analytics. ☆14 · Aug 30, 2023 · Updated 2 years ago
- Automated testing and deployment of a simple Flask-based (RESTful) micro-service to a production-like environment on AWS, using Docker co… ☆43 · Feb 2, 2023 · Updated 3 years ago
- Data Analysis of Bicycle Manufacturing Company Using Python, SQL and Power BI ☆13 · Apr 14, 2023 · Updated 2 years ago