A declarative PySpark framework for row- and aggregate-level data quality validation.
☆72Jan 1, 2026Updated 3 months ago
Alternatives and similar repositories for sparkdq
Users that are interested in sparkdq are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- R Interface for CrowdTangle Facebook API☆10Oct 27, 2021Updated 4 years ago
- ETL jobs for Firefox Telemetry☆29Apr 22, 2026Updated last week
- ☆15Aug 28, 2025Updated 8 months ago
- This Power BI project provides insights into customer orders and product tracking using interactive dashboards. It visualizes order statu…☆10Aug 15, 2025Updated 8 months ago
- ☆15Aug 20, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Use Text to SQL to analyze US Government contract data☆23Mar 29, 2025Updated last year
- A from scratch Python implementation of Apache Kafka concepts including producers, brokers, topics, consumers, and offset management, bui…☆23Jul 29, 2025Updated 9 months ago
- A rust implementation of Andrej Karpathy's Micrograd☆15Apr 28, 2025Updated last year
- Spark fires is a anti-pattern playground where we deliberately break Spark applications in various ways so you can observe what happens a…☆42Nov 18, 2024Updated last year
- Python Data Audit☆12Jul 24, 2020Updated 5 years ago
- Automatically convert functions to schemas for LLM function calling.☆21Sep 29, 2024Updated last year
- R package to wrap the Deutsche Bahn Fahrplan API