A lightweight, declarative PySpark framework for data quality validation — check columns, rows, and entire datasets directly in your Spark pipelines
☆76 · Jun 4, 2026 · Updated this week
Alternatives and similar repositories for sparkdq
Users that are interested in sparkdq are comparing it to the libraries listed below.
- An R package for easy cohort analysis with event data ☆13 · Oct 29, 2023 · Updated 2 years ago
- R Interface for CrowdTangle Facebook API ☆10 · Oct 27, 2021 · Updated 4 years ago
- ETL jobs for Firefox Telemetry ☆29 · May 7, 2026 · Updated last month
- ☆15 · Aug 28, 2025 · Updated 9 months ago
- ☆13 · May 12, 2026 · Updated 3 weeks ago
- A Framework for Web API Packages ☆16 · May 12, 2026 · Updated 3 weeks ago
- A from scratch Python implementation of Apache Kafka concepts including producers, brokers, topics, consumers, and offset management, bui… ☆23 · Jul 29, 2025 · Updated 10 months ago
- Spark fires is an anti-pattern playground where we deliberately break Spark applications in various ways so you can observe what happens a… ☆42 · Nov 18, 2024 · Updated last year
- Examples of building APIs in R using Plumber ☆24 · Nov 12, 2021 · Updated 4 years ago
- A back-end agnostic spatial data frame inspired by rust trait implementations ☆28 · Jul 10, 2023 · Updated 2 years ago
- Easy to use and open-source unknown stealer ☆22 · Jul 24, 2023 · Updated 2 years ago
- ☆26 · Feb 14, 2025 · Updated last year
- ☆21 · Aug 8, 2024 · Updated last year
- Source code for the MC technical blog post "Data Observability in Practice Using SQL" ☆40 · Jul 17, 2024 · Updated last year
- Extract Load Transform (ELT) framework is a metadata based batch orchestration framework for modern data platforms. Implemented using Azu… ☆49 · Updated this week
- ☆20 · Updated this week
- Template repo for a Data Engineering project ☆31 · Jul 14, 2025 · Updated 10 months ago
- Repo for the open standards for data guidebook ☆29 · Feb 3, 2026 · Updated 4 months ago
- Apache Arrow Flight example ☆11 · Nov 9, 2020 · Updated 5 years ago
- Make working with pandas data and AWS DynamoDB easy ☆21 · May 21, 2026 · Updated 2 weeks ago
- TachyonFX FTL is a browser-based editor and previewer for TachyonFX effects, powered by Ratzilla and Ace Editor ☆27 · Feb 25, 2026 · Updated 3 months ago
- A collection of tools to support the creation and styling of content on {distill} websites ☆28 · Nov 20, 2022 · Updated 3 years ago
- ☆11 · Nov 26, 2024 · Updated last year
- This is a clone of the plugin of the same name present at: https://github.com/robbyrussell/oh-my-zsh ☆25 · Aug 5, 2020 · Updated 5 years ago
- Power Query Custom Data Connector for Power BI REST APIs (Commercial) ☆35 · Jun 2, 2026 · Updated last week
- This JavaScript CLI "undeletes" packages that have been removed from the NPM registry ☆32 · Apr 29, 2026 · Updated last month