A declarative PySpark framework for row- and aggregate-level data quality validation.
☆73Jan 1, 2026Updated 4 months ago
Alternatives and similar repositories for sparkdq
Users that are interested in sparkdq are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Samples for fabric user data functions☆27Updated this week
- R Interface for CrowdTangle Facebook API☆10Oct 27, 2021Updated 4 years ago
- ETL jobs for Firefox Telemetry☆29May 7, 2026Updated last week
- Run SQL queries on Snowflake from R☆11Oct 20, 2025Updated 7 months ago
- ☆16Nov 27, 2025Updated 5 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A Framework for Web API Packages☆16May 12, 2026Updated last week
- textwrap.dedent with t-string support☆23Dec 15, 2025Updated 5 months ago
- This Power BI project provides insights into customer orders and product tracking using interactive dashboards. It visualizes order statu…☆10Aug 15, 2025Updated 9 months ago
- A simple plugin to insert the correct shebang of the file.☆11Apr 22, 2017Updated 9 years ago
- Source code for the "Scala For Beginners" book. https://leanpub.com/scalaforbeginners/☆14Oct 14, 2019Updated 6 years ago
- Use Text to SQL to analyze US Government contract data☆23Mar 29, 2025Updated last year
- R package: make pixel art interactively in a plot window, get a matrix, make a gif☆24Jun 4, 2024Updated last year
- Spark fires is a anti-pattern playground where we deliberately break Spark applications in various ways so you can observe what happens a…☆42Nov 18, 2024Updated last year
- Simulate slow, resource-constrained machines to reproduce CI failures and hunt flaky tests☆25Dec 6, 2025Updated 5 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Datagenerator for Data Services☆16Sep 29, 2025Updated 7 months ago
- Easy to use and open-source unknown stealer☆22Jul 24, 2023Updated 2 years ago
- protobuf pyspark conversion☆23Jun 7, 2023Updated 2 years ago
- ☆25Feb 14, 2025Updated last year
- Code to demonstrate data engineering metadata & logging best practices☆21Mar 12, 2024Updated 2 years ago
- Course materials for Stat 154, spring 2018, at UC Berkeley☆27Nov 15, 2018Updated 7 years ago
- ☆29Updated this week
- ☆20May 13, 2026Updated last week
- Repo template para projeto de Engenharia de Dados☆31Jul 14, 2025Updated 10 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Code to index Hive tables to Solr and Solr indexes to Hive☆46May 16, 2019Updated 7 years ago
- DE Bench: Can Agents Solve Real-World Data Engineering Problems? Built to test Ardent's AI Data Engineer☆35Dec 7, 2025Updated 5 months ago
- A collection of tools to support the creation and styling of content on {distill} websites☆28Nov 20, 2022Updated 3 years ago
- ☆11Nov 26, 2024Updated last year
- Python package containing root-finding methods written in Cython☆22Oct 13, 2023Updated 2 years ago
- This is a clone of the plugin of the same name present at: https://github.com/robbyrussell/oh-my-zsh☆25Aug 5, 2020Updated 5 years ago
- Python API for Deequ☆820May 9, 2026Updated last week
- Black for Databricks notebooks☆48Jun 10, 2025Updated 11 months ago
- a lightweight, comprehensive solution for managing delta tables built on polars and deltalake☆121Jan 1, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Pager for tabular data and SQL output☆12Mar 29, 2023Updated 3 years ago
- Associated blog post - https://tristanrhodes.com/blog/Adventures-in-Algorithmic-Trading-on-the-Runescape-Grand-Exchange☆10Oct 14, 2024Updated last year
- A declarative, 🐻❄️-native data frame validation library.☆587May 12, 2026Updated last week
- ☆11Mar 7, 2025Updated last year
- Examples for Apache Oozie book☆18May 30, 2016Updated 9 years ago
- Go wrapper around SSH that speaks AWS API☆16Aug 15, 2023Updated 2 years ago
- Tracks your most used directories, based on 'frecency'.☆25Mar 18, 2025Updated last year