ubisoft / mobydqLinks
Tool to automate data quality checks on data pipelines
☆255Updated 2 years ago
Alternatives and similar repositories for mobydq
Users that are interested in mobydq are comparing it to the libraries listed below
Sorting:
- Generate and Visualize Data Lineage from query history☆326Updated last year
- Data ingestion library for Amundsen to build graph and search index☆205Updated last year
- Builds Airflow DAGs from configuration files. Powers all DAGs on the Etsy Data Platform☆261Updated last year
- Soda Spark is a PySpark library that helps you with testing your data in Spark Dataframes☆63Updated 2 years ago
- The metrics layer for your data. Join us at https://metriql.com/slack☆307Updated 2 years ago
- Learn how to add data validation and documentation to a data pipeline built with dbt and Airflow.☆168Updated last year
- ☆199Updated last year
- Soda SQL and Soda Spark have been deprecated and replaced by Soda Core. docs.soda.io/soda-core/overview.html☆61Updated 2 years ago
- Astronomer Core Docker Images☆107Updated last year
- Apache Airflow integration for dbt☆404Updated last year
- Great Expectations Airflow operator☆164Updated last week
- Fast iterative local development and testing of Apache Airflow workflows☆201Updated last month
- Schema modelling framework for decentralised domain-driven ownership of data.☆254Updated last year
- Writes the Singer format from Python☆562Updated 2 months ago
- Data Lineage Tracking And Visualization Solution☆626Updated this week
- Create HTML profiling reports from Apache Spark DataFrames☆196Updated 5 years ago
- Viewflow is an Airflow-based framework that allows data scientists to create data models without writing Airflow code.☆126Updated 3 years ago
- Flowman is an ETL framework powered by Apache Spark. With its declarative approach, Flowman simplifies the development of complex data pi…☆94Updated this week
- Metamapper is a data discovery and documentation platform for improving how teams understand and interact with their data.☆79Updated 2 weeks ago
- Data Quality and Observability platform for the whole data lifecycle, from profiling new data sources to full automation with Data Observ…☆148Updated this week
- DataQuality for BigData☆144Updated last year
- pyspark methods to enhance developer productivity 📣 👯 🎉☆672Updated 2 months ago
- Python API for Deequ☆771Updated 2 months ago
- python automatic data quality check toolkit☆283Updated 4 years ago
- A repository of sample code to show data quality checking best practices using Airflow.☆77Updated 2 years ago
- Support for generating modern platforms dynamically with services such as Kafka, Spark, Streamsets, HDFS, ....☆75Updated this week
- Repository of helm charts for deploying DataHub on a Kubernetes cluster☆188Updated this week
- Pylint plugin for static code analysis on Airflow code☆94Updated 4 years ago
- Mapping of DWH database tables to business entities, attributes & metrics in Python, with automatic creation of flattened tables☆74Updated last year
- Making DAG construction easier☆265Updated 2 weeks ago