bikash / DataQualityLinks
Tutorial and examples of Data Quality in Big Data System
☆12Updated 8 years ago
Alternatives and similar repositories for DataQuality
Users that are interested in DataQuality are comparing it to the libraries listed below
Sorting:
- The premier open source Data Quality solution☆644Updated 2 months ago
- Tool to automate data quality checks on data pipelines☆254Updated 3 years ago
- The Taxonomy for ETL Automation Metadata (TEAM) is a tool for design metadata management geared towards data warehouse automation. It is …☆36Updated 7 months ago
- DataQuality for BigData☆144Updated last year
- Data processing and modelling framework for automating tasks (incl. Python & SQL transformations).☆120Updated last week
- Data Quality and Observability platform for the whole data lifecycle, from profiling new data sources to full automation with Data Observ…☆166Updated 3 weeks ago
- Superset Quick Start Guide, published by Packt☆56Updated last year
- Repository for Docker Image of Apache-Superset. [Docker Image: https://hub.docker.com/r/abhioncbr/docker-superset]☆104Updated 4 years ago
- a collection of resources and blogs about Apache Superset☆86Updated 3 years ago
- CubETL - Framework and tool for data ETL (Extract, Transform and Load) in Python (PERSONAL PROJECT / SELDOM MAINTAINED)☆28Updated 3 years ago
- Real-time Data Warehouse with Apache Flink & Apache Kafka & Apache Hudi☆115Updated last year
- MonitoFi: Health & Performance Monitor for your Apache NiFi☆66Updated 2 years ago
- Support for generating modern platforms dynamically with services such as Kafka, Spark, Streamsets, HDFS, ....☆76Updated 2 weeks ago
- OlaPy, an experimental OLAP engine based on Pandas☆109Updated 2 years ago
- Viewflow is an Airflow-based framework that allows data scientists to create data models without writing Airflow code.☆126Updated 4 years ago
- A visual ETL development and debugging tool for big data☆154Updated 2 years ago
- Generate and Visualize Data Lineage from query history☆327Updated 2 years ago
- Apache NiFi example flows☆207Updated 5 years ago
- a set of scripts to pull meta data and data profiling metrics from relational database systems☆77Updated last year
- A visual data pipeline builder with various backends☆104Updated this week
- Flowman is an ETL framework powered by Apache Spark. With its declarative approach, Flowman simplifies the development of complex data pi…☆96Updated last week
- Metadata Driven Development (m3d) is a cloud and platform agnostic framework for the automated creation, management and governance of dat…☆32Updated 2 years ago
- Leveraging Hortonworks' HDP 3.1.0 and HDF 3.4.0 components, this tutorial guides the user through steps to stream data from a REST API in…☆19Updated 6 years ago
- Egeria's Guidance on Governance as well as large media files such as presentations and movies☆106Updated 2 years ago
- Airflow ETL MS SQL Sample Project☆25Updated 7 years ago
- A repository of sample code to show data quality checking best practices using Airflow.☆78Updated 2 years ago
- Big Data ETL and Utilities for Hadoop Map Reduce, Spark and Storm☆103Updated last year
- ☆23Updated 4 years ago
- Knowage is the professional open source suite for modern business analytics over traditional sources and big data systems.☆428Updated this week
- Mapping of DWH database tables to business entities, attributes & metrics in Python, with automatic creation of flattened tables☆74Updated last year