bikash / DataQuality
Tutorial and examples of Data Quality in Big Data System
☆12Updated 7 years ago
Alternatives and similar repositories for DataQuality:
Users that are interested in DataQuality are comparing it to the libraries listed below
- The Taxonomy for ETL Automation Metadata (TEAM) is a tool for design metadata management geared towards data warehouse automation. It is …☆34Updated last week
- Data Quality and Observability platform for the whole data lifecycle, from profiling new data sources to full automation with Data Observ…☆125Updated last week
- CubETL - Framework and tool for data ETL (Extract, Transform and Load) in Python (PERSONAL PROJECT / SELDOM MAINTAINED)☆27Updated 2 years ago
- a set of scripts to pull meta data and data profiling metrics from relational database systems☆75Updated 9 months ago
- Open-source metadata collector based on ODD Specification☆43Updated last year
- MonitoFi: Health & Performance Monitor for your Apache NiFi☆62Updated last year
- Javascript library to talk to multiple OLAP backends from multiple frontends☆18Updated 11 years ago
- ☆10Updated last year
- Metadata Driven Development (m3d) is a cloud and platform agnostic framework for the automated creation, management and governance of dat…☆31Updated last year
- TinyOlap is a light-weight, in-process, in-memory, multi-dimensional, model-first OLAP engine for planning, budgeting, reporting, analysi…☆42Updated 2 years ago
- Egeria's Guidance on Governance as well as large media files such as presentations and movies☆103Updated 2 years ago
- A proof of concept using Divolte, Kafka, Druid and Superset☆61Updated 4 years ago
- The Virtual Data Warehouse is a code generation and template management tool. It is part of the data solution automation ecosystem - the …☆45Updated last week
- Tool to automate data quality checks on data pipelines☆253Updated 2 years ago
- Soda SQL and Soda Spark have been deprecated and replaced by Soda Core. docs.soda.io/soda-core/overview.html☆61Updated 2 years ago
- Leveraging Hortonworks' HDP 3.1.0 and HDF 3.4.0 components, this tutorial guides the user through steps to stream data from a REST API in…☆20Updated 5 years ago
- A library for data warehouse and data integration pattern and architecture documentation.☆49Updated last year
- Data Lineage Tracing Library☆22Updated 3 years ago
- Generic interface exchange format for Data Warehouse Automation and ETL generation.☆38Updated 6 months ago
- Viewflow is an Airflow-based framework that allows data scientists to create data models without writing Airflow code.☆123Updated 3 years ago
- DIRECT, the Data Integration Run-time Execution Control Tool, is a data logistics control framework that can be used to monitor, log, aud…☆25Updated 2 weeks ago
- Drools processor for Apache NiFi☆38Updated 5 years ago
- Source code for 'Pro Spatial with SQL Server 2012' by Alastair Aitchison☆15Updated 7 years ago
- Generate and Visualize Data Lineage from query history☆316Updated last year
- dbt-starrocks contains all of the code enabling dbt to work with StarRocks☆24Updated 3 months ago
- dbd is a database prototyping tool that enables data analysts and engineers to quickly load and transform data in SQL databases.☆57Updated 2 years ago
- Mapping of DWH database tables to business entities, attributes & metrics in Python, with automatic creation of flattened tables☆72Updated last year
- Data Tools Subjective List☆82Updated last year