bikash / DataQualityLinks
Tutorial and examples of Data Quality in Big Data System
☆12Updated 8 years ago
Alternatives and similar repositories for DataQuality
Users that are interested in DataQuality are comparing it to the libraries listed below
Sorting:
- The premier open source Data Quality solution☆646Updated last month
- XML/A engine for real-time OLAP analytics☆48Updated 8 years ago
- Tool to automate data quality checks on data pipelines☆257Updated 3 years ago
- MonitoFi: Health & Performance Monitor for your Apache NiFi☆68Updated 2 years ago
- Superset Quick Start Guide, published by Packt☆56Updated last year
- A visual ETL development and debugging tool for big data☆156Updated 3 years ago
- A bridge to Apache Atlas for provenance metadata created in course of using Apache NiFi☆15Updated 3 years ago
- Apache NiFi example flows☆210Updated 6 years ago
- The Taxonomy for ETL Automation Metadata (TEAM) is a tool for design metadata management geared towards data warehouse automation. It is …☆37Updated last year
- DataQuality for BigData☆147Updated 2 years ago
- This is a GitHub for all of my NiFi Templates☆47Updated 5 years ago
- Generate and Visualize Data Lineage from query history☆327Updated 2 years ago
- OlaPy, an experimental OLAP engine based on Pandas☆109Updated 2 years ago
- TinyOlap is a light-weight, in-process, in-memory, multi-dimensional, model-first OLAP engine for planning, budgeting, reporting, analysi…☆51Updated 3 years ago
- Egeria's Guidance on Governance as well as large media files such as presentations and movies☆107Updated 3 years ago
- Big Data ETL and Utilities for Hadoop Map Reduce, Spark and Storm☆104Updated 2 years ago
- Data processing and modelling framework for automating tasks (incl. Python & SQL transformations).☆120Updated 4 months ago
- The nifi of localized support include chinese and japanese .☆31Updated 7 years ago
- Open-source metadata collector based on ODD Specification☆44Updated 2 years ago
- Real-time Data Warehouse with Apache Flink & Apache Kafka & Apache Hudi☆119Updated 2 years ago
- Data Brewery is an ETL (Extract-Transform-Load) program that connect to many data sources (cloud services, databases, ...) and manage dat…☆16Updated 5 years ago
- Pulsar Data Visualization, gets the data from Pulsar Reporting API, builds different charts and displays them in the browser.☆53Updated 10 years ago
- Repository for Docker Image of Apache-Superset. [Docker Image: https://hub.docker.com/r/abhioncbr/docker-superset]☆105Updated 4 years ago
- Data lineage tools in python☆47Updated last year
- A visual data pipeline builder with various backends☆107Updated this week
- CubETL - Framework and tool for data ETL (Extract, Transform and Load) in Python (PERSONAL PROJECT / SELDOM MAINTAINED)☆28Updated 3 years ago
- Soda SQL and Soda Spark have been deprecated and replaced by Soda Core. docs.soda.io/soda-core/overview.html☆62Updated 3 years ago
- Data Quality and Observability platform for the whole data lifecycle, from profiling new data sources to full automation with Data Observ…☆181Updated last month
- Data Lineage Tracing Library☆23Updated 4 years ago
- Example of running MDX on Druid via Mondrian and Calcite☆26Updated 9 years ago