bikash / DataQualityLinks
Tutorials and examples of data quality in big data systems
☆12 Updated 8 years ago
Alternatives and similar repositories for DataQuality
Users who are interested in DataQuality are comparing it to the libraries listed below
- The premier open source Data Quality solution ☆642 Updated last week
- A visual ETL development and debugging tool for big data ☆154 Updated 3 years ago
- A collection of resources and blogs about Apache Superset ☆89 Updated 4 years ago
- Tool to automate data quality checks on data pipelines ☆256 Updated 3 years ago
- DataQuality for BigData ☆145 Updated last year
- Viewflow is an Airflow-based framework that allows data scientists to create data models without writing Airflow code. ☆125 Updated 4 years ago
- Repository for Docker Image of Apache-Superset. [Docker Image: https://hub.docker.com/r/abhioncbr/docker-superset] ☆105 Updated 4 years ago
- Data processing and modelling framework for automating tasks (incl. Python & SQL transformations). ☆120 Updated 2 months ago
- Data Brewery is an ETL (Extract-Transform-Load) program that connects to many data sources (cloud services, databases, ...) and manages dat… ☆16 Updated 4 years ago
- XML/A engine for real-time OLAP analytics ☆48 Updated 8 years ago
- OlaPy, an experimental OLAP engine based on Pandas ☆108 Updated 2 years ago
- MonitoFi: Health & Performance Monitor for your Apache NiFi ☆68 Updated 2 years ago
- Superset Quick Start Guide, published by Packt ☆56 Updated last year
- Data Quality and Observability platform for the whole data lifecycle, from profiling new data sources to full automation with Data Observ… ☆176 Updated 3 weeks ago
- JavaScript library to talk to multiple OLAP backends from multiple frontends ☆17 Updated 12 years ago
- Support for generating modern platforms dynamically with services such as Kafka, Spark, Streamsets, HDFS, .... ☆77 Updated last week
- A bridge to Apache Atlas for provenance metadata created in the course of using Apache NiFi ☆15 Updated 2 years ago
- Real-time Data Warehouse with Apache Flink & Apache Kafka & Apache Hudi ☆118 Updated last year
- Generate SQL from Graphic Walker visualization DSL ☆13 Updated last year
- TinyOlap is a light-weight, in-process, in-memory, multi-dimensional, model-first OLAP engine for planning, budgeting, reporting, analysi… ☆51 Updated 3 years ago
- Apache NiFi example flows ☆210 Updated 5 years ago
- CubETL - Framework and tool for data ETL (Extract, Transform and Load) in Python (PERSONAL PROJECT / SELDOM MAINTAINED) ☆28 Updated 3 years ago
- Egeria's Guidance on Governance as well as large media files such as presentations and movies ☆106 Updated 3 years ago
- Collection of examples integrating NiFi with stream process frameworks. ☆59 Updated 9 years ago
- A proof of concept using Divolte, Kafka, Druid and Superset ☆61 Updated 5 years ago
- A set of scripts to pull metadata and data profiling metrics from relational database systems ☆77 Updated last year
- Use SQL to build ELT pipelines on a data lakehouse. ☆288 Updated 3 years ago
- Metadata Driven Development (m3d) is a cloud and platform agnostic framework for the automated creation, management and governance of dat… ☆33 Updated 2 years ago
- This is a GitHub repository for all of my NiFi templates ☆47 Updated 5 years ago
- A repository of sample code to show data quality checking best practices using Airflow. ☆78 Updated 2 years ago