bikash / DataQuality
Tutorial and examples of Data Quality in Big Data System
☆12 Updated 7 years ago
Alternatives and similar repositories for DataQuality:
Users who are interested in DataQuality compare it to the libraries listed below.
- Data Brewery is an ETL (Extract-Transform-Load) program that connects to many data sources (cloud services, databases, ...) and manages dat… ☆16 Updated 4 years ago
- Leveraging Hortonworks' HDP 3.1.0 and HDF 3.4.0 components, this tutorial guides the user through steps to stream data from a REST API in… ☆20 Updated 5 years ago
- ☆12 Updated 2 years ago
- Metadata Driven Development (m3d) is a cloud and platform agnostic framework for the automated creation, management and governance of dat… ☆31 Updated last year
- Open-source metadata collector based on ODD Specification ☆43 Updated last year
- A repository of sample code to show data quality checking best practices using Airflow. ☆74 Updated 2 years ago
- CubETL - Framework and tool for data ETL (Extract, Transform and Load) in Python (PERSONAL PROJECT / SELDOM MAINTAINED) ☆27 Updated 2 years ago
- Viewflow is an Airflow-based framework that allows data scientists to create data models without writing Airflow code. ☆123 Updated 3 years ago
- This repository is to help with the Partner Demonstration of the Apache Atlas project. ☆30 Updated 9 years ago
- Utilities to showcase OpenMetadata ☆25 Updated 3 months ago
- The Taxonomy for ETL Automation Metadata (TEAM) is a tool for design metadata management geared towards data warehouse automation. It is … ☆36 Updated last month
- Apache Spark-based data flow (ETL) framework that supports multiple read and write destinations of different types and also supports multiple… ☆26 Updated 3 years ago
- Tool to automate data quality checks on data pipelines ☆254 Updated 2 years ago
- Data science, machine learning tools on the cloud ☆15 Updated 4 years ago
- MonitoFi: Health & Performance Monitor for your Apache NiFi ☆62 Updated last year
- Examples of using vaex ☆72 Updated 11 months ago
- Repository for Docker Image of Apache-Superset. [Docker Image: https://hub.docker.com/r/abhioncbr/docker-superset] ☆103 Updated 3 years ago
- Apache NiFi Custom Processor Extracting Text From Files with Apache Tika ☆35 Updated last year
- Superset Quick Start Guide, published by Packt ☆56 Updated last year
- Data Quality and Observability platform for the whole data lifecycle, from profiling new data sources to full automation with Data Observ… ☆133 Updated last month
- Data Lineage Tracing Library ☆22 Updated 3 years ago
- Data processing and modelling framework for automating tasks (incl. Python & SQL transformations). ☆123 Updated 9 months ago
- A collection of resources and blogs about Apache Superset ☆82 Updated 3 years ago
- locopy: Loading/Unloading to Redshift and Snowflake using Python. ☆106 Updated this week
- ☆49 Updated 5 years ago
- DataQuality for BigData ☆144 Updated last year
- Real-time Data Warehouse with Apache Flink & Apache Kafka & Apache Hudi ☆112 Updated last year
- Generate and Visualize Data Lineage from query history ☆322 Updated last year
- Apache DataLab (incubating) ☆153 Updated last year
- The Virtual Data Warehouse is a code generation and template management tool. It is part of the data solution automation ecosystem - the … ☆45 Updated this week