piotr-kalanski/data-quality-monitoring

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/piotr-kalanski/data-quality-monitoring)

piotr-kalanski / data-quality-monitoring

Data Quality Monitoring Tool

☆15

Alternatives and similar repositories for data-quality-monitoring

Users that are interested in data-quality-monitoring are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

bikash / DataQuality
View on GitHub
Tutorial and examples of Data Quality in Big Data System
☆11Apr 25, 2017Updated 9 years ago
Impetus / jumbune
View on GitHub
Jumbune, an open source BigData APM & Data Quality Management Platform for Data Clouds. Enterprise feature offering is available at http:…
☆73Jan 1, 2023Updated 3 years ago
richardswinbank / community
View on GitHub
☆30Apr 6, 2025Updated last year
ebonnal / delta-lake-ui
View on GitHub
[student project] UI to run SQL on Delta Lake tables and visualize the variations of the result among tables versions
☆12Apr 21, 2020Updated 6 years ago
Varal7 / opendata-ratp
View on GitHub
Demo for making use of RATP's real-time API
☆13May 3, 2017Updated 9 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
mchon89 / Google_App_Engine_Demo
View on GitHub
Deploying a simple, customized Flask API in python via Google App Engine
☆13Aug 20, 2017Updated 8 years ago
aravinthsci / Spark_Delta_Lake
View on GitHub
Delta Lake Examples
☆11Apr 24, 2020Updated 6 years ago
Hamza88-coder / Real-Time-Recruitment-System-with-AI-and-Data-Analytics
View on GitHub
Simulation of job offers and CVs with real-time processing, classification, and analytics using Kafka, Ray, Spark, and Databricks. Includ…
☆14Dec 25, 2024Updated last year
randerzander / HiveToPhoenix
View on GitHub
An Apache Spark app for making data movement between Apache Hive and Apache Phoenix/HBase
☆14Mar 23, 2016Updated 10 years ago
hammerlab / spark-util
View on GitHub
low-level helpers for Apache Spark libraries and tests
☆16Dec 29, 2018Updated 7 years ago
milinda / KafkaOnEC2
View on GitHub
Ansible scripts for deploying Kafka on EC2
☆10Oct 7, 2016Updated 9 years ago
lyrixx / ratp
View on GitHub
A little crawler/sdk for retrieve in real time information about transport in Paris
☆18Oct 24, 2015Updated 10 years ago
rohankhudedev / HackerRank
View on GitHub
My HackerRank Solutions : https://www.hackerrank.com/RohanKhude
☆12Jul 13, 2016Updated 10 years ago
univalence / centrifuge
View on GitHub
Data quality tools for Big Data
☆19Oct 10, 2019Updated 6 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
june505 / SpellCorrect
View on GitHub
拼写纠错程序，涉及建词典、索引，动态规划算法，线程池池，线程同步，socket网络编程等
☆14Sep 23, 2015Updated 10 years ago
KeithSSmith / spark-compaction
View on GitHub
File compaction tool that runs on top of the Spark framework.
☆59Apr 17, 2019Updated 7 years ago
hurtn / databricks
View on GitHub
☆12Aug 6, 2020Updated 5 years ago
aws-samples / amazon-deequ-glue
View on GitHub
Automated data quality suggestions and analysis with Deequ on AWS Glue
☆93Dec 29, 2022Updated 3 years ago
mingchuno / aws-wrap
View on GitHub
Asynchronous Scala Clients for Amazon Web Services
☆13Jul 31, 2017Updated 8 years ago
open-data-toronto / framework-data-quality
View on GitHub
☆10Jun 29, 2023Updated 3 years ago
sidaw / sempre-interactive
View on GitHub
Semantic Parser with Execution
☆13Dec 8, 2017Updated 8 years ago
eviltik / evildns
View on GitHub
A (massive) DNS tools (reverse lookup for now)
☆12Jul 6, 2022Updated 4 years ago
peterservice-rnd / robotframework-testrail
View on GitHub
Robot Framework library, listener and pre-run modifier for working with TestRail
☆15Oct 10, 2019Updated 6 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
sidneyocirqueira / azure-synapse-analytics
View on GitHub
☆12Mar 15, 2022Updated 4 years ago
charlesb / CDF-workshop
View on GitHub
Leveraging Hortonworks' HDP 3.1.0 and HDF 3.4.0 components, this tutorial guides the user through steps to stream data from a REST API in…
☆19Aug 16, 2019Updated 6 years ago
legrandlegrand / revj2
View on GitHub
fork from reverse snowflake joins (https://sourceforge.net/projects/revj/)
☆17Jun 13, 2021Updated 5 years ago
ivan-zapreev / Distributed-Translation-Infrastructure
View on GitHub
The distributed statistical machine translation infrastructure consisting of load balancing, text pre/post-processing and translation ser…
☆12Nov 29, 2018Updated 7 years ago
AasTrailblazers / AzureSynapse
View on GitHub
☆15Jan 17, 2022Updated 4 years ago
jpzk / cookiecutter-scala-spark
View on GitHub
A cookiecutter template for Apache Spark applications written in Scala
☆10Jan 11, 2019Updated 7 years ago
darsain / volley
View on GitHub
jQuery plugin for dividing and filtering elements based on their visual position.
☆16Mar 16, 2012Updated 14 years ago
SponsorPay / jaquet
View on GitHub
Spark stream from kafka(json) to s3(parquet)
☆15Nov 8, 2018Updated 7 years ago
frictionlessdata / data-quality-cli
View on GitHub
CLI for creating databases for Data Quality Dashboards.
☆19Oct 26, 2019Updated 6 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
cocoxu / multip
View on GitHub
source code of Multiple-instance Learning Paraphrase (MultiP) Model for Twitter
☆13Jun 10, 2016Updated 10 years ago
alexkaratarakis / gitignore
View on GitHub
A collection of useful .gitignore templates
☆18Aug 29, 2015Updated 10 years ago
buzzfeed-openlab / edgar-monitor
View on GitHub
A module that processes new Edgar filings and sends out notifications
☆14Dec 28, 2015Updated 10 years ago
mgormley / agiga
View on GitHub
Annotated Gigaword Java API and Command Line Tools
☆15Mar 30, 2016Updated 10 years ago
zygmuntz / metric-learning-for-regression
View on GitHub
Applying metric learning to kin8nm
☆16Nov 10, 2014Updated 11 years ago
eostermueller / heapSpank
View on GitHub
Detect memory leaks in minutes without a heap dump.
☆17Apr 7, 2017Updated 9 years ago
Cascading / cascading.samples
View on GitHub
Sample applications using Cascading
☆20Jun 7, 2015Updated 11 years ago