A framework for systematically quality controlling big data.
☆40Mar 13, 2023Updated 3 years ago
Alternatives and similar repositories for TopNotch
Users that are interested in TopNotch are comparing it to the libraries listed below
Sorting:
- Simple Spark example of generating table stats for use of data quality checks☆28Apr 28, 2017Updated 8 years ago
- A tool for running Spark on Google Compute Engine☆16Jan 20, 2017Updated 9 years ago
- A dynamic data completeness and accuracy library at enterprise scale for Apache Spark☆29Nov 4, 2024Updated last year
- Library to run in process Kafka broker☆16Nov 20, 2018Updated 7 years ago
- The code for the in memory data pipeline that was presented at Berlin Buzzwords 2015.☆10Jun 1, 2015Updated 10 years ago
- ☆11Nov 29, 2020Updated 5 years ago
- A collection of Apache Parquet add-on modules☆30Mar 3, 2026Updated 2 weeks ago
- An columnar serializer☆15Feb 26, 2016Updated 10 years ago
- Watching the FISA Court's public docket.☆43Dec 19, 2014Updated 11 years ago
- Code and templates required to build the DARPA open catalog.☆18Mar 23, 2016Updated 9 years ago
- i2dash: interactive and iterative dashboards.☆10Sep 5, 2023Updated 2 years ago
- Sorenson Impact Package☆13Nov 4, 2021Updated 4 years ago
- Optimus is a mathematical programming library for Scala.☆149Feb 1, 2026Updated last month
- EJBCA PKI Engine and Backend for HashiCorp Vault. Used to issue, sign, and revoke certificates using the EJBCA CA.☆11Dec 18, 2025Updated 3 months ago
- Intercepts HTTP calls and allows fake implementations to take over entire domains. Used for testing.☆13Oct 20, 2015Updated 10 years ago
- A GameBoy Emulator written in Rust, written as a learning project for both☆10Jun 6, 2023Updated 2 years ago
- ☆92Nov 15, 2015Updated 10 years ago
- JMESPath with extended collection of built-in functions☆15Jan 7, 2023Updated 3 years ago
- A versioned database inspired by Git☆16Dec 16, 2017Updated 8 years ago
- Convex optimization for fun and profit.☆11Jan 12, 2022Updated 4 years ago
- angular-meteor version of Thinkster.io's mean-stack-tutorial☆10Oct 16, 2017Updated 8 years ago
- A research and review of techniques to provide a natural language interface to RDMS.☆10Dec 8, 2017Updated 8 years ago
- Sangria circe marshalling☆24Updated this week
- ☆18May 4, 2023Updated 2 years ago
- Amazon Web Services Bundle Package☆15Jan 12, 2020Updated 6 years ago
- Pyspark Notebook With Docker☆11Aug 18, 2015Updated 10 years ago
- ☆38Jun 1, 2021Updated 4 years ago
- Experiments with scala native & libpcap☆10Mar 30, 2018Updated 7 years ago
- Data pipeline automation tool☆27Jan 11, 2024Updated 2 years ago
- HopsYARN Tensorflow Framework.☆31Oct 22, 2019Updated 6 years ago
- ☆11Sep 1, 2020Updated 5 years ago
- Code of the book "Getting started with the Julia Programming Language"☆11Jul 7, 2018Updated 7 years ago
- This repository contains the source for a json-json transformation processor for apache NiFi☆12Jun 21, 2015Updated 10 years ago
- Code for the Adzuna Salary Prediction Kaggle competition - http://www.kaggle.com/c/job-salary-prediction Placed 10th out of approximately…☆11Apr 10, 2013Updated 12 years ago
- Windows Data and Analytics Shared Code - JSON Processing☆15Jun 12, 2023Updated 2 years ago
- Modify data records using separately defined modification rules