A framework for systematically quality controlling big data.
☆40Mar 13, 2023Updated 3 years ago
Alternatives and similar repositories for TopNotch
Users that are interested in TopNotch are comparing it to the libraries listed below.
- Simple Spark example of generating table stats for use of data quality checks☆27Apr 28, 2017Updated 9 years ago
- A tool for running Spark on Google Compute Engine☆16Jan 20, 2017Updated 9 years ago
- Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python)☆16Oct 14, 2019Updated 6 years ago
- Scala library for converting Spark rows to case classes☆11Mar 14, 2017Updated 9 years ago
- ☆14Apr 8, 2017Updated 9 years ago
- A dynamic data completeness and accuracy library at enterprise scale for Apache Spark☆30Apr 15, 2026Updated 2 weeks ago
- Library to run an in-process Kafka broker☆16Nov 20, 2018Updated 7 years ago
- The code for the in memory data pipeline that was presented at Berlin Buzzwords 2015.☆10Jun 1, 2015Updated 10 years ago
- Scala, DSL, Rules based reactive workflows and Microservices☆14Oct 20, 2025Updated 6 months ago
- ☆11Nov 29, 2020Updated 5 years ago
- A columnar serializer☆15Feb 26, 2016Updated 10 years ago
- Watching the FISA Court's public docket.☆43Dec 19, 2014Updated 11 years ago
- Code and templates required to build the DARPA open catalog.☆18Mar 23, 2016Updated 10 years ago
- ☆12Nov 6, 2014Updated 11 years ago
- Native Rust implementation of the Kafka protocol and API☆14Jun 13, 2023Updated 2 years ago
- Demonstration of VPC Peering and VPN connections in AWS☆14Aug 8, 2017Updated 8 years ago
- Rust tools for working with CSV files: scrubcsv, catcsv, fixed2csv, geochunk, hashcsv.☆19Jan 17, 2026Updated 3 months ago
- Optimus is a mathematical programming library for Scala.☆150Mar 16, 2026Updated last month
- EJBCA PKI Engine and Backend for HashiCorp Vault. Used to issue, sign, and revoke certificates using the EJBCA CA.☆11Dec 18, 2025Updated 4 months ago
- Intercepts HTTP calls and allows fake implementations to take over entire domains. Used for testing.☆13Oct 20, 2015Updated 10 years ago
- ☆92Nov 15, 2015Updated 10 years ago
- Java API library for BSI TR-03110 cv certificates used for Extended Access Control (EAC)☆12Updated this week
- JMESPath with extended collection of built-in functions☆15Jan 7, 2023Updated 3 years ago
- angular-meteor version of Thinkster.io's mean-stack-tutorial☆10Oct 16, 2017Updated 8 years ago
- A research and review of techniques to provide a natural language interface to RDMS.☆10Dec 8, 2017Updated 8 years ago
- Sangria circe marshalling☆24Updated this week
- Amazon Web Services Bundle Package☆15Jan 12, 2020Updated 6 years ago
- Experiments with scala native & libpcap☆10Mar 30, 2018Updated 8 years ago
- HopsYARN Tensorflow Framework.☆32Oct 22, 2019Updated 6 years ago
- Einstein's Riddle (aka Zebra Puzzle) formulated as a Prolog program.☆14Dec 24, 2018Updated 7 years ago
- A software exploration tool to support developers during their work☆13May 22, 2022Updated 3 years ago
- Code of the book "Getting started with the Julia Programming Language"☆11Jul 7, 2018Updated 7 years ago
- ☆33Mar 12, 2017Updated 9 years ago
- This repository contains the source for a json-json transformation processor for apache NiFi☆12Jun 21, 2015Updated 10 years ago
- Code for the Adzuna Salary Prediction Kaggle competition - http://www.kaggle.com/c/job-salary-prediction Placed 10th out of approximately…☆12Apr 10, 2013Updated 13 years ago
- Modify data records using separately defined modification rules☆11Jun 14, 2024Updated last year
- WSLKit is a generic toolkit for Windows Subsystem for Linux (WSL), with a PowerShell API, and support for VPN-friendly networking kit (VP…☆21Apr 23, 2026Updated last week
- ☆10Apr 6, 2023Updated 3 years ago
- Simplifying robust end-to-end machine learning on Apache Spark.☆475Apr 18, 2017Updated 9 years ago