A framework for systematically quality controlling big data.
☆41Mar 13, 2023Updated 3 years ago
Alternatives and similar repositories for TopNotch
Users that are interested in TopNotch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This Is Indian Country - Spring 2018 Instance☆12Apr 30, 2018Updated 8 years ago
- Simple Spark example of generating table stats for use of data quality checks☆27Apr 28, 2017Updated 9 years ago
- A tool for running Spark on Google Compute Engine☆16Jan 20, 2017Updated 9 years ago
- Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python)☆16Oct 14, 2019Updated 6 years ago
- Scala library for converting Spark rows to case classes☆11Mar 14, 2017Updated 9 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A dynamic data completeness and accuracy library at enterprise scale for Apache Spark☆30May 13, 2026Updated 3 weeks ago
- The code for the in memory data pipeline that was presented at Berlin Buzzwords 2015.☆10Jun 1, 2015Updated 11 years ago
- Scala, DSL, Rules based reactive workflows and Microservices☆14Oct 20, 2025Updated 7 months ago
- ☆11Nov 29, 2020Updated 5 years ago
- A collection of Apache Parquet add-on modules☆30May 20, 2026Updated 3 weeks ago
- An columnar serializer☆15Feb 26, 2016Updated 10 years ago
- We have developed Cordova Plugins for 3 Samsung SDKs SPen, Multiwindow & Rich Notifications and need to release them on Github. And we ar…☆17Feb 10, 2016Updated 10 years ago
- Code and templates required to build the DARPA open catalog.☆18Mar 23, 2016Updated 10 years ago
- ☆12Nov 6, 2014Updated 11 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Kafka Connect Vespa sink connector☆17Apr 17, 2025Updated last year
- Sorenson Impact Package☆13Nov 4, 2021Updated 4 years ago
- Rust tools for working with CSV files: scrubcsv, catcsv, fixed2csv, geochunk, hashcsv.☆19Jan 17, 2026Updated 4 months ago
- Optimus is a mathematical programming library for Scala.☆150Mar 16, 2026Updated 2 months ago
- Public Comment Analysis Project for the Federal Chief Data Officer Council. The Comment Analysis pilot has shown that a toolset leveragin…☆13Sep 17, 2021Updated 4 years ago
- EJBCA PKI Engine and Backend for HashiCorp Vault. Used to issue, sign, and revoke certificates using the EJBCA CA.☆12Dec 18, 2025Updated 5 months ago
- A GameBoy Emulator written in Rust, written as a learning project for both☆10Jun 6, 2023Updated 3 years ago
- ☆92Nov 15, 2015Updated 10 years ago
- Java API library for BSI TR-03110 cv certificates used for Extended Access Control (EAC)☆12Apr 27, 2026Updated last month
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- CentCom is a suite of software used for implementing a data warehouse of bans for Space Station 13 from a variety of public sources.☆11Apr 18, 2026Updated last month
- Cis Recommender☆16May 1, 2012Updated 14 years ago
- Scripts used to setup a Spark cluster on EC2☆21Mar 24, 2016Updated 10 years ago
- A versioned database inspired by Git☆16Dec 16, 2017Updated 8 years ago
- ☆30Aug 8, 2015Updated 10 years ago
- Sangria circe marshalling☆24May 25, 2026Updated 2 weeks ago
- Time series analysis with Apache Spark based on Chronix |☆38Mar 15, 2017Updated 9 years ago
- Frontend FIDS service for FlightAware sample apps☆13Jan 6, 2023Updated 3 years ago
- Amazon Web Services Bundle Package☆15Jan 12, 2020Updated 6 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Pyspark Notebook With Docker☆11Aug 18, 2015Updated 10 years ago
- ☆38Jun 1, 2021Updated 5 years ago
- Experiments with scala native & libpcap☆10Mar 30, 2018Updated 8 years ago
- Automation of JupyterHub operations and testing☆14Aug 25, 2022Updated 3 years ago
- HopsYARN Tensorflow Framework.☆32Oct 22, 2019Updated 6 years ago
- ☆33Mar 12, 2017Updated 9 years ago
- Windows Data and Analytics Shared Code - JSON Processing☆15Jun 12, 2023Updated 2 years ago