A framework for systematically quality controlling big data.
☆41Mar 13, 2023Updated 3 years ago
Alternatives and similar repositories for TopNotch
Users that are interested in TopNotch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This Is Indian Country - Spring 2018 Instance☆12Apr 30, 2018Updated 8 years ago
- Simple Spark example of generating table stats for use of data quality checks☆27Apr 28, 2017Updated 9 years ago
- A tool for running Spark on Google Compute Engine☆16Jan 20, 2017Updated 9 years ago
- A skills challenge for hiring!☆12Dec 21, 2016Updated 9 years ago
- Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python)☆16Oct 14, 2019Updated 6 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Scala library for converting Spark rows to case classes☆11Mar 14, 2017Updated 9 years ago
- A dynamic data completeness and accuracy library at enterprise scale for Apache Spark☆30May 13, 2026Updated last month
- Library to run in process Kafka broker☆16Nov 20, 2018Updated 7 years ago
- A recursive evidence-gated cognitive runtime for memory-native AI agents, combining hybrid retrieval, temporal reasoning, async learning,…☆261Jun 9, 2026Updated 3 weeks ago
- The code for the in memory data pipeline that was presented at Berlin Buzzwords 2015.☆10Jun 1, 2015Updated 11 years ago
- A collection of Apache Parquet add-on modules☆30Jun 14, 2026Updated 2 weeks ago
- An columnar serializer☆15Feb 26, 2016Updated 10 years ago
- We have developed Cordova Plugins for 3 Samsung SDKs SPen, Multiwindow & Rich Notifications and need to release them on Github. And we ar…☆17Feb 10, 2016Updated 10 years ago
- Watching the FISA Court's public docket.☆43Dec 19, 2014Updated 11 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆12Nov 6, 2014Updated 11 years ago
- native Rust implementation of Kafka protocol and api☆14Jun 13, 2023Updated 3 years ago
- Rust tools for working with CSV files: scrubcsv, catcsv, fixed2csv, geochunk, hashcsv.☆19Jan 17, 2026Updated 5 months ago
- Optimus is a mathematical programming library for Scala.☆149Mar 16, 2026Updated 3 months ago
- Public Comment Analysis Project for the Federal Chief Data Officer Council. The Comment Analysis pilot has shown that a toolset leveragin…☆13Sep 17, 2021Updated 4 years ago
- A GameBoy Emulator written in Rust, written as a learning project for both☆10Jun 6, 2023Updated 3 years ago
- Convex optimization for fun and profit.☆11Jan 12, 2022Updated 4 years ago
- ☆30Aug 8, 2015Updated 10 years ago
- Sangria circe marshalling☆24May 25, 2026Updated last month
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Sphincs+ implementation which uses minimal RAM☆14Aug 31, 2023Updated 2 years ago
- Amazon Web Services Bundle Package☆15Jan 12, 2020Updated 6 years ago
- ☆38Jun 1, 2021Updated 5 years ago
- Experiments with scala native & libpcap☆10Mar 30, 2018Updated 8 years ago
- HopsYARN Tensorflow Framework.☆32Oct 22, 2019Updated 6 years ago
- Code of the book "Getting started with the Julia Programming Language"☆11Jul 7, 2018Updated 7 years ago
- ☆10Oct 21, 2021Updated 4 years ago
- Data pipeline automation tool☆28Jan 11, 2024Updated 2 years ago
- A Spark datasource for the HadoopCryptoLedger library☆13Sep 29, 2025Updated 9 months ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- WSLKit is a generic toolkit for Windows Subsystem for Linux (WSL), with a PowerShell API, and support for VPN-friendly networking kit (VP…☆21Apr 23, 2026Updated 2 months ago
- ☆10Apr 6, 2023Updated 3 years ago
- Simplifying robust end-to-end machine learning on Apache Spark.☆473Apr 18, 2017Updated 9 years ago
- Project skeleton for performance tests with Gatling☆19Nov 7, 2021Updated 4 years ago
- XQuery for Scala☆36Oct 29, 2015Updated 10 years ago
- Simple command line application to read/write message to kafka topic using protobuf☆14Mar 27, 2023Updated 3 years ago
- Spark Connector for Hazelcast☆22Jun 9, 2021Updated 5 years ago