A framework for systematically quality controlling big data.
☆40Mar 13, 2023Updated 3 years ago
Alternatives and similar repositories for TopNotch
Users that are interested in TopNotch are comparing it to the libraries listed below.
- Simple Spark example of generating table stats for use of data quality checks☆27Apr 28, 2017Updated 9 years ago
- A tool for running Spark on Google Compute Engine☆16Jan 20, 2017Updated 9 years ago
- A skills challenge for hiring!☆12Dec 21, 2016Updated 9 years ago
- Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python)☆16Oct 14, 2019Updated 6 years ago
- Scala library for converting Spark rows to case classes☆11Mar 14, 2017Updated 9 years ago
- ☆14Apr 8, 2017Updated 9 years ago
- A dynamic data completeness and accuracy library at enterprise scale for Apache Spark☆30May 13, 2026Updated last week
- Library to run in process Kafka broker☆16Nov 20, 2018Updated 7 years ago
- The code for the in memory data pipeline that was presented at Berlin Buzzwords 2015.☆10Jun 1, 2015Updated 10 years ago
- Scala, DSL, Rules based reactive workflows and Microservices☆14Oct 20, 2025Updated 7 months ago
- ☆11Nov 29, 2020Updated 5 years ago
- A collection of Apache Parquet add-on modules☆30May 3, 2026Updated 2 weeks ago
- A columnar serializer☆15Feb 26, 2016Updated 10 years ago
- 🦆 Blazing fast and highly customizable GitHub Action to set up a DuckDB runtime☆13May 12, 2026Updated last week
- native Rust implementation of Kafka protocol and api☆14Jun 13, 2023Updated 2 years ago
- Sorenson Impact Package☆13Nov 4, 2021Updated 4 years ago
- Optimus is a mathematical programming library for Scala.☆150Mar 16, 2026Updated 2 months ago
- ☆92Nov 15, 2015Updated 10 years ago
- A versioned database inspired by Git☆16Dec 16, 2017Updated 8 years ago
- Convex optimization for fun and profit.☆11Jan 12, 2022Updated 4 years ago
- A research and review of techniques to provide a natural language interface to RDMS.☆10Dec 8, 2017Updated 8 years ago
- Sangria circe marshalling☆24Apr 25, 2026Updated 3 weeks ago
- Time series analysis with Apache Spark based on Chronix☆38Mar 15, 2017Updated 9 years ago
- Sphincs+ implementation which uses minimal RAM☆14Aug 31, 2023Updated 2 years ago
- Amazon Web Services Bundle Package☆15Jan 12, 2020Updated 6 years ago
- Pyspark Notebook With Docker☆11Aug 18, 2015Updated 10 years ago
- ☆38Jun 1, 2021Updated 4 years ago
- Experiments with scala native & libpcap☆10Mar 30, 2018Updated 8 years ago
- HopsYARN Tensorflow Framework.☆32Oct 22, 2019Updated 6 years ago
- ☆33Mar 12, 2017Updated 9 years ago
- This repository contains the source for a json-json transformation processor for apache NiFi☆12Jun 21, 2015Updated 10 years ago
- Code for the Adzuna Salary Prediction Kaggle competition - http://www.kaggle.com/c/job-salary-prediction Placed 10th out of approximately…☆12Apr 10, 2013Updated 13 years ago
- Data pipeline automation tool☆28Jan 11, 2024Updated 2 years ago
- Simplifying robust end-to-end machine learning on Apache Spark.☆473Apr 18, 2017Updated 9 years ago
- ☆22Jun 10, 2018Updated 7 years ago
- Spark Connector for Hazelcast☆22Jun 9, 2021Updated 4 years ago
- Apache NiFi WebSocket Listener☆10Oct 18, 2015Updated 10 years ago
- Order Book Imbalance trading strategy☆11Nov 21, 2022Updated 3 years ago
- Distributed t-SNE via Apache Spark☆159Dec 9, 2017Updated 8 years ago