brain-research / data-linterLinks
The Data Linter identifies potential issues (lints) in your ML training data.
☆89Updated 7 years ago
Alternatives and similar repositories for data-linter
Users that are interested in data-linter are comparing it to the libraries listed below
Sorting:
- The LinkedIn Fairness Toolkit (LiFT) is a Scala/Spark library that enables the measurement of fairness in large scale machine learning wo…☆172Updated 2 years ago
- Tabular feature encoding pipelines for machine learning with options for string parsing, missing data infill, and stochastic perturbation…☆164Updated 4 months ago
- Automatically labeling training data☆107Updated 6 years ago
- FlorDB 🌻☆155Updated last month
- A machine learning testing framework for sklearn and pandas. The goal is to help folks assess whether things have changed over time.☆104Updated 4 years ago
- ☆99Updated 5 years ago
- Distributed, large-scale, benchmarking framework for rigorous assessment of automatic machine learning repositories, projects, and librar…☆30Updated 3 years ago
- this repo might get accepted☆28Updated 4 years ago
- Projects developed by Domino's R&D team☆77Updated 3 years ago
- A JSON-based schema for storing declarative descriptions of machine learning experiments☆45Updated 8 years ago
- Know your ML Score based on Sculley's paper☆34Updated 6 years ago
- Scripts for paper "Encoding high-cardinality string categorical variables"☆24Updated 6 years ago
- ☆103Updated 2 years ago
- A Machine Learning System for Data Enrichment.☆75Updated 7 years ago
- Hypergol is a Data Science/Machine Learning productivity toolkit to accelerate any projects into production with autogenerated code, stan…☆53Updated 2 years ago
- Embed categorical variables via neural networks.☆59Updated 2 years ago
- AutoBazaar: An AutoML System from the Machine Learning Bazaar☆33Updated 4 years ago
- Helpers for constructing scikit-learn grid search☆38Updated 5 years ago
- Primrose modeling framework for simple production models☆33Updated last year
- Willump Is a Low-Latency Useful Machine learning Platform.☆45Updated 2 years ago
- NitroML is a modular, portable, and scalable model-quality benchmarking framework for Machine Learning and Automated Machine Learning (Au…☆43Updated 4 years ago
- Tutorial for a new versioning Machine Learning pipeline☆80Updated 4 years ago
- Automated machine learning (AutoML) with grammar-based genetic programming☆54Updated last year
- ☆22Updated 2 years ago
- 🎯 kettle is a CLI tool for creating and deploying cloud functions & docker containers for machine learning☆32Updated 2 years ago
- Public repository for versioning machine learning data☆42Updated 3 years ago
- Machine Learning for Information Retrieval☆86Updated 5 months ago
- Distribution transparent Machine Learning experiments on Apache Spark☆91Updated last year
- Dummy variable generation with fit/transform capabilities☆23Updated 7 years ago
- ☆96Updated 5 years ago