brain-research / data-linterLinks
The Data Linter identifies potential issues (lints) in your ML training data.
β88Updated 8 years ago
Alternatives and similar repositories for data-linter
Users that are interested in data-linter are comparing it to the libraries listed below
Sorting:
- FlorDB π»β158Updated 3 months ago
- β98Updated 5 years ago
- Automatically labeling training dataβ107Updated 7 years ago
- A JSON-based schema for storing declarative descriptions of machine learning experimentsβ46Updated 3 weeks ago
- The LinkedIn Fairness Toolkit (LiFT) is a Scala/Spark library that enables the measurement of fairness in large scale machine learning woβ¦β174Updated last month
- β103Updated 2 years ago
- Tabular feature encoding pipelines for machine learning with options for string parsing, missing data infill, and stochastic perturbationβ¦β164Updated 7 months ago
- Practical ideas on securing machine learning modelsβ37Updated 4 years ago
- Scripts for paper "Encoding high-cardinality string categorical variables"β24Updated 6 years ago
- Research code for auditing and exploring black box machine-learning models.β132Updated 2 years ago
- Know your ML Score based on Sculley's paperβ34Updated 6 years ago
- Questions, Help, and Issues for Comet MLβ87Updated 11 months ago
- An easy to use open-source library for advanced Deep Learning and Natural Language Processingβ113Updated last year
- Projects developed by Domino's R&D teamβ77Updated 3 years ago
- this repo might get acceptedβ28Updated 5 years ago
- AutoBazaar: An AutoML System from the Machine Learning Bazaarβ33Updated 4 years ago
- State management framework for Data Science & Analyticsβ19Updated 6 years ago
- β96Updated 5 years ago
- Enso: An Open Source Library for Benchmarking Embeddings + Transfer Learning Methodsβ96Updated 5 years ago
- A library for composing end-to-end tunable machine learning pipelines.β122Updated last year
- NEXT is a machine learning system that runs in the cloud and makes it easy to develop, evaluate, and apply active learning in the real-woβ¦β163Updated last year
- Public repository for versioning machine learning dataβ42Updated 4 years ago
- Presentations & notebooks from our talks /workshops/meetups/etcβ24Updated 7 years ago
- Hypergol is a Data Science/Machine Learning productivity toolkit to accelerate any projects into production with autogenerated code, stanβ¦β53Updated 2 years ago
- Distributed, large-scale, benchmarking framework for rigorous assessment of automatic machine learning repositories, projects, and librarβ¦β30Updated 3 years ago
- Automated machine learning (AutoML) with grammar-based genetic programmingβ54Updated last year
- A machine learning testing framework for sklearn and pandas. The goal is to help folks assess whether things have changed over time.β104Updated 2 weeks ago
- Helpers for constructing scikit-learn grid searchβ39Updated 5 years ago
- Utilities for preprocessing text for deep learning with Kerasβ180Updated 3 years ago
- π― kettle is a CLI tool for creating and deploying cloud functions & docker containers for machine learningβ31Updated 3 years ago