brain-research / data-linterLinks
The Data Linter identifies potential issues (lints) in your ML training data.
β88Updated 7 years ago
Alternatives and similar repositories for data-linter
Users that are interested in data-linter are comparing it to the libraries listed below
Sorting:
- Flow with FlorDB π»β154Updated 2 months ago
- Automatically labeling training dataβ107Updated 6 years ago
- β98Updated 5 years ago
- Tabular feature encoding pipelines for machine learning with options for string parsing, missing data infill, and stochastic perturbationβ¦β165Updated last month
- Enso: An Open Source Library for Benchmarking Embeddings + Transfer Learning Methodsβ95Updated 4 years ago
- A machine learning testing framework for sklearn and pandas. The goal is to help folks assess whether things have changed over time.β102Updated 3 years ago
- ALMa (Active Learning Manager) Keeps track of labeled and unlabeled data for active learningβ41Updated 5 years ago
- Scripts for paper "Encoding high-cardinality string categorical variables"β24Updated 5 years ago
- A Machine Learning System for Data Enrichment.β75Updated 6 years ago
- Questions, Help, and Issues for Comet MLβ86Updated 5 months ago
- A JSON-based schema for storing declarative descriptions of machine learning experimentsβ45Updated 8 years ago
- AutoBazaar: An AutoML System from the Machine Learning Bazaarβ33Updated 4 years ago
- NEXT is a machine learning system that runs in the cloud and makes it easy to develop, evaluate, and apply active learning in the real-woβ¦β163Updated last year
- Projects developed by Domino's R&D teamβ78Updated 3 years ago
- Utilities for preprocessing text for deep learning with Kerasβ180Updated 2 years ago
- Hypergol is a Data Science/Machine Learning productivity toolkit to accelerate any projects into production with autogenerated code, stanβ¦β53Updated 2 years ago
- An easy to use open-source library for advanced Deep Learning and Natural Language Processingβ112Updated last year
- β89Updated 7 years ago
- Example using Polyaxon to experiment with pre-training spaCyβ65Updated 3 years ago
- Presentations & notebooks from our talks /workshops/meetups/etcβ24Updated 7 years ago
- Research code for auditing and exploring black box machine-learning models.β132Updated 2 years ago
- The LinkedIn Fairness Toolkit (LiFT) is a Scala/Spark library that enables the measurement of fairness in large scale machine learning woβ¦β172Updated 2 years ago
- Experiment tracking for machine and deep learning projectsβ128Updated last year
- Primrose modeling framework for simple production modelsβ32Updated last year
- Distributed, large-scale, benchmarking framework for rigorous assessment of automatic machine learning repositories, projects, and librarβ¦β30Updated 3 years ago
- The deepr module provide abstractions (layers, readers, prepro, metrics, config) to help build tensorflow models on top of tf estimatorsβ53Updated last year
- β96Updated 5 years ago
- Simplified tree-based classifier and regressor for interpretable machine learning (scikit-learn compatible)β46Updated 4 years ago
- Willump Is a Low-Latency Useful Machine learning Platform.β44Updated 2 years ago
- Distribution transparent Machine Learning experiments on Apache Sparkβ91Updated last year