whylabs / whylogs
An open-source data logging library for machine learning models and data pipelines. π Provides visibility into data quality & model performance over time. π‘οΈ Supports privacy-preserving data collection, ensuring safety & robustness. π
β2,709Updated 3 months ago
Alternatives and similar repositories for whylogs:
Users that are interested in whylogs are comparing it to the libraries listed below
- A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rewβ¦β2,074Updated 3 weeks ago
- Algorithms for outlier, adversarial and drift detectionβ2,346Updated this week
- Protobuf definition for WhyLogs formatβ14Updated 3 years ago
- Deepchecks: Tests for Continuous Validation of ML Models & Data. Deepchecks is a holistic open-source solution for all of your AI & ML vaβ¦β3,781Updated last month
- A collection of WhyLogs examples in various languagesβ48Updated 11 months ago
- The fastest β‘οΈ way to build data pipelines. Develop iteratively, deploy anywhere. βοΈβ3,557Updated 7 months ago
- Evidently is ββan open-source ML and LLM observability framework. Evaluate, test, and monitor any AI-powered system or data pipeline. Froβ¦β6,065Updated this week
- An end-to-end implementation of intent prediction with Metaflow and other cool toolsβ859Updated last year
- Source code of the WhyLabs Platform OSSβ10Updated 3 months ago
- nannyml: post-deployment data science in pythonβ2,055Updated this week
- MLRun is an open source MLOps platform for quickly building and managing continuous ML applications across their lifecycle. MLRun integraβ¦β1,518Updated this week
- Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.ioβ2,067Updated this week
- Data Quality assessment with one line of codeβ438Updated 3 weeks ago
- Monitor the stability of a Pandas or Spark dataframe βοΈβ500Updated 3 months ago
- The Virtual Feature Store. Turn your existing data infrastructure into a feature store.β1,882Updated this week
- πΆ A tool to package, serve, and deploy any ML model on any platform. Archived to be resurrected one dayπ€β720Updated last year
- β704Updated 2 years ago
- Hopsworks - Data-Intensive AI platform with a Feature Storeβ1,221Updated 2 months ago
- Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.β3,406Updated last week
- Always know what to expect from your data.β10,342Updated this week
- Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows, that encode lineage/tracing and metadaβ¦β2,111Updated 2 weeks ago
- Python API for Deequβ765Updated 3 weeks ago
- A scalable general purpose micro-framework for defining dataflows. THIS REPOSITORY HAS BEEN MOVED TO www.github.com/dagworks-inc/hamiltonβ861Updated last year
- Algorithms for explaining machine learning modelsβ2,488Updated 3 weeks ago
- What's in your data? Extract schema, statistics and entities from datasetsβ1,477Updated last month
- Luminaire is a python package that provides ML driven solutions for monitoring time series data.β777Updated last year
- Feature engineering package with sklearn like functionalityβ2,036Updated this week
- The Fuzzy Labs guide to the universe of open source MLOpsβ461Updated 9 months ago
- Synthetic data generation for tabular dataβ2,605Updated this week
- A light-weight, flexible, and expressive statistical data testing libraryβ3,762Updated this week