whylabs / whylogs-protoLinks
Protobuf definition for WhyLogs format
☆14Updated 4 years ago
Alternatives and similar repositories for whylogs-proto
Users that are interested in whylogs-proto are comparing it to the libraries listed below
Sorting:
- Profile and monitor your ML data pipeline end-to-end☆177Updated 4 years ago
- A collection of WhyLogs examples in various languages☆49Updated last year
- An open-source data logging library for machine learning models and data pipelines. 📚 Provides visibility into data quality & model perf…☆2,783Updated 11 months ago
- ☆10Updated 8 years ago
- Python API for Deequ☆806Updated 8 months ago
- Python API for Deequ☆41Updated 5 years ago
- The core cli implementation☆19Updated 2 months ago
- Synthetic data generators for structured and unstructured text, featuring differentially private learning.☆669Updated 5 months ago
- Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io☆2,264Updated this week
- Pandas, Polars, Spark, and Snowpark DataFrame comparison for humans and more!☆620Updated last week
- PySpark test helper methods with beautiful error messages☆739Updated 2 weeks ago
- Spark style guide☆266Updated last year
- pyspark methods to enhance developer productivity 📣 👯 🎉☆678Updated 9 months ago
- Source code of the WhyLabs Platform OSS☆41Updated 10 months ago
- re_data - fix data issues before your users & CEO would discover them 😊☆1,570Updated last year
- An end-to-end implementation of intent prediction with Metaflow and other cool tools☆873Updated 2 years ago
- PySpark data-pipeline testing and CICD☆28Updated 5 years ago
- ☆202Updated 2 years ago
- Export and import MLflow experiments, runs or registered models☆80Updated 3 years ago
- Apache Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows, that encode lineage/tracing and…☆2,333Updated 2 weeks ago
- Titan Core - Snowflake infrastructure-as-code. Provision environments, automate deploys, CI/CD. Manage RBAC, users, roles, and data acces…☆479Updated 9 months ago
- Minimal deployment of Great Expectations on lambda☆11Updated 5 years ago
- Metadata service library for Amundsen☆82Updated 3 weeks ago
- Data pipeline with dbt, Airflow, Great Expectations☆165Updated 4 years ago
- A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rew…☆2,128Updated 3 weeks ago
- Queries the ACCESS_HISTORY and QUERY_HISTORY views, from the SNOWFLAKE.ACCOUNT_USAGE schema, and generates two interactive GraphViz visua…☆12Updated 2 years ago
- Template for a data contract used in a data mesh.☆485Updated last year
- Data ingestion library for Amundsen to build graph and search index☆204Updated last year
- A scalable general purpose micro-framework for defining dataflows. THIS REPOSITORY HAS BEEN MOVED TO www.github.com/dagworks-inc/hamilton☆860Updated 2 years ago
- A Spark library for Amazon SageMaker.☆301Updated 9 months ago