whylabs / whylogs-proto
Protobuf definition for WhyLogs format
☆14 Updated 4 years ago
Alternatives and similar repositories for whylogs-proto
Users that are interested in whylogs-proto are comparing it to the libraries listed below
- Profile and monitor your ML data pipeline end-to-end ☆179 Updated 3 years ago
- A collection of WhyLogs examples in various languages ☆48 Updated last year
- ☆10 Updated 8 years ago
- A scalable general purpose micro-framework for defining dataflows. THIS REPOSITORY HAS BEEN MOVED TO www.github.com/dagworks-inc/hamilton ☆860 Updated 2 years ago
- ☆34 Updated 4 years ago
- Scalable identity resolution, entity resolution, data mastering and deduplication using ML ☆1,086 Updated this week
- Move fast from data science prototype to pipeline. Capture, analyze, and transform messy notebooks into data pipelines with just two line… ☆667 Updated 6 months ago
- Apache Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows, that encode lineage/tracing and… ☆2,254 Updated this week
- An end-to-end implementation of intent prediction with Metaflow and other cool tools ☆869 Updated 2 years ago
- Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io ☆2,167 Updated this week
- Python API for Deequ ☆790 Updated 5 months ago
- The core cli implementation ☆19 Updated this week
- do more with dbt. dbt-fal helps you run Python alongside dbt, so you can send Slack alerts, detect anomalies and build machine learning m… ☆855 Updated last year
- ☆379 Updated last year
- What's in your data? Extract schema, statistics and entities from datasets ☆1,515 Updated 3 weeks ago
- A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rew… ☆2,108 Updated 5 months ago
- Repository with sample code and instructions for "Continuous Intelligence" and "Continuous Delivery for Machine Learning: CD4ML" workshop… ☆320 Updated last year
- ELT Code for your Data Warehouse ☆26 Updated last year
- Data Quality assessment with one line of code ☆449 Updated this week
- Hapi? ☆14 Updated 4 years ago
- Automated data quality suggestions and analysis with Deequ on AWS Glue ☆87 Updated 2 years ago
- lakeFS airflow operator ☆27 Updated last year
- Titan Core - Snowflake infrastructure-as-code. Provision environments, automate deploys, CI/CD. Manage RBAC, users, roles, and data acces… ☆478 Updated 5 months ago
- re_data - fix data issues before your users & CEO would discover them 😊 ☆1,571 Updated last year
- Contribute and collaborate on educational content for the Airbyte Community. ☆45 Updated 4 years ago
- PySpark test helper methods with beautiful error messages ☆713 Updated last month
- Template for a data contract used in a data mesh. ☆476 Updated last year
- Ingesting data with Pulumi, AWS lambdas and Snowflake in a scalable, fully replayable manner ☆71 Updated 3 years ago
- Joining the modern data stack with the modern ML stack ☆199 Updated 2 years ago
- Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets. ☆3,501 Updated last week