Swiple / swiple
Swiple enables you to easily observe, understand, validate and improve the quality of your data
☆82Updated this week
Alternatives and similar repositories for swiple:
Users that are interested in swiple are comparing it to the libraries listed below
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou…☆113Updated last year
- A simple and easy to use Data Quality (DQ) tool built with Python.☆50Updated last year
- Cloud-agnostic Python API☆61Updated 9 months ago
- Possibly the fastest DataFrame-agnostic quality check library in town.☆184Updated last week
- dagster scikit-learn pipeline example.☆45Updated 2 years ago
- New generation opensource data stack☆65Updated 2 years ago
- Data Quality and Observability platform for the whole data lifecycle, from profiling new data sources to full automation with Data Observ…☆134Updated 2 months ago
- Pipeline definitions for managing data flows to power analytics at MIT Open Learning☆43Updated this week
- ☆69Updated last month
- Kedro Plugin to support running workflows on Kubeflow Pipelines☆53Updated 6 months ago
- Data-aware orchestration with dagster, dbt, and airbyte☆31Updated 2 years ago
- An example of a Dagster project with a possible folder structure to organize the assets, jobs, repositories, schedules, and ops. Also has…☆92Updated 4 months ago
- TinyOlap is a light-weight, in-process, in-memory, multi-dimensional, model-first OLAP engine for planning, budgeting, reporting, analysi…☆44Updated 2 years ago
- ☆73Updated 7 months ago
- Simple FastAPI declarative endpoint-level access control.☆98Updated 10 months ago
- ✨ A Pydantic to PySpark schema library☆75Updated this week
- All things awesome related to Dagster!☆100Updated last month
- Write your dbt models using Ibis☆64Updated 2 weeks ago
- Prefect API Authentication/Authorization Proxy for on-premises deployments☆38Updated 3 months ago
- Elevate your 🐍 code with optimal data structure recommendations from pyggester.☆88Updated last year
- Code examples showing flow deployment to various types of infrastructure☆105Updated 2 years ago
- Example repository showing how to build a data platform with Prefect, dbt and Snowflake☆100Updated 2 years ago
- Cost Efficient Data Pipelines with DuckDB☆51Updated 8 months ago
- Set up a Cost-Effective Modern Data Stack for a Charity☆19Updated last week
- The easiest way to integrate Kedro and Great Expectations☆53Updated 2 years ago
- Kedro Plugin to support running pipelines on Kubernetes using Airflow.☆28Updated 3 weeks ago
- This library can convert a pydantic class to a avro schema or generate python code from a avro schema.☆70Updated 3 weeks ago
- A write-audit-publish implementation on a data lake without the JVM☆46Updated 7 months ago
- Making DAG construction easier☆258Updated 3 weeks ago
- Soda SQL and Soda Spark have been deprecated and replaced by Soda Core. docs.soda.io/soda-core/overview.html☆61Updated 2 years ago