unionai-oss/pandera

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/unionai-oss/pandera)

unionai-oss / pandera

A light-weight, flexible, and expressive statistical data testing library

☆4,410

Alternatives and similar repositories for pandera

Users that are interested in pandera are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

fivetran / great_expectations
View on GitHub
Always know what to expect from your data.
☆11,668Updated this week
ibis-project / ibis
View on GitHub
the portable Python dataframe library
☆6,609Updated this week
fugue-project / fugue
View on GitHub
A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rew…
☆2,170May 19, 2026Updated 2 months ago
pola-rs / polars
View on GitHub
Extremely fast Query Engine for DataFrames, written in Rust
☆39,100Updated this week
narwhals-dev / narwhals
View on GitHub
Lightweight and extensible compatibility layer between dataframe libraries!
☆1,687Updated this week
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
kedro-org / kedro
View on GitHub
Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and…
☆10,931Updated this week
modin-project / modin
View on GitHub
Modin: Scale your Pandas workflows by changing a single line of code
☆10,393Feb 10, 2026Updated 5 months ago
dagster-io / dagster
View on GitHub
An orchestration platform for the development, production, and observation of data assets.
☆15,898Updated this week
JakobGM / patito
View on GitHub
A data modelling layer built on top of polars and pydantic
☆634May 8, 2026Updated 2 months ago
Data-Centric-AI-Community / fg-data-profiling
View on GitHub
1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
☆13,653Apr 22, 2026Updated 3 months ago
pydantic / pydantic
View on GitHub
Data validation using Python type hints
☆28,392Updated this week
pyjanitor-devs / pyjanitor
View on GitHub
Clean APIs for data cleaning. Python implementation of R package Janitor
☆1,501Updated this week
vaexio / vaex
View on GitHub
Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per s…
☆8,510Apr 1, 2026Updated 3 months ago
treeverse / dvc
View on GitHub
🦉 Data Versioning and ML Experiments
☆15,774Updated this week
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
PrefectHQ / prefect
View on GitHub
Prefect is a workflow orchestration framework for building resilient data pipelines in Python.
☆23,479Updated this week
astral-sh / ruff
View on GitHub
An extremely fast Python linter and code formatter, written in Rust.
☆48,838Updated this week
sktime / sktime
View on GitHub
A unified framework for machine learning with time series
☆9,880Updated this week
ploomber / ploomber
View on GitHub
The fastest ⚡️ way to build data pipelines. Develop iteratively, deploy anywhere. ☁️
☆3,622May 29, 2025Updated last year
fastapi / typer
View on GitHub
Typer, build great CLIs. Easy to code. Based on Python type hints.
☆19,806Updated this week
nteract / papermill
View on GitHub
📚 Parameterize, execute, and analyze notebooks
☆6,461Jul 6, 2026Updated 2 weeks ago
Netflix / metaflow
View on GitHub
Build, Manage and Deploy AI/ML Systems
☆10,196Updated this week
koaning / scikit-lego
View on GitHub
Extra blocks for scikit-learn pipelines.
☆1,409Updated this week
HypothesisWorks / hypothesis
View on GitHub
The property-based testing library for Python
☆8,818Updated this week
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
Nixtla / statsforecast
View on GitHub
Lightning ⚡️ fast forecasting with statistical and econometric models.
☆4,850Updated this week
whylabs / whylogs
View on GitHub
An open-source data logging library for machine learning models and data pipelines. 📚 Provides visibility into data quality & model perf…
☆2,829Jan 10, 2025Updated last year
apache / hamilton
View on GitHub
Apache Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows, that encode lineage/tracing and…
☆2,552Updated this week
flyteorg / flyte
View on GitHub
Dynamic, resilient AI orchestration. Coordinate data, models, and compute as you build AI workflows.
☆7,149Updated this week
marimo-team / marimo
View on GitHub
A reactive notebook for Python — run reproducible experiments, query with SQL, execute as a script, deploy as an app, and version with gi…
☆22,043Updated this week
online-ml / river
View on GitHub
🌊 Online machine learning in Python
☆5,888Updated this week
evidentlyai / evidently
View on GitHub
Evidently is an open-source ML and LLM observability framework. Evaluate, test, and monitor any AI-powered system or data pipeline. Fro…
☆7,748May 2, 2026Updated 2 months ago
NannyML / nannyml
View on GitHub
nannyml: post-deployment data science in python
☆2,146Jul 12, 2025Updated last year
unit8co / darts
View on GitHub
A python library for user-friendly forecasting and anomaly detection on time series.
☆9,475Updated this week
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
deepchecks / deepchecks
View on GitHub
Deepchecks: Tests for Continuous Validation of ML Models & Data. Deepchecks is a holistic open-source solution for all of your AI & ML va…
☆4,039Dec 28, 2025Updated 6 months ago
lux-org / lux
View on GitHub
Automatically visualize your pandas dataframe via a single print! 📊 💡
☆5,377Mar 20, 2024Updated 2 years ago
sfu-db / connector-x
View on GitHub
Fastest library to load data from DB to DataFrames in Rust and Python
☆2,638Updated this week
Quantco / dataframely
View on GitHub
A declarative, 🐻‍❄️-native data frame validation library.
☆608Updated this week
posit-dev / great-tables
View on GitHub
Make awesome display tables using Python
☆2,704Updated this week
sodadata / soda-core
View on GitHub
Data Contracts engine for the modern data stack. https://www.soda.io
☆2,397Updated this week
SQLMesh / sqlmesh
View on GitHub
Scalable and efficient data transformation framework - backwards compatible with dbt.
☆3,222Updated this week