shawnbrown / datatestLinks
Tools for test driven data-wrangling and data validation.
☆294Updated 3 years ago
Alternatives and similar repositories for datatest
Users that are interested in datatest are comparing it to the libraries listed below
Sorting:
- A library for defensive data analysis.☆500Updated 5 years ago
- Test-Driven Data Analysis Functions☆299Updated this week
- A Pandas Styler class for making beautiful tables☆415Updated 2 years ago
- Interactive plotting for Pandas using Vega-Lite☆344Updated 6 years ago
- Engine for ML/Data tracking, visualization, explainability, drift detection, and dashboards for Polyaxon.☆519Updated last month
- Easy to use test framework for Jupyter Notebooks☆310Updated 2 years ago
- Easy pipelines for pandas DataFrames.☆720Updated this week
- Bulwark is a package for convenient property-based testing of pandas dataframes.☆225Updated 5 years ago
- The goal of pandas-log is to provide feedback about basic pandas operations. It provides simple wrapper functions for the most common fun…☆215Updated 4 years ago
- Magic functions for using Jupyter Notebook with Apache Spark and a variety of SQL databases.☆171Updated 6 years ago
- The easy way to write your own flavor of Pandas☆307Updated 2 weeks ago
- SQLCell is a magic function for the Jupyter Notebook that executes raw, parallel, parameterized SQL queries with the ability to accept Py…☆151Updated 2 years ago
- Render sparkline style charts in pandas dataframes☆93Updated 4 years ago
- Tools for exploratory data analysis in Python☆644Updated last year
- Machine learning model evaluation made easy: plots, tables, HTML reports, experiment tracking and Jupyter notebook analysis.☆468Updated 4 months ago
- Declarative statistical visualization library for Python☆237Updated 6 years ago
- Time everything in IPython☆124Updated last year
- ☆84Updated 7 years ago
- dask-searchcv is now part of dask-ml: https://github.com/dask/dask-ml☆239Updated 6 years ago
- Jupyter Notebooks as plain Python code with embedded Markdown text☆249Updated 5 years ago
- Pandas ExtensionDType/Array backed by Apache Arrow☆230Updated 2 years ago
- Design documents and code for the pandas 2.0 effort.☆304Updated 6 years ago
- Summarise and explore Pandas DataFrames☆98Updated 5 years ago
- a python grammar for evolutionary algorithms and heuristics☆190Updated 3 years ago
- Start a cluster in EC2 for dask.distributed☆106Updated 4 years ago
- Highly interpretable classifiers for scikit learn, producing easily understood decision rules instead of black box models☆490Updated 7 years ago
- Data exploration glue☆351Updated 7 months ago
- A Python library for unevenly-spaced time series analysis☆532Updated 4 months ago
- A pure Python implementation of Apache Spark's RDD and DStream interfaces.☆270Updated 10 months ago
- Lazydata: Scalable data dependencies for Python projects☆621Updated 6 years ago