sfu-db / dataprep
Open-source low code data preparation library in python. Collect, clean and visualization your data in python with a few lines of code.
☆2,060Updated 4 months ago
Related projects ⓘ
Alternatives and complementary repositories for dataprep
- Visualize and compare datasets, target values and associations, with one line of code.☆2,944Updated 3 months ago
- Automatically Visualize any dataset, any size with a single line of code. Created by Ram Seshadri. Collaborators Welcome. Permission Gra…☆1,725Updated 4 months ago
- Clean APIs for data cleaning. Python implementation of R package Janitor☆1,357Updated this week
- Build and share data reports in 100% Python☆1,381Updated last year
- Visualizer for pandas data structures☆4,768Updated 2 weeks ago
- Extra blocks for scikit-learn pipelines.☆1,271Updated this week
- Automatically visualize your pandas dataframe via a single print! 📊 💡☆5,177Updated 7 months ago
- Quickly build Explainable AI dashboards that show the inner workings of so-called "blackbox" machine learning models.☆2,305Updated 3 months ago
- Feature engineering package with sklearn like functionality☆1,913Updated this week
- Prepping tables for machine learning☆1,207Updated this week
- Pandas DataFrames as Interactive DataTables☆795Updated this week
- A Python package for manipulating 2-dimensional tabular data structures☆1,817Updated 2 weeks ago
- A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rew…☆2,003Updated last month
- Easy to use Python library of customized functions for cleaning and analyzing data.☆500Updated last week
- bamboolib - a GUI for pandas DataFrames☆939Updated 8 months ago
- A package which efficiently applies any function to a pandas dataframe or series in the fastest available manner☆2,534Updated 7 months ago
- A python library for decision tree visualization and model interpretation.☆2,957Updated 2 months ago
- Predictive Power Score (PPS) in Python☆1,115Updated 8 months ago
- A simple and efficient tool to parallelize Pandas operations on all available CPUs☆3,679Updated 4 months ago
- Statistical package in Python based on Pandas☆1,625Updated 3 weeks ago
- A high-level plotting API for pandas, dask, xarray, and networkx built on HoloViews☆1,128Updated this week
- Monitor the stability of a Pandas or Spark dataframe ⚙︎☆495Updated last month
- A light-weight, flexible, and expressive statistical data testing library☆3,364Updated last week
- Build animated charts in Jupyter Notebook and similar environments with a simple Python syntax.☆951Updated 6 months ago
- Panel: The powerful data exploration & web app framework for Python☆4,762Updated this week
- Missing data visualization module for Python.☆3,952Updated 5 months ago
- EvalML is an AutoML library written in python.☆774Updated this week
- 1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.☆12,515Updated this week
- Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark☆1,479Updated this week
- Visualize large time series data with plotly.py☆1,035Updated last week