sfu-db / dataprepLinks
Open-source low code data preparation library in python. Collect, clean and visualization your data in python with a few lines of code.
☆2,172Updated 11 months ago
Alternatives and similar repositories for dataprep
Users that are interested in dataprep are comparing it to the libraries listed below
Sorting:
- Automatically Visualize any dataset, any size with a single line of code. Created by Ram Seshadri. Collaborators Welcome. Permission Gra…☆1,812Updated last year
- Visualize and compare datasets, target values and associations, with one line of code.☆3,021Updated 10 months ago
- Clean APIs for data cleaning. Python implementation of R package Janitor☆1,426Updated this week
- bamboolib - a GUI for pandas DataFrames☆949Updated last year
- A package which efficiently applies any function to a pandas dataframe or series in the fastest available manner☆2,616Updated last year
- Automatically visualize your pandas dataframe via a single print! 📊 💡☆5,282Updated last year
- Feature engineering package with sklearn like functionality☆2,070Updated last month
- Easy to use Python library of customized functions for cleaning and analyzing data.☆514Updated last month
- 1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.☆12,979Updated this week
- Quickly build Explainable AI dashboards that show the inner workings of so-called "blackbox" machine learning models.☆2,403Updated 2 weeks ago
- Visualizer for pandas data structures☆4,927Updated 3 months ago
- Predictive Power Score (PPS) in Python☆1,148Updated 5 months ago
- The purpose of this project is to share knowledge on how awesome Streamlit is and can be☆2,165Updated 2 years ago
- Data Quality assessment with one line of code☆444Updated this week
- 🔅 Shapash: User-friendly Explainability and Interpretability to Develop Reliable and Transparent Machine Learning Models☆2,895Updated last week
- Extra blocks for scikit-learn pipelines.☆1,344Updated last week
- Hummingbird compiles trained ML models into tensor computation for faster inference.☆3,443Updated 2 months ago
- Bokeh Plotting Backend for Pandas and GeoPandas☆885Updated last year
- A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rew…☆2,087Updated 2 months ago
- Elyra extends JupyterLab with an AI centric approach.☆1,930Updated last week
- HiPlot makes understanding high dimensional data easy☆2,793Updated last year
- A light-weight, flexible, and expressive statistical data testing library☆3,854Updated last week
- Statistical package in Python based on Pandas☆1,788Updated 3 months ago
- Fastest library to load data from DB to DataFrames in Rust and Python☆2,334Updated this week
- A high-level plotting API for pandas, dask, xarray, and networkx built on HoloViews☆1,198Updated this week
- A set of data tools in Python☆502Updated 5 months ago
- The fastest ⚡️ way to build data pipelines. Develop iteratively, deploy anywhere. ☁️☆3,586Updated 3 weeks ago
- Natural Intelligence is still a pretty good idea.☆815Updated 11 months ago
- Build and share data reports in 100% Python☆1,395Updated last year
- High-level tools to simplify visualization in Python.☆875Updated 2 months ago