ydataai / ydata-profiling
1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
☆12,385Updated last week
Related projects: ⓘ
- Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per s…☆8,257Updated this week
- Missing data visualization module for Python.☆3,896Updated 4 months ago
- Modin: Scale your Pandas workflows by changing a single line of code☆9,747Updated this week
- An open-source, low-code machine learning library in Python☆8,818Updated 2 weeks ago
- Declarative statistical visualization library for Python☆9,227Updated this week
- Visual analysis and diagnostic tools to facilitate machine learning model selection.☆4,264Updated last year
- A logical, reasonably standardized, but flexible project structure for doing and sharing data science work.☆8,186Updated last month
- An open source python library for automated feature engineering☆7,196Updated this week
- 🦉 ML Experiments and Data Management with Git☆13,608Updated this week
- A Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming.☆9,655Updated last month
- Statistical data visualization in Python☆12,401Updated last month
- A library of extension and helper modules for Python's data analysis and machine learning libraries.☆4,862Updated 2 months ago
- Automated Machine Learning with scikit-learn☆7,542Updated last week
- Statsmodels: statistical modeling and econometrics in Python☆9,978Updated this week
- Build and manage real-life ML, AI, and data science projects with ease!☆8,046Updated this week
- Visualize and compare datasets, target values and associations, with one line of code.☆2,909Updated last month
- Parallel computing with task scheduling☆12,405Updated this week
- Open source platform for the machine learning lifecycle☆18,340Updated this week
- 📚 Parameterize, execute, and analyze notebooks☆5,789Updated 3 weeks ago
- A Python Package to Tackle the Curse of Imbalanced Datasets in Machine Learning☆6,805Updated 3 months ago
- Automatically visualize your pandas dataframe via a single print! 📊 💡☆5,136Updated 5 months ago
- A game theoretic approach to explain the output of any machine learning model.☆22,506Updated this week
- Voilà turns Jupyter notebooks into standalone web applications☆5,394Updated 2 weeks ago
- A simple and efficient tool to parallelize Pandas operations on all available CPUs☆3,634Updated 2 months ago
- Tool for producing high quality forecasts for time series data that has multiple seasonality with linear or non-linear growth.☆18,277Updated 2 weeks ago
- A package which efficiently applies any function to a pandas dataframe or series in the fastest available manner☆2,515Updated 5 months ago
- The interactive graphing library for Python This project now includes Plotly Express!☆16,007Updated this week
- Visualizer for pandas data structures☆4,703Updated last week
- Custom Jupyter Notebook Themes☆9,762Updated 11 months ago
- Automatic extraction of relevant features from time series:☆8,361Updated last month