AutoViML / pandas_dq
Find data quality issues and clean your data in a single line of code with a Scikit-Learn compatible Transformer.
☆128 · Updated last year
Alternatives and similar repositories for pandas_dq
Users interested in pandas_dq are comparing it to the libraries listed below:
- Streamline scikit-learn model comparison. ☆146 · Updated 2 years ago
- summarytools in jupyter notebook ☆103 · Updated 5 months ago
- Feature engineering package with sklearn like functionality ☆52 · Updated 5 months ago
- Explore and compare 1K+ accurate decision trees in your browser! ☆159 · Updated 11 months ago
- Fetch, transform and plot real-time OHLC data from Coinbase using Bytewax, Bokeh and Streamlit ☆126 · Updated 9 months ago
- An end-to-end project on customer segmentation ☆81 · Updated 2 years ago
- Slides for "Feature engineering for time series forecasting" talk ☆58 · Updated 2 years ago
- Official code repo for the O'Reilly Book - Machine Learning for High-Risk Applications ☆101 · Updated last year
- The Orange Book of Machine Learning ☆33 · Updated 2 weeks ago
- Code and materials for Effective Polars book ☆73 · Updated 10 months ago
- Implementation of various Machine learning and MLOps applications/tutorials used within my Medium blog. ☆9 · Updated 2 years ago
- Tutorials on creating a reproducible and maintainable data science project ☆142 · Updated 2 years ago
- pipreqs with jupyter notebook support ☆67 · Updated last year
- Plugins, extensions, case studies, articles, and video tutorials for Kedro ☆69 · Updated 2 months ago
- Automatically transform all categorical, date-time, NLP variables to numeric in a single line of code for any data set any size. ☆64 · Updated 3 weeks ago
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou… ☆112 · Updated 10 months ago
- Material for PyData NYC Tutorial on Large Scale Timeseries Forecasting ☆26 · Updated 2 years ago
- A book of subtle code tricks and gem resources for all things data, machine learning and deep learning. ☆165 · Updated 5 months ago
- Notebooks that support blog posts and tech talks on Dask / Coiled. ☆47 · Updated last year
- It's all in the name ☆76 · Updated last year
- An abstraction layer for parameter tuning ☆35 · Updated 5 months ago
- PyData London 2022 Tutorial ☆66 · Updated 2 years ago
- Python library for Applied Computational Supply Chain & Logistics. Unlock Neural Nets, Bayesian EOQ, Optimization, Time Series, and more … ☆82 · Updated this week
- Example project with a complete MLOps cycle: versioning data, generating reports on pull requests and deploying the model on releases wit… ☆47 · Updated 3 years ago
- Tries to shrink your Pandas column dtypes with no data loss so you have more spare RAM ☆50 · Updated last year
- ⚓ Eurybia monitors model drift over time and securizes model deployment with data validation ☆206 · Updated 3 months ago