AutoViML / pandas_dq
Find data quality issues and clean your data in a single line of code with a Scikit-Learn compatible Transformer.
☆127Updated 11 months ago
Related projects ⓘ
Alternatives and complementary repositories for pandas_dq
- Slides for "Feature engineering for time series forecasting" talk☆57Updated 2 years ago
- summarytools in jupyter notebook☆98Updated 3 months ago
- Fetch, transform and plot real-time OHLC data from Coinbase using Bytewax, Bokeh and Streamlit☆122Updated 6 months ago
- Explore and compare 1K+ accurate decision trees in your browser!☆153Updated 8 months ago
- Feature engineering package with sklearn like functionality☆51Updated 2 months ago
- Tutorials on creating a reproducible and maintainable data science project☆138Updated 2 years ago
- ☆31Updated 6 months ago
- Streamline scikit-learn model comparison.☆146Updated last year
- A tutorial on how to use kedro-mlflow plugin (https://github.com/Galileo-Galilei/kedro-mlflow) to synchronize training and inference and …☆37Updated 2 years ago
- ☆32Updated last year
- pipreqs with jupyter notebook support☆66Updated last year
- ☆273Updated last year
- A book of subtle code tricks and gem resources for all things data, machine learning and deep learning.☆165Updated 2 months ago
- Demo for CI/CD in a machine learning project☆93Updated last year
- ☆38Updated 2 years ago
- Plugins, extensions, case studies, articles, and video tutorials for Kedro☆63Updated last month
- Code and materials for Effective Polars book☆69Updated 7 months ago
- An end-to-end project on customer segmentation☆82Updated last year
- ⚓ Eurybia monitors model drift over time and securizes model deployment with data validation☆205Updated 3 weeks ago
- Resources for some of our education content☆30Updated last week
- Demo on how to use Prefect 2 in an ML project☆40Updated 2 years ago
- MLOps maturity assessment☆57Updated last year
- ☆148Updated last year
- PyData London 2022 Tutorial☆66Updated 2 years ago
- An unsupervised feature selection technique using supervised algorithms such as XGBoost☆88Updated 10 months ago
- Develop and deploy a real-time feature pipeline in Python, using Bytewax 🐝 and Hopsworks Feature Store.☆126Updated last year
- An abstraction layer for parameter tuning☆36Updated 2 months ago
- Repository for the explanation method Calibrated Explanations (CE)☆54Updated this week
- Official code repo for the O'Reilly Book - Machine Learning for High-Risk Applications☆97Updated last year
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou…☆111Updated 7 months ago