AutoViML / pandas_dq
Find data quality issues and clean your data in a single line of code with a Scikit-Learn compatible Transformer.
☆129Updated last year
Alternatives and similar repositories for pandas_dq:
Users that are interested in pandas_dq are comparing it to the libraries listed below
- Streamline scikit-learn model comparison.☆146Updated 2 years ago
- Explore and compare 1K+ accurate decision trees in your browser!☆159Updated last year
- summarytools in jupyter notebook☆104Updated 7 months ago
- Slides for "Feature engineering for time series forecasting" talk☆58Updated 2 years ago
- Fetch, transform and plot real-time OHLC data from Coinbase using Bytewax, Bokeh and Streamlit☆127Updated 10 months ago
- Feature engineering package with sklearn like functionality☆53Updated 6 months ago
- Tutorials on creating a reproducible and maintainable data science project☆143Updated 2 years ago
- ☆33Updated last month
- Implementation of various Machine learning and MLOps applications/tutorials used within my Medium blog.☆10Updated 2 years ago
- An end-to-end project on customer segmentation☆81Updated 2 years ago
- Material for PyData NYC Tutorial on Large Scale Timeseries Forecasting☆26Updated 2 years ago
- Code and materials for Effective Polars book☆75Updated 11 months ago
- ⚓ Eurybia monitors model drift over time and securizes model deployment with data validation☆206Updated 4 months ago
- Demo for CI/CD in a machine learning project☆104Updated last year
- Sample projects using Ploomber.☆86Updated last year
- ☆32Updated last year
- An unsupervised feature selection technique using supervised algorithms such as XGBoost☆89Updated last year
- It's all in the name☆76Updated last year
- Official code repo for the O'Reilly Book - Machine Learning for High-Risk Applications☆102Updated last year
- An abstraction layer for parameter tuning☆35Updated 6 months ago
- Plugins, extensions, case studies, articles, and video tutorials for Kedro☆72Updated 3 months ago
- 📊 Explain why metrics change by unpacking them☆37Updated 2 months ago
- Repo for Vizzu workshop materials.☆45Updated 11 months ago
- A book of subtle code tricks and gem resources for all things data, machine learning and deep learning.☆166Updated 6 months ago
- ☆115Updated last year
- A set of examples illustrating some possible use cases for NannyML☆19Updated last year
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou…☆113Updated 11 months ago
- A tutorial on how to use kedro-mlflow plugin (https://github.com/Galileo-Galilei/kedro-mlflow) to synchronize training and inference and …☆37Updated 2 years ago
- Demo on how to use Prefect with Docker☆25Updated 2 years ago
- Berlin Time Series Analysis Repository☆98Updated 2 years ago