AutoViML / pandas_dq
Find data quality issues and clean your data in a single line of code with a Scikit-Learn compatible Transformer.
☆130Updated last year
Alternatives and similar repositories for pandas_dq:
Users that are interested in pandas_dq are comparing it to the libraries listed below
- summarytools in jupyter notebook☆107Updated 8 months ago
- Feature engineering package with sklearn like functionality☆54Updated 7 months ago
- Streamline scikit-learn model comparison.☆145Updated 2 years ago
- Fetch, transform and plot real-time OHLC data from Coinbase using Bytewax, Bokeh and Streamlit☆127Updated 11 months ago
- Explore and compare 1K+ accurate decision trees in your browser!☆160Updated last year
- Slides for "Feature engineering for time series forecasting" talk☆59Updated 2 years ago
- ☆281Updated last year
- Official code repo for the O'Reilly Book - Machine Learning for High-Risk Applications☆103Updated last year
- ☆32Updated last year
- Tools to Transform a Time Series into Features and Target a.k.a Supervised Learning☆97Updated last year
- An unsupervised feature selection technique using supervised algorithms such as XGBoost☆90Updated last year
- Material for PyData NYC Tutorial on Large Scale Timeseries Forecasting☆26Updated 2 years ago
- Automatically transform all categorical, date-time, NLP variables to numeric in a single line of code for any data set any size.☆65Updated 2 months ago
- Code and materials for Effective Polars book☆79Updated last year
- Tutorials on creating a reproducible and maintainable data science project☆143Updated 2 years ago
- ☆34Updated 2 months ago
- A book of subtle code tricks and gem resources for all things data, machine learning and deep learning.☆165Updated 7 months ago
- ☆150Updated 2 years ago
- A FastMCP tool to search and retrieve Polars API documentation.☆27Updated this week
- Start a data science project with modern tools☆193Updated last year
- 👖 Conformal Tights adds conformal prediction of coherent quantiles and intervals to any scikit-learn regressor or Darts forecaster☆108Updated last month
- Demo on how to use Prefect with Docker☆25Updated 2 years ago
- Build Low Code Automated Tensorflow explainable models in just 3 lines of code. Library created by: Hasan Rafiq - https://www.linkedin.co…☆181Updated 2 years ago
- An end-to-end project on customer segmentation☆81Updated 2 years ago
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou…☆113Updated last year
- Plugins, extensions, case studies, articles, and video tutorials for Kedro☆76Updated 4 months ago
- An abstraction layer for parameter tuning☆35Updated 7 months ago
- A curated list of awesome open source tools and commercial products for monitoring data quality, monitoring model performance, and profil…☆75Updated 11 months ago
- Tries to shrink your Pandas column dtypes with no data loss so you have more spare RAM☆50Updated last year
- ☆115Updated last year