mattkearns / automated-data-preprocessing
A command-line utility program for automating the trivial, frequently occurring data preparation tasks: missing value interpolation, outlier removal, and encoding categorical variables.
☆36Updated 6 years ago
Alternatives and similar repositories for automated-data-preprocessing:
Users that are interested in automated-data-preprocessing are comparing it to the libraries listed below
- Examples how MLJAR can be used☆59Updated last year
- OptimalFlow is an omni-ensemble and scalable automated machine learning Python toolkit, which uses Pipeline Cluster Traversal Experiments…☆27Updated last year
- Pre-Modelling Analysis of the data, by doing various exploratory data analysis and Statistical Test.☆51Updated last year
- Makes Interactive Chart Widget, Cleans raw data, Runs baseline models, Interactive hyperparameter tuning & tracking☆55Updated 3 years ago
- Automatically transform all categorical, date-time, NLP variables to numeric in a single line of code for any data set any size.☆64Updated last month
- Predict whether or not a patient will show up to their next appointment using automated feature engineering☆29Updated 4 years ago
- Predict the poverty of households in Costa Rica using automated feature engineering.☆23Updated 4 years ago
- Smart, automatic detection and stationarization of non-stationary time series data.☆29Updated 2 years ago
- A Python library for Automated Exploratory Data Analysis, Automated Data Cleaning, and Automated Data Preprocessing For Machine Learning …☆44Updated 2 years ago
- CentOS based Docker container for Time Series Analysis and Modeling.☆21Updated 5 years ago
- Code and notebooks containing my experiments in data science, EDA, visualization, and machine learning☆27Updated last year
- Create Interactive Dashboards with Streamlit and Python Coursera☆10Updated 4 years ago
- How to do data science with Optimus, Spark and Python.☆19Updated 5 years ago
- Building an API with the FastAPI framework to serve a scikit-learn model.☆18Updated 6 years ago
- Powerful rapid automatic EDA and feature engineering library with a very easy to use API 🌟☆53Updated 3 years ago
- Automated Exploratory Data Analysis. Simplifying Data Exploration☆34Updated 4 years ago
- datascienv is package that helps you to setup your environment in single line of code with all dependency and it is also include pyforest…☆58Updated 3 years ago
- Model to accurately forecast inventory demand based on historical sales data.☆60Updated 8 years ago
- Jupyter Notebooks for supplychainpy☆22Updated 8 years ago
- An open-source unsupervised time-series anomaly detection package by Getcontact Data Team☆44Updated 2 years ago
- ☆19Updated 4 years ago
- Build tensorflow keras model pipelines in a single line of code. Now with mlflow tracking. Created by Ram Seshadri. Collaborators welcome…☆121Updated 10 months ago
- ☆14Updated 5 years ago
- Data Analysis and Machine Learning with Python: EDA with ECDF and Correlation analysis, Preprocessing and Feature engineering, L1 (Lasso)…☆32Updated 7 years ago
- PySpark, Databrick, h2o, MLlib☆18Updated 8 years ago
- Python package for Bayesian & Frequentist A/B Testing☆12Updated last year
- A scikit-learn compatible estimator based on business-rules with interactive dashboard included☆28Updated 3 years ago
- Predicting the Likelihood to Purchase a Financial Product Following a Direct Marketing Campaign☆27Updated 2 years ago
- Work for Mastering Large Datasets with Python☆18Updated 2 years ago
- Confusion Matrix in Python: plot a pretty confusion matrix (like Matlab) in python using seaborn and matplotlib☆19Updated 3 years ago