mapattacker / datascience
How to be a Data Scientist
☆34Updated 3 years ago
Alternatives and similar repositories for datascience:
Users that are interested in datascience are comparing it to the libraries listed below
- Applied Machine Learning with Python☆77Updated 10 months ago
- A tiny framework to perform adversarial validation of your training and test data.☆20Updated last month
- Jupyter Notebooks and other material from tutorial sessions on Machine Learning, Data Science, and related☆56Updated 3 years ago
- stratx is a library for A Stratification Approach to Partial Dependence for Codependent Variables☆65Updated 9 months ago
- NitroFE is a Python feature engineering engine which provides a variety of modules designed to internally save past dependent values for …☆106Updated 2 years ago
- Pre-Modelling Analysis of the data, by doing various exploratory data analysis and Statistical Test.☆51Updated last year
- Automatically transform all categorical, date-time, NLP variables to numeric in a single line of code for any data set any size.☆64Updated 3 weeks ago
- Wrap-up to automatically tune xgboost in Python.☆79Updated 3 years ago
- The simplest way to deploy a machine learning model☆23Updated 2 years ago
- Notebook and slides for my talk at Pydata NYC 2018☆88Updated 8 months ago
- Developmental tools to detect data drift☆14Updated 11 months ago
- Repository for the research and implementation of categorical encoding into a Featuretools-compatible Python library☆51Updated 2 years ago
- Automated Data Science and Machine Learning library to optimize workflow.☆104Updated 2 years ago
- Pyspark in Google Colab: A simple machine learning (Linear Regression) model☆36Updated 5 years ago
- Machine Learning encoders for feature transformation & engineering: target encoder, weight of evidence, label encoder.☆23Updated 4 years ago
- Learning statistics with Python☆52Updated 4 years ago
- Predicting the Likelihood to Purchase a Financial Product Following a Direct Marketing Campaign☆27Updated 2 years ago
- Scikit-Learn compatible transformer that turns categorical variables into dense entity embeddings.☆42Updated last year
- CinnaMon is a Python library which offers a number of tools to detect, explain, and correct data drift in a machine learning system☆77Updated 2 years ago
- Assorted exercises and proof-of-concepts to understand and study machine learning and statistical learning theory☆45Updated 6 years ago
- xverse (XuniVerse) is collection of transformers for feature engineering and feature selection☆117Updated last year
- Python Machine Learning (ML) project that demonstrates the archetypal ML workflow within a Jupyter notebook, with automated model deploym…☆61Updated 2 years ago
- A comprehensive tool for linguistic analysis of communities☆49Updated 3 years ago
- Makes Interactive Chart Widget, Cleans raw data, Runs baseline models, Interactive hyperparameter tuning & tracking☆55Updated 3 years ago
- General Interpretability Package☆58Updated 2 years ago
- Companion Notebooks and Data for Data Science with Python and Dask from Manning Publications☆51Updated 4 years ago
- Example usage of scikit-hts☆57Updated 2 years ago
- A presention of core concepts and a data generator making easier using tabular data with TensorFlow and Keras☆41Updated last year
- Materials for conference talks and workshops☆31Updated last year
- Recurrent Neural Networks for Timeseries☆24Updated 5 years ago