jeongyoonlee / data-science-process-management
Resources for Data Science Process management
☆204Updated 5 years ago
Alternatives and similar repositories for data-science-process-management:
Users that are interested in data-science-process-management are comparing it to the libraries listed below
- Performance of various open source GBM implementations☆217Updated 10 months ago
- A package for parallelizing the fit and flexibly scoring of sklearn machine learning models, with visualization routines.☆199Updated last year
- Tutorial for a new versioning Machine Learning pipeline☆80Updated 3 years ago
- H2O.ai Machine Learning Interpretability Resources☆488Updated 4 years ago
- Lightweight, Python library for fast and reproducible experimentation☆134Updated 6 years ago
- autosklearn-zeroconf is a fully automated binary classifier. It is based on the AutoML challenge winner auto-sklearn. Give it a dataset w…☆169Updated 5 years ago
- Deploy AutoML as a service using Flask☆226Updated 7 years ago
- Open-source Python library for statistical analysis of randomised control trials (A/B tests)☆338Updated 2 years ago
- Personal data science and machine learning toolbox☆365Updated 5 years ago
- Basics of programming: algorithms, data structures, object oriented programming☆93Updated 6 years ago
- Spark implementation of computing Shapley Values using monte-carlo approximation☆74Updated 2 years ago
- ⬛ Python Individual Conditional Expectation Plot Toolbox☆165Updated 4 years ago
- A machine learning testing framework for sklearn and pandas. The goal is to help folks assess whether things have changed over time.☆102Updated 3 years ago
- edaviz - Python library for Exploratory Data Analysis and Visualization in Jupyter Notebook or Jupyter Lab☆224Updated 5 years ago
- Material of the Kaggle Berlin meetup group!☆35Updated 7 years ago
- Joblib Apache Spark Backend☆245Updated 3 weeks ago
- Data Analysis Baseline Library☆727Updated 4 months ago
- Repo for the ML_Insights python package☆150Updated 3 weeks ago
- A distributed Spark/Scala implementation of the isolation forest algorithm for unsupervised outlier detection, featuring support for scal…☆239Updated last month
- Data Exploration in PySpark made easy - Pyspark_dist_explore provides methods to get fast insights in your Spark DataFrames.☆103Updated 5 years ago
- MLFlow Spark Summit 2019 Presentation☆67Updated 5 years ago
- HandySpark - bringing pandas-like capabilities to Spark dataframes☆193Updated 5 years ago
- Tabular feature encoding pipelines for machine learning with options for string parsing, missing data infill, and stochastic perturbation…☆165Updated 3 months ago
- Examples of techniques for training interpretable ML models, explaining ML models, and debugging ML models for accuracy, discrimination, …☆677Updated 10 months ago
- Example Python DS project☆71Updated 6 years ago
- Using Kafka-Python to illustrate a ML production pipeline☆110Updated 2 years ago
- python library for automated dataset normalization☆114Updated last year
- A collaborative feature engineering system built on JupyterHub☆94Updated 6 years ago
- Engine for ML/Data tracking, visualization, explainability, drift detection, and dashboards for Polyaxon.☆517Updated last week
- Create HTML profiling reports from Apache Spark DataFrames☆196Updated 5 years ago