giantcroc / featuretoolsOnSpark
A simplified version of featuretools for Spark
☆30Updated 5 years ago
Related projects ⓘ
Alternatives and complementary repositories for featuretoolsOnSpark
- Run FeatureTools to automate Feature Engineering distributionally on Spark.☆11Updated 6 years ago
- Machine learning enhancements to Spark MlLib☆20Updated 9 years ago
- A Python wrapper for XGBoost4J-Spark classes.☆47Updated 6 months ago
- Python library for converting Apache Spark ML pipelines to PMML☆95Updated 10 months ago
- JPMML-SparkML plugin for converting LightGBM-Spark models to PMML☆41Updated 3 years ago
- Some notes/codes on hyperparameters tuning techniques with some hacking around...☆24Updated 6 years ago
- Spark-based GBM☆56Updated 4 years ago
- Python client library for the Openscoring REST web service☆32Updated 2 years ago
- ☆27Updated 3 years ago
- Public repository made for Automated Feature Engineering workshop (Summer Data Conf, Odessa, 2018-07-21)☆19Updated 6 years ago
- Featureselection methods as Spark MLlib Pipelines☆30Updated 6 years ago
- A collaborative feature engineering system built on JupyterHub☆94Updated 5 years ago
- Some popular algorithms(dbscan,knn,fm etc.) on spark☆31Updated 6 years ago
- ☆34Updated 6 years ago
- Python PMML scoring library for PySpark as SparkML Transformer☆22Updated 2 weeks ago
- Repository for the research and implementation of categorical encoding into a Featuretools-compatible Python library☆50Updated 2 years ago
- JPMML-SparkML plugin for converting XGBoost4J-Spark models to PMML☆36Updated 4 years ago
- A library for exporting Spark ML models and pipelines to PFA☆54Updated 5 years ago
- Apache Spark 2x Machine Learning Cookbook, published by Packt☆28Updated last year
- Java library and command-line application for converting XGBoost models to PMML☆128Updated 2 months ago
- SmartML: Supervised Machine Learning Automation in R☆24Updated 3 years ago
- An automatic machine learning toolkit, including hyper-parameter tuning and feature engineering.☆58Updated 5 years ago
- A Python implementation of "Shapley Value Methods for Attribution Modeling in Online Advertising" by Zhao, et al.☆35Updated 4 years ago
- The Synthetic Minority Oversampling Technique (SMOTE) implemented in Spark.☆49Updated 6 years ago
- Bosch Kaggle competion: Reduce manufacturing failures (https://www.kaggle.com/c/bosch-production-line-performance)☆24Updated 7 years ago
- PMML scoring library for Scala☆62Updated 2 weeks ago
- ☆74Updated 6 years ago
- Nyoka is a Python library that helps to export ML models into PMML (PMML 4.4.1 Standard).☆184Updated 9 months ago
- Examples for Deep Learning/Feature Store/Spark/Flink/Hive/Kafka jobs and Jupyter notebooks on Hops☆117Updated last year
- A POC of Google's Wide & Deep Learning models deployed on Google Cloud ML Engine for Kaggle's Outbrain Click Competition☆36Updated 6 years ago