pierrenodet / spark-ensemble
Ensemble Learning for Apache Spark 🌲
☆22Updated 2 weeks ago
Related projects: ⓘ
- A simplified version of featuretools for Spark☆30Updated 5 years ago
- PMML scoring library for Spark as SparkML Transformer☆19Updated 2 weeks ago
- Toolkit for Apache Spark ML for Feature clean-up, feature Importance calculation suite, Information Gain selection, Distributed SMOTE, Mo…☆191Updated 3 years ago
- This package contains a generic implementation of greedy Information Theoretic Feature Selection (FS) methods. The implementation is base…☆134Updated 2 years ago
- real-time data + ML pipeline☆54Updated this week
- Featureselection methods as Spark MLlib Pipelines☆30Updated 6 years ago
- Accelerator to rapidly deploy customized features for your business☆55Updated 9 months ago
- Machine learning enhancements to Spark MlLib☆20Updated 9 years ago
- example how to perform distributed bayesian optimisation (autoML) using optuna on metaflow☆10Updated 2 years ago
- SparklingGraph documentation☆10Updated 4 years ago
- A library for exporting Spark ML models and pipelines to PFA☆54Updated 5 years ago
- A JVM interface 🌯 for LightGBM, written in Scala, for inference in production.☆14Updated last week
- Scala/Spark implementation of Distributed Nearest Neighbours Mean Shift using LSH☆30Updated 5 years ago
- Python PMML scoring library for PySpark as SparkML Transformer☆21Updated 2 weeks ago
- A Spark/Scala implementation of the isolation forest unsupervised outlier detection algorithm with support for exporting in ONNX format.☆224Updated 2 weeks ago
- A Python wrapper for XGBoost4J-Spark classes.☆46Updated 5 months ago
- Spark ML implementation of SOM algorithm (Kohonen self-organizing map)☆17Updated 2 years ago
- The Synthetic Minority Oversampling Technique (SMOTE) implemented in Spark.☆49Updated 6 years ago
- Spark Parameter Optimization and Tuning☆31Updated 6 years ago
- Helpers for scikit learn☆16Updated last year
- Distribution transparent Machine Learning experiments on Apache Spark☆89Updated 7 months ago
- Spark ML Lib serving library☆48Updated 6 years ago
- Spark Time Series Set data analysis☆12Updated 3 years ago
- Spark implementation of computing Shapley Values using monte-carlo approximation☆74Updated last year
- Joblib Apache Spark Backend☆242Updated last month
- Train TensorFlow models on YARN in just a few lines of code!☆86Updated 10 months ago
- ☆11Updated 4 years ago
- Repository for the research and implementation of categorical encoding into a Featuretools-compatible Python library☆50Updated 2 years ago
- Some notes/codes on hyperparameters tuning techniques with some hacking around...☆24Updated 6 years ago
- Randomized SVD of large sparse matrices on Spark☆77Updated 2 years ago