pierrenodet / spark-ensemble
Ensemble Learning for Apache Spark 🌲
☆23 Updated 4 months ago
Alternatives and similar repositories for spark-ensemble:
Users interested in spark-ensemble are comparing it to the libraries listed below.
- A simplified version of featuretools for Spark ☆31 Updated 5 years ago
- Record matching and entity resolution at scale in Spark ☆32 Updated last year
- Python PMML scoring library for PySpark as SparkML Transformer ☆22 Updated last month
- SOUL: Scala Oversampling and Undersampling Library. ☆13 Updated 5 years ago
- real-time data + ML pipeline ☆54 Updated this week
- A library for exporting Spark ML models and pipelines to PFA ☆54 Updated 6 years ago
- TSFresh primitives for featuretools ☆36 Updated 2 years ago
- Data Lineage Tracing Library ☆22 Updated 3 years ago
- Machine learning enhancements to Spark MLlib ☆20 Updated 9 years ago
- Repository for the research and implementation of categorical encoding into a Featuretools-compatible Python library ☆51 Updated 2 years ago
- Building Annoy Index on Apache Spark ☆72 Updated 4 years ago
- A Python wrapper for XGBoost4J-Spark classes. ☆47 Updated 9 months ago
- Helpers for scikit-learn ☆16 Updated 2 years ago
- Spark ML implementation of SOM algorithm (Kohonen self-organizing map) ☆17 Updated 2 years ago
- Scala/Spark implementation of Distributed Nearest Neighbours Mean Shift using LSH ☆30 Updated 5 years ago
- Sample application running fbprophet using spark ☆49 Updated 5 years ago
- Basic framework utilities to quickly start writing production ready Apache Spark applications ☆35 Updated last month
- Documentation for Hopsworks and Hops ☆11 Updated 2 years ago
- SparkER: an Entity Resolution framework for Apache Spark ☆63 Updated 9 months ago
- Asynchronous actions for PySpark ☆47 Updated 3 years ago
- C4E, a JVM friendly library written in Scala for both local and distributed (Spark) Clustering. ☆130 Updated 3 years ago
- This package contains a generic implementation of greedy Information Theoretic Feature Selection (FS) methods. The implementation is base… ☆134 Updated 2 years ago
- Probabilistic Multivariate Time Series Forecast using Deep Learning ☆95 Updated 5 years ago
- Dataiku DSS plugin to automate time series forecasting with Deep Learning and statistical models 📈 ☆18 Updated last year
- Powerful rapid automatic EDA and feature engineering library with a very easy to use API 🌟 ☆53 Updated 3 years ago
- Spark ML Lib serving library ☆48 Updated 6 years ago
- Examples for Deep Learning/Feature Store/Spark/Flink/Hive/Kafka jobs and Jupyter notebooks on Hops ☆118 Updated last year