pierrenodet / spark-ensemble
Ensemble Learning for Apache Spark 🌲
☆23Updated 7 months ago
Alternatives and similar repositories for spark-ensemble:
Users that are interested in spark-ensemble are comparing it to the libraries listed below
- Record matching and entity resolution at scale in Spark☆34Updated last year
- Repository for the research and implementation of categorical encoding into a Featuretools-compatible Python library☆51Updated 2 years ago
- Helpers for scikit learn☆16Updated 2 years ago
- Spark implementation of computing Shapley Values using monte-carlo approximation☆74Updated 2 years ago
- An AutoML pipeline selection system to quickly select a promising pipeline for a new dataset.☆82Updated 3 years ago
- Lambda Learner is a library for iterative incremental training of a class of supervised machine learning models.☆42Updated last year
- A simplified version of featuretools for Spark☆31Updated 5 years ago
- Train TensorFlow models on YARN in just a few lines of code!☆88Updated last year
- An abstraction layer for parameter tuning☆35Updated 7 months ago
- The stream-learn is an open-source Python library for difficult data stream analysis.☆63Updated last month
- AutoBazaar: An AutoML System from the Machine Learning Bazaar☆33Updated 3 years ago
- Predict the poverty of households in Costa Rica using automated feature engineering.☆23Updated 4 years ago
- Powerful rapid automatic EDA and feature engineering library with a very easy to use API 🌟☆53Updated 3 years ago
- NitroML is a modular, portable, and scalable model-quality benchmarking framework for Machine Learning and Automated Machine Learning (Au…☆43Updated 4 years ago
- Scala/Spark implementation of Distributed Nearest Neighbours Mean Shift using LSH☆30Updated 5 years ago
- Projects developed by Domino's R&D team☆76Updated 3 years ago
- Python library to explain Tree Ensemble models (TE) like XGBoost, using a rule list.☆53Updated last year
- TSFresh primitives for featuretools☆36Updated 2 years ago
- An automated machine learning tool aimed to facilitate AutoML research.☆98Updated 7 months ago
- Data Exploration in PySpark made easy - Pyspark_dist_explore provides methods to get fast insights in your Spark DataFrames.☆103Updated 5 years ago
- A Tree Search Library for Data Cleaning☆22Updated 3 years ago
- A library for exporting Spark ML models and pipelines to PFA☆54Updated 6 years ago
- This package contains a generic implementation of greedy Information Theoretic Feature Selection (FS) methods. The implementation is base…☆134Updated 2 years ago
- Spark Parameter Optimization and Tuning☆31Updated 7 years ago
- A Python wrapper for XGBoost4J-Spark classes.☆47Updated last year
- Instant search for and access to many datasets in Pyspark.☆34Updated 2 years ago
- Joblib Apache Spark Backend☆245Updated 2 weeks ago
- real-time data + ML pipeline☆54Updated 2 weeks ago
- Spark functions to run popular phonetic and string matching algorithms☆60Updated 3 years ago
- General Interpretability Package☆58Updated 2 years ago