clarifyhealth / spark_feature_selector
☆12Updated 6 months ago
Alternatives and similar repositories for spark_feature_selector:
Users that are interested in spark_feature_selector are comparing it to the libraries listed below
- ☆8Updated 4 years ago
- Spark implementation of computing Shapley Values using monte-carlo approximation☆74Updated 2 years ago
- Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python)☆74Updated last year
- ☆96Updated 5 years ago
- Create HTML profiling reports from Apache Spark DataFrames☆196Updated 5 years ago
- PySpark phonetic and string matching algorithms☆39Updated last year
- Data Exploration in PySpark made easy - Pyspark_dist_explore provides methods to get fast insights in your Spark DataFrames.☆103Updated 5 years ago
- Distributed Bayesian Entity Resolution in Apache Spark☆57Updated 3 years ago
- Apache Spark (Scala, PySpark, SparkR) Code, Tricks, and References☆69Updated 6 years ago
- ☆106Updated 2 years ago
- Joblib Apache Spark Backend☆245Updated 3 weeks ago
- A simple introduction to using spark ml pipelines☆26Updated 7 years ago
- Tabular feature encoding pipelines for machine learning with options for string parsing, missing data infill, and stochastic perturbation…☆165Updated 2 months ago
- HandySpark - bringing pandas-like capabilities to Spark dataframes☆193Updated 5 years ago
- Data Quality Monitoring Tool☆16Updated 7 years ago
- Featureselection methods as Spark MLlib Pipelines☆30Updated 6 years ago
- A collaborative feature engineering system built on JupyterHub☆94Updated 6 years ago
- Workshop for Spark and Databricks☆54Updated 5 years ago
- Brand Propensity Model for Retailers☆11Updated 2 years ago
- ☆30Updated 2 years ago
- [ARCHIVED] Moved to github.com/NVIDIA/spark-xgboost-examples☆70Updated 4 years ago
- Tools for faster and optimized interaction with Teradata and large datasets.☆17Updated 6 years ago
- ☆25Updated 6 years ago
- Repository of sample Databricks notebooks☆258Updated last year
- ☆16Updated last year
- ☆54Updated 7 years ago
- HopsWorks - Hadoop for Humans☆117Updated 6 years ago
- MLflow samples - deprecated☆22Updated last year
- Examples for Deep Learning/Feature Store/Spark/Flink/Hive/Kafka jobs and Jupyter notebooks on Hops☆117Updated 2 years ago
- A machine learning testing framework for sklearn and pandas. The goal is to help folks assess whether things have changed over time.☆102Updated 3 years ago