clarifyhealth / spark_feature_selectorLinks
☆12Updated last year
Alternatives and similar repositories for spark_feature_selector
Users that are interested in spark_feature_selector are comparing it to the libraries listed below
Sorting:
- HandySpark - bringing pandas-like capabilities to Spark dataframes☆196Updated 6 years ago
- Create HTML profiling reports from Apache Spark DataFrames☆198Updated 5 years ago
- Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python)☆74Updated last year
- Spark implementation of computing Shapley Values using monte-carlo approximation☆76Updated 2 years ago
- Data Exploration in PySpark made easy - Pyspark_dist_explore provides methods to get fast insights in your Spark DataFrames.☆102Updated 6 years ago
- ☆107Updated 2 years ago
- Implementations of the Portable Format for Analytics (PFA)☆127Updated 2 years ago
- Examples for Deep Learning/Feature Store/Spark/Flink/Hive/Kafka jobs and Jupyter notebooks on Hops☆118Updated 2 years ago
- [ARCHIVED] Moved to github.com/NVIDIA/spark-xgboost-examples☆71Updated 5 years ago
- Resources for Data Science Process management☆204Updated 5 years ago
- HopsWorks - Hadoop for Humans☆117Updated 6 years ago
- Learn the pyspark API through pictures and simple examples☆170Updated 4 years ago
- ☆96Updated 5 years ago
- Deploy AutoML as a service using Flask☆226Updated 8 years ago
- Easy to use library to bring Tensorflow on Apache Spark☆296Updated 2 years ago
- PySpark phonetic and string matching algorithms☆39Updated last year
- A distributed Spark/Scala implementation of the isolation forest algorithm for unsupervised outlier detection, featuring support for scal…☆249Updated last month
- Joblib Apache Spark Backend☆249Updated 6 months ago
- Featureselection methods as Spark MLlib Pipelines☆31Updated 7 years ago
- MLOps Platform☆272Updated 11 months ago
- A collaborative feature engineering system built on JupyterHub☆94Updated 6 years ago
- Utility Library for Hopsworks. Issues can be posted at https://community.hopsworks.ai☆27Updated last year
- Python client library for the Openscoring REST web service☆32Updated 3 years ago
- Anomaly Detection model uses Spark for training and Spark Streaming for testing☆67Updated 9 years ago
- Apache Spark (Scala, PySpark, SparkR) Code, Tricks, and References☆69Updated 6 years ago
- A library for exporting Spark ML models and pipelines to PFA☆54Updated 6 years ago
- Tools for faster and optimized interaction with Teradata and large datasets.☆17Updated 7 years ago
- Apache Marvin-AI☆100Updated 2 years ago
- python automatic data quality check toolkit☆282Updated 5 years ago
- Spark Implementation of Google Facets Overview https://github.com/PAIR-code/facets☆55Updated 2 years ago