logicalclocks / hops-util-py
Utility Library for Hopsworks. Issues can be posted at https://community.hopsworks.ai
☆27Updated 3 months ago
Related projects: ⓘ
- Distribution transparent Machine Learning experiments on Apache Spark☆89Updated 6 months ago
- Examples for Deep Learning/Feature Store/Spark/Flink/Hive/Kafka jobs and Jupyter notebooks on Hops☆117Updated last year
- Python - Java/Scala API for the Hopsworks feature store☆53Updated last week
- Documentation for Hopsworks and Hops☆11Updated 2 years ago
- Toolkit for Apache Spark ML for Feature clean-up, feature Importance calculation suite, Information Gain selection, Distributed SMOTE, Mo…☆191Updated 3 years ago
- Asynchronous actions for PySpark☆44Updated 2 years ago
- Comet-For-MLFlow Extension☆55Updated 3 months ago
- [ARCHIVED] Moved to github.com/NVIDIA/spark-xgboost-examples☆70Updated 4 years ago
- A series of workshop modules introducing Feast feature store.☆19Updated 2 years ago
- Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python)☆74Updated 10 months ago
- Projects developed by Domino's R&D team☆76Updated 2 years ago
- XGBoost GPU accelerated on Spark example applications☆51Updated 2 years ago
- ☆30Updated 2 years ago
- PySpark phonetic and string matching algorithms☆35Updated 7 months ago
- This is a collection of MLflow examples that you can directly run with mlflow command☆30Updated 4 years ago
- Scaling Python Machine Learning☆44Updated last year
- Deploy dask on YARN clusters☆69Updated last month
- real-time data + ML pipeline☆54Updated this week
- Tools for faster and optimized interaction with Teradata and large datasets.☆17Updated 6 years ago
- Accelerator to rapidly deploy customized features for your business☆55Updated 9 months ago
- A library for exporting Spark ML models and pipelines to PFA☆54Updated 5 years ago
- Tools for MLflow☆36Updated 7 months ago
- A Scalable Auto-ML System☆51Updated last year
- Monitor Apache Spark from Jupyter Notebook☆172Updated 2 years ago
- Data Exploration in PySpark made easy - Pyspark_dist_explore provides methods to get fast insights in your Spark DataFrames.☆100Updated 5 years ago
- ☆33Updated 4 years ago
- ElasticSearch implementation of MlFlow tracking store☆16Updated 3 years ago
- A library on top of either pex or conda-pack to make your Python code easily available on a cluster☆45Updated this week
- A simple tool for plotting Spark ML's Decision Trees☆41Updated 2 years ago
- Willump Is a Low-Latency Useful Machine learning Platform.☆43Updated last year