ChuckWoodraska / EurekaTrees
Visualizes the Random Forest debug string from the MLLib in Spark using D3.js
β37Updated 2 years ago
Alternatives and similar repositories for EurekaTrees:
Users that are interested in EurekaTrees are comparing it to the libraries listed below
- π² Decision Tree Visualization for Apache Sparkβ50Updated 5 years ago
- A curated inventory of machine learning methods available on the Apache Spark platform, both in official and third party libraries.β65Updated 7 years ago
- Spark 2.0 Scala Machine Learning examplesβ77Updated 5 years ago
- Apache Spark (Scala, PySpark, SparkR) Code, Tricks, and Referencesβ69Updated 6 years ago
- Data Exploration in PySpark made easy - Pyspark_dist_explore provides methods to get fast insights in your Spark DataFrames.β103Updated 5 years ago
- Spark 2.0 Python Machine Learning examplesβ97Updated 5 years ago
- Supporting content (slides and exercises) for the Addison-Wesley (Pearson) video series covering best practices for developing scalable Sβ¦β66Updated 9 years ago
- PySpark Machine Learning Examplesβ44Updated 6 years ago
- MLFlow Spark Summit 2019 Presentationβ67Updated 5 years ago
- MLflow samples - deprecatedβ22Updated last year
- The Synthetic Minority Oversampling Technique (SMOTE) implemented in Spark.β49Updated 6 years ago
- My machine learning model for the See Click Predict Fix Kaggle competitionβ31Updated 7 years ago
- A simple introduction to using spark ml pipelinesβ26Updated 6 years ago
- DBSCAN implementation using Apache Sparkβ48Updated 7 years ago
- NOTE: skutil is now deprecated. See its sister project: https://github.com/tgsmith61591/skoot. Original description: A set of scikit-learβ¦β30Updated 6 years ago
- An Apache Spark-shell backend for IPythonβ105Updated 3 years ago
- Python library for converting Apache Spark ML pipelines to PMMLβ96Updated this week
- Bosch Kaggle competion: Reduce manufacturing failures (https://www.kaggle.com/c/bosch-production-line-performance)β24Updated 8 years ago
- Oracle Data Science Bootcamp 2014β25Updated 9 years ago
- β77Updated 8 years ago
- Installation guide for Apache Spark + Hadoop on Mac/Linuxβ59Updated 7 years ago
- Updated repositoryβ157Updated 3 years ago
- Spark Extension : ML transformers, SQL aggregations, etc that are missing in Apache Sparkβ147Updated 9 years ago
- Training materials for Strata, AMP Camp, etcβ150Updated 9 years ago
- Create HTML profiling reports from Apache Spark DataFramesβ195Updated 5 years ago
- PMML evaluator library for the Apache Spark cluster computing system (http://spark.apache.org/)β94Updated 2 years ago
- Learn the pyspark API through pictures and simple examplesβ170Updated 4 years ago
- A simple tool for plotting Spark ML's Decision Treesβ41Updated 3 years ago
- Example unit tests for Apache Spark Python scripts using the py.test frameworkβ84Updated 8 years ago
- Source material for Data Science for Telecom Tutorial at Strata Singapore 2015β102Updated 8 years ago