Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python)
☆74Nov 9, 2023Updated 2 years ago
Alternatives and similar repositories for sparklingml
Users that are interested in sparklingml are comparing it to the libraries listed below
Sorting:
- A library for exporting Spark ML models and pipelines to PFA☆55Nov 21, 2018Updated 7 years ago
- A small java library for NLP Interchange Format (NIF) for NER(D) systems☆10Sep 13, 2022Updated 3 years ago
- A tool to get better debug info on spark's memory usage☆42Aug 21, 2019Updated 6 years ago
- ☆11Apr 24, 2018Updated 7 years ago
- Active learning of GP hyperparameters following Garnett, et al., "Active Learning of Linear Embeddings for Gaussian Processes," (UAI 2014…☆16Aug 4, 2017Updated 8 years ago
- Examples for the Activate conference☆11Sep 11, 2019Updated 6 years ago
- Some examples to demonstrate using the threejs framework from JSweet.☆11Dec 10, 2019Updated 6 years ago
- ML Featurizer is a library to enable users to create additional features from raw data with ease☆14Apr 8, 2024Updated last year
- Resources for 3D Deep Learning☆12Sep 7, 2017Updated 8 years ago
- Engineering Drawing Parser☆10Jan 24, 2019Updated 7 years ago
- Graph Challenge☆33Aug 19, 2019Updated 6 years ago
- Featureselection methods as Spark MLlib Pipelines☆31Apr 29, 2018Updated 7 years ago
- Yet Another Spark SQL JDBC/ODBC server based on the PostgreSQL V3 protocol☆34Sep 8, 2022Updated 3 years ago
- Demo Spark application to transform data gathered on sensors for a heatmap application☆33May 29, 2017Updated 8 years ago
- Similarity encoding of dirty categorical variables (strings)☆20Jan 22, 2019Updated 7 years ago
- Functional Language extending the enriched effect calculus. Linear usage of effects. Implemented with Scala 3.☆21Aug 5, 2025Updated 7 months ago
- Profiling Spark Applications for Performance Comparison and Diagnosis☆17Nov 11, 2018Updated 7 years ago
- Samples for the Obj library☆15Feb 12, 2018Updated 8 years ago
- [ARCHIVED] Moved to github.com/NVIDIA/spark-xgboost-examples☆72Jul 15, 2020Updated 5 years ago
- Scripts for building Cloudera Manager parcel and CSD for Livy Spark Server☆21Oct 18, 2017Updated 8 years ago
- Spark Extension : ML transformers, SQL aggregations, etc that are missing in Apache Spark☆146Jan 26, 2016Updated 10 years ago
- A Time Series Library for Apache Spark☆1,022Jul 3, 2020Updated 5 years ago
- SANSA Machine Learning Layer☆39Oct 8, 2020Updated 5 years ago
- Helm chart: single-node, pseudo-distributed, kerberized, hadoop cluster: K8S☆20Feb 14, 2018Updated 8 years ago
- Relation Schema Induction using SICTF☆16Sep 20, 2018Updated 7 years ago
- spark structured streaming via HTTP communication☆18Jul 7, 2022Updated 3 years ago
- An example of running Apache Spark using Scala in ipython notebook☆140Aug 31, 2015Updated 10 years ago
- Markdown String Interpolator for Scala 3☆19Jan 18, 2022Updated 4 years ago
- Library for functional graph & geometry algorithms☆21Apr 22, 2019Updated 6 years ago
- A modern networking framework based on ucx for Java 19+☆27Nov 23, 2023Updated 2 years ago
- Simplifying robust end-to-end machine learning on Apache Spark.☆475Apr 18, 2017Updated 8 years ago
- MTP - My Test Project - example Java applications with Spring Boot, Apache Spark Streaming, Apache Cassandra, Apache Kafka, Akka, Angular…☆20Jan 1, 2017Updated 9 years ago
- Real-time query spark and visualise it as graph.☆24Oct 4, 2017Updated 8 years ago
- Minimal Lucene app with custom tokenization and analysis☆19Apr 15, 2015Updated 10 years ago
- Simple example of Solr Block Joins between Parents and Children, implemented in SolrJ☆22Jul 2, 2014Updated 11 years ago
- Spark RDD with Lucene's query and entity linkage capabilities☆128Sep 8, 2025Updated 6 months ago
- GBM multicore scaling: h2o, xgboost and lightgbm on multicore and multi-socket systems☆20May 13, 2018Updated 7 years ago
- JavaScript Programming Language for Solid Modeling☆47Aug 28, 2014Updated 11 years ago
- Live-updating Spark UI built with Meteor☆190Apr 6, 2021Updated 4 years ago