Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python)
☆73 Nov 9, 2023 Updated 2 years ago
Alternatives and similar repositories for sparklingml
Users who are interested in sparklingml are comparing it to the libraries listed below.
- A simple introduction to using spark ml pipelines ☆25 Apr 5, 2018 Updated 8 years ago
- A library for exporting Spark ML models and pipelines to PFA ☆55 Nov 21, 2018 Updated 7 years ago
- Examples for the Activate conference ☆11 Sep 11, 2019 Updated 6 years ago
- A small java library for NLP Interchange Format (NIF) for NER(D) systems ☆10 Sep 13, 2022 Updated 3 years ago
- Source code for the Style Similarity project: measure style similarity between 3D shapes. ☆14 Mar 10, 2020 Updated 6 years ago
- A way to run UIMA pipelines on Apache Spark ☆10 Jul 19, 2021 Updated 4 years ago
- Examples of spark-lucenerdd ☆15 Oct 6, 2023 Updated 2 years ago
- Feature selection methods as Spark MLlib Pipelines ☆31 Apr 29, 2018 Updated 7 years ago
- A set of widgets for Python's Orange Machine Learning to work with Apache Spark ML ☆15 Dec 24, 2016 Updated 9 years ago
- Text Mining in Python ☆22 Nov 23, 2016 Updated 9 years ago
- ☆20 Apr 25, 2021 Updated 4 years ago
- ☆21 May 5, 2016 Updated 9 years ago
- Exploiting SNP correlations within Random Forest for Genome-Wide Association Studies ☆13 Oct 20, 2014 Updated 11 years ago
- OccJava - A SWIG-generated Java wrapper for OpenCascade ☆19 Mar 22, 2016 Updated 10 years ago
- Scala Algebra System ☆17 Feb 27, 2026 Updated last month
- Some starter code for training/testing some basic CNN models given our data. ☆10 Feb 15, 2017 Updated 9 years ago
- Java library to read and write PLY files. ☆24 Apr 8, 2021 Updated 5 years ago
- Spark Extension : ML transformers, SQL aggregations, etc that are missing in Apache Spark ☆146 Jan 26, 2016 Updated 10 years ago
- Spark RDD with Lucene's query and entity linkage capabilities ☆130 Sep 8, 2025 Updated 7 months ago
- Generate English-language text similar to the news articles in the One Billion Words data set. ☆26 Sep 17, 2025 Updated 7 months ago
- MLeap: Deploy ML Pipelines to Production ☆1,536 Mar 10, 2026 Updated last month
- Dockerfiles maintained by Trivadis Platform Factory ☆12 Mar 13, 2020 Updated 6 years ago
- Demo Spark application to transform data gathered on sensors for a heatmap application ☆33 May 29, 2017 Updated 8 years ago
- Analytic UIMA pipelines using Spark ☆24 Nov 27, 2015 Updated 10 years ago
- A tool for running Spark on Google Compute Engine ☆16 Jan 20, 2017 Updated 9 years ago
- A tool to get better debug info on spark's memory usage ☆42 Aug 21, 2019 Updated 6 years ago
- ☆14 Aug 24, 2021 Updated 4 years ago
- Train a neural network component that can add spatial transformations such as translation and rotation to larger models. ☆10 Apr 18, 2019 Updated 7 years ago
- The Ocean Tensor Package provides a comprehensive set of tensor operations for CPU and GPU. The functions are available directly as a C l… ☆24 Jul 1, 2019 Updated 6 years ago
- An example of running Apache Spark using Scala in ipython notebook ☆140 Aug 31, 2015 Updated 10 years ago
- Python bindings for Apache Tika ☆24 Aug 20, 2020 Updated 5 years ago
- SANSA Machine Learning Layer ☆39 Oct 8, 2020 Updated 5 years ago
- A Time Series Library for Apache Spark ☆1,024 Jul 3, 2020 Updated 5 years ago
- Modern Nipype Workflow Management based on Lyman ☆17 Aug 3, 2022 Updated 3 years ago
- Files for the Karma tutorial at TCDL, Texas Conference on Digital Libraries ☆29 Apr 17, 2016 Updated 10 years ago
- ☆22 Jul 3, 2015 Updated 10 years ago
- Lighthouse is a library for data lakes built on top of Apache Spark. It provides high-level APIs in Scala to streamline data pipelines an… ☆62 Sep 6, 2024 Updated last year
- ☆10 Jul 6, 2018 Updated 7 years ago
- A simple REST-server around ctakes clinical pipeline. ☆30 Feb 10, 2017 Updated 9 years ago