sparklingpandas/sparklingml

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/sparklingpandas/sparklingml)

sparklingpandas / sparklingml

Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python)

☆73

Alternatives and similar repositories for sparklingml

Users that are interested in sparklingml are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

holdenk / spark-intro-ml-pipeline-workshop
View on GitHub
A simple introduction to using spark ml pipelines
☆25Apr 5, 2018Updated 8 years ago
CODAIT / aardpfark
View on GitHub
A library for exporting Spark ML models and pipelines to PFA
☆55Nov 21, 2018Updated 7 years ago
pcodding / hadoop_ctakes
View on GitHub
Hadoop integration code for working with with Apache cTAKES
☆10Feb 11, 2014Updated 12 years ago
cincheo / jsweet-examples-threejs
View on GitHub
Some examples to demonstrate using the threejs framework from JSweet.
☆11Dec 10, 2019Updated 6 years ago
NLP2RDF / NIF-lib
View on GitHub
A small java library for NLP Interchange Format (NIF) for NER(D) systems
☆10Sep 13, 2022Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Benjamin-Hu / Engineering-Drawing-Parser
View on GitHub
Engineering Drawing Parser
☆13Jan 24, 2019Updated 7 years ago
adobe / ml-featurizer
View on GitHub
ML Featurizer is a library to enable users to create additional features from raw data with ease
☆15Apr 8, 2024Updated 2 years ago
minhptx / iswc-2016-semantic-labeling
View on GitHub
☆11Apr 24, 2018Updated 8 years ago
SteveJunGao / 3D_DeepLearning_Resources
View on GitHub
Resources for 3D Deep Learning
☆12Sep 7, 2017Updated 8 years ago
aphp / UimaOnSpark
View on GitHub
Way to run Uima Pipelines on Apache Spark
☆10Jul 19, 2021Updated 5 years ago
zouzias / spark-lucenerdd-examples
View on GitHub
Examples of spark-lucenerdd
☆15Oct 6, 2023Updated 2 years ago
javagl / ObjSamples
View on GitHub
Samples for the Obj library
☆15Feb 12, 2018Updated 8 years ago
MarcKaminski / spark-FeatureSelection
View on GitHub
Featureselection methods as Spark MLlib Pipelines
☆30Apr 29, 2018Updated 8 years ago
databricks / tensorframes
View on GitHub
[DEPRECATED] Tensorflow wrapper for DataFrames on Apache Spark
☆744Jul 30, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
jamartinh / Orange3-Spark
View on GitHub
A set of widgets for Python's Orange Machine Learning to work with Apache Spark ML
☆15Dec 24, 2016Updated 9 years ago
OntoBREP / occjava
View on GitHub
OccJava - A SWIG-generated Java wrapper for OpenCascade
☆19Mar 22, 2016Updated 10 years ago
rjolly / scas
View on GitHub
Scala Algebra System
☆17Jul 23, 2026Updated last week
oaqa / suim
View on GitHub
Analytic UIMA pipelines using Spark
☆24Nov 27, 2015Updated 10 years ago
zouzias / spark-lucenerdd
View on GitHub
Spark RDD with Lucene's query and entity linkage capabilities
☆129Jun 23, 2026Updated last month
combust / mleap
View on GitHub
MLeap: Deploy ML Pipelines to Production
☆1,539Jul 21, 2026Updated last week
collectivemedia / spark-ext
View on GitHub
Spark Extension : ML transformers, SQL aggregations, etc that are missing in Apache Spark
☆145Jan 26, 2016Updated 10 years ago
oreillymedia / Learning-Path-Get-Started-with-Natural-Language-Processing-Using-Python-Spark-and-Scala
View on GitHub
Links to example code downloads for Learning Path: Get Started with Natural Language Processing Using Python, Spark, and Scala
☆16Feb 23, 2017Updated 9 years ago
Kami / python-file-syncer
View on GitHub
Python program which synchronizes files from a local directory to one of the cloud object storage providers supported by Libcloud and vic…
☆19Sep 1, 2014Updated 11 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
squito / spark-memory
View on GitHub
A tool to get better debug info on spark's memory usage
☆42Aug 21, 2019Updated 6 years ago
fedelemantuano / tika-app-python
View on GitHub
Python bindings for Apache Tika
☆24Aug 20, 2020Updated 5 years ago
IBM / MAX-Spatial-Transformer-Network
View on GitHub
Train a neural network component that can add spatial transformations such as translation and rotation to larger models.
☆10Apr 18, 2019Updated 7 years ago
SANSA-Stack / Archived-SANSA-ML
View on GitHub
SANSA Machine Learning Layer
☆39Oct 8, 2020Updated 5 years ago
tresata / spark-columnar
View on GitHub
☆15Mar 4, 2015Updated 11 years ago
hohonuuli / sparknotebook
View on GitHub
An example of running Apache Spark using Scala in ipython notebook
☆141Aug 31, 2015Updated 10 years ago
graphchallenge / GraphChallenge
View on GitHub
Graph Challenge
☆33Aug 19, 2019Updated 6 years ago
lucidworks / solrj-nested-docs
View on GitHub
Simple example of Solr Block Joins between Parents and Children, implemented in SolrJ
☆22Jul 2, 2014Updated 12 years ago
IBM / ocean-tensor-package
View on GitHub
The Ocean Tensor Package provides a comprehensive set of tensor operations for CPU and GPU. The functions are available directly as a C l…
☆24Jul 1, 2019Updated 7 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
odinsbane / least-squares-in-java
View on GitHub
Java Least Squares fitting library.
☆26Aug 19, 2025Updated 11 months ago
kastman / fitz
View on GitHub
Modern Nipype Workflow Management based on Lyman
☆17Updated this week
lucidworks / solr-for-datascience
View on GitHub
☆24Oct 19, 2015Updated 10 years ago
twosigma / flint
View on GitHub
A Time Series Library for Apache Spark
☆1,173Jul 3, 2020Updated 6 years ago
smurn / jPLY
View on GitHub
Java library to read and write PLY files.
☆25Apr 8, 2021Updated 5 years ago
AlpineNow / SparkML2
View on GitHub
☆21May 5, 2016Updated 10 years ago
datamindedbe / lighthouse
View on GitHub
Lighthouse is a library for data lakes built on top of Apache Spark. It provides high-level APIs in Scala to streamline data pipelines an…
☆64Sep 6, 2024Updated last year