apache/systemds

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/apache/systemds)

apache / systemds

An open source ML system for the end-to-end data science lifecycle

☆1,096

Alternatives and similar repositories for systemds

Users that are interested in systemds are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

apache / systemds-website
View on GitHub
Apache SystemDS Website
☆21May 15, 2026Updated 2 months ago
apache / bahir-website
View on GitHub
Mirror of Apache Bahir Website
☆19Apr 25, 2025Updated last year
apache / bahir
View on GitHub
Mirror of Apache Bahir
☆336Jul 7, 2023Updated 3 years ago
amplab / keystone
View on GitHub
Simplifying robust end-to-end machine learning on Apache Spark.
☆473Apr 18, 2017Updated 9 years ago
h2oai / sparkling-water
View on GitHub
Sparkling Water provides H2O functionality inside Spark cluster
☆979Nov 5, 2025Updated 8 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
apache / incubator-samoa
View on GitHub
Mirror of Apache Samoa (Incubating)
☆251May 15, 2026Updated 2 months ago
yahoo / CaffeOnSpark
View on GitHub
Distributed deep learning on Hadoop and Spark clusters.
☆1,261Nov 15, 2019Updated 6 years ago
microsoft / DMTK
View on GitHub
Microsoft Distributed Machine Learning Toolkit
☆2,739Sep 12, 2018Updated 7 years ago
apache / incubator-toree
View on GitHub
Mirror of Apache Toree (Incubating)
☆751Updated this week
emmalanguage / emma
View on GitHub
A quotation-based Scala DSL for scalable data analysis.
☆65Jul 7, 2022Updated 4 years ago
apache / predictionio
View on GitHub
PredictionIO, a machine learning server for developers and ML engineers.
☆12,521Jan 9, 2021Updated 5 years ago
linkedin / photon-ml
View on GitHub
A scalable machine learning library on Apache Spark
☆797Aug 30, 2021Updated 4 years ago
amplab / SparkNet
View on GitHub
Distributed Neural Networks for Spark
☆609Jul 23, 2020Updated 5 years ago
OryxProject / oryx
View on GitHub
Oryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning
☆1,783Aug 16, 2021Updated 4 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
yahoo / TensorFlowOnSpark
View on GitHub
TensorFlowOnSpark brings TensorFlow programs to Apache Spark clusters.
☆3,846Jul 10, 2023Updated 3 years ago
deeplearning4j / deeplearning4j
View on GitHub
Suite of tools for deploying and training deep learning models using the JVM. Highlights include model import for keras, tensorflow, and …
☆14,243Updated this week
databricks / tensorframes
View on GitHub
[DEPRECATED] Tensorflow wrapper for DataFrames on Apache Spark
☆744Jul 30, 2024Updated last year
apache / zeppelin
View on GitHub
Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.
☆6,644Updated this week
CODAIT / redrock
View on GitHub
RedRock - Mobile Application prototype using Apache Spark, Twitter and Elasticsearch
☆15Sep 10, 2018Updated 7 years ago
intel / ipex-llm
View on GitHub
Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, DeepSeek, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V,…
☆8,864Jan 28, 2026Updated 5 months ago
sryza / spark-timeseries
View on GitHub
A library for time series analysis on Apache Spark
☆1,197Oct 13, 2020Updated 5 years ago
CODAIT / r4ml
View on GitHub
Scalable R for Machine Learning
☆43Sep 11, 2018Updated 7 years ago
airbnb / aerosolve
View on GitHub
A machine learning package built for humans.
☆4,809Nov 6, 2025Updated 8 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
combust / mleap
View on GitHub
MLeap: Deploy ML Pipelines to Production
☆1,539Jul 10, 2026Updated last week
h2oai / h2o-3
View on GitHub
H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random F…
☆7,500Updated this week
TrueCar / mleap
View on GitHub
MLeap allows for easily putting Spark ML pipelines into production
☆78Oct 27, 2016Updated 9 years ago
huawei-noah / streamDM
View on GitHub
Stream Data Mining Library for Spark Streaming
☆497Apr 16, 2023Updated 3 years ago
databricks / spark-deep-learning
View on GitHub
Deep Learning Pipelines for Apache Spark
☆1,989Mar 30, 2023Updated 3 years ago
spark-notebook / spark-notebook
View on GitHub
Interactive and Reactive Data Science using Scala and Spark.
☆3,142May 16, 2023Updated 3 years ago
apache / gobblin
View on GitHub
A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, orga…
☆2,270Jun 24, 2026Updated 3 weeks ago
apache / spark
View on GitHub
Apache Spark - A unified analytics engine for large-scale data processing
☆43,658Updated this week
scalanlp / breeze
View on GitHub
Breeze is/was a numerical processing library for Scala.
☆3,454Oct 4, 2025Updated 9 months ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
cloudera / livy
View on GitHub
Livy is an open source REST interface for interacting with Apache Spark from anywhere
☆1,008Oct 5, 2022Updated 3 years ago
apache / mahout
View on GitHub
Apache Mahout - an environment for quickly creating scalable, performant machine learning applications.
☆2,298Updated this week
apache / incubator-heron
View on GitHub
Apache Heron (Incubating) is a realtime, distributed, fault-tolerant stream processing engine from Twitter
☆3,629Mar 1, 2023Updated 3 years ago
apache / datasketches-java
View on GitHub
A software library of stochastic streaming algorithms, a.k.a. sketches.
☆957Updated this week
databricks / spark-corenlp
View on GitHub
Stanford CoreNLP wrapper for Apache Spark
☆419Nov 15, 2018Updated 7 years ago
TIBCOSoftware / snappydata
View on GitHub
Project SnappyData - memory optimized analytics database, based on Apache Spark™ and Apache Geode™. Stream, Transact, Analyze, Predict in…
☆1,032Nov 21, 2022Updated 3 years ago
IBM / MAX-Spatial-Transformer-Network
View on GitHub
Train a neural network component that can add spatial transformations such as translation and rotation to larger models.
☆10Apr 18, 2019Updated 7 years ago