Simple and Distributed Machine Learning
☆5,229 May 9, 2026 Updated 2 weeks ago
Alternatives and similar repositories for SynapseML
Users that are interested in SynapseML are comparing it to the libraries listed below.
- MLeap: Deploy ML Pipelines to Production ☆1,535 Mar 10, 2026 Updated 2 months ago
- TensorFlowOnSpark brings TensorFlow programs to Apache Spark clusters. ☆3,855 Jul 10, 2023 Updated 2 years ago
- State of the Art Natural Language Processing ☆4,129 May 16, 2026 Updated last week
- The open source AI engineering platform for agents, LLMs, and ML models. MLflow enables teams of all sizes to debug, evaluate, monitor, a… ☆26,072 Updated this week
- A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used … ☆18,355 May 17, 2026 Updated last week
- TransmogrifAI (pronounced trăns-mŏgˈrə-fī) is an AutoML library for building modular, reusable, strongly typed machine learning workflows… ☆2,275 Sep 29, 2023 Updated 2 years ago
- Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, DeepSeek, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V,… ☆8,805 Jan 28, 2026 Updated 3 months ago
- An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Tr… ☆8,809 May 16, 2026 Updated last week
- An open source python library for automated feature engineering ☆7,646 Feb 3, 2026 Updated 3 months ago
- Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet. ☆14,690 Dec 1, 2025 Updated 5 months ago
- Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on sing… ☆28,390 May 14, 2026 Updated last week
- Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet f… ☆1,889 Jan 2, 2026 Updated 4 months ago
- Sparkling Water provides H2O functionality inside Spark cluster ☆977 Nov 5, 2025 Updated 6 months ago
- Apache Spark - A unified analytics engine for large-scale data processing ☆43,311 Updated this week
- Best Practices on Recommendation Systems ☆21,713 Updated this week
- Breeze is/was a numerical processing library for Scala. ☆3,454 Oct 4, 2025 Updated 7 months ago
- Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads. ☆42,616 Updated this week
- An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model c… ☆14,356 Jul 3, 2024 Updated last year
- Fit interpretable models. Explain blackbox machine learning. ☆6,852 Updated this week
- The Open Source Feature Store for AI/ML ☆7,042 May 15, 2026 Updated last week
- A game theoretic approach to explain the output of any machine learning model. ☆25,431 May 16, 2026 Updated last week
- H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random F… ☆7,482 Updated this week
- Machine Learning Toolkit for Kubernetes ☆15,639 May 7, 2026 Updated 2 weeks ago
- BigDL: Distributed TensorFlow, Keras and PyTorch on Apache Spark/Flink & Ray ☆2,695 May 8, 2026 Updated 2 weeks ago
- A library for efficient similarity search and clustering of dense vectors. ☆40,061 May 15, 2026 Updated last week
- An MLOps framework to package, deploy, monitor and manage thousands of production machine learning models ☆4,748 Mar 23, 2026 Updated 2 months ago
- Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets. ☆3,617 May 14, 2026 Updated last week
- Parallel computing with task scheduling ☆13,834 May 14, 2026 Updated last week
- A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other ma… ☆8,952 Updated this week
- Build, Manage and Deploy AI/ML Systems ☆10,105 Updated this week
- A better notebook for Scala (and more) ☆4,595 Jan 27, 2026 Updated 3 months ago
- A Flexible and Powerful Parameter Server for large-scale machine learning ☆6,789 May 8, 2026 Updated 2 weeks ago
- Tool for producing high quality forecasts for time series data that has multiple seasonality with linear or non-linear growth. ☆20,188 May 8, 2026 Updated 2 weeks ago
- The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more! ☆8,654 May 7, 2026 Updated 2 weeks ago
- A fast library for AutoML and tuning. Join our Discord: https://discord.gg/Cppx2vSPVP. ☆4,349 May 17, 2026 Updated last week
- Low-code framework for building custom LLMs, neural networks, and other AI models ☆11,702 Updated this week
- REST job server for Apache Spark ☆2,843 Mar 3, 2026 Updated 2 months ago
- Hummingbird compiles trained ML models into tensor computation for faster inference. ☆3,536 Jul 17, 2025 Updated 10 months ago
- Jupyter magics and kernels for working with remote Spark clusters ☆1,361 Sep 9, 2025 Updated 8 months ago