OryxProject/oryx

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/OryxProject/oryx)

OryxProject / oryx

Oryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning

☆1,783

Alternatives and similar repositories for oryx

Users that are interested in oryx are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

apache / predictionio
View on GitHub
PredictionIO, a machine learning server for developers and ML engineers.
☆12,521Jan 9, 2021Updated 5 years ago
spark-jobserver / spark-jobserver
View on GitHub
REST job server for Apache Spark
☆2,838Mar 3, 2026Updated 4 months ago
h2oai / sparkling-water
View on GitHub
Sparkling Water provides H2O functionality inside Spark cluster
☆979Nov 5, 2025Updated 8 months ago
SeldonIO / seldon-server
View on GitHub
Machine Learning Platform and Recommendation Engine built on Kubernetes
☆1,478Apr 12, 2020Updated 6 years ago
amplab / keystone
View on GitHub
Simplifying robust end-to-end machine learning on Apache Spark.
☆473Apr 18, 2017Updated 9 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
apache / gobblin
View on GitHub
A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, orga…
☆2,270Jun 24, 2026Updated 3 weeks ago
sryza / spark-timeseries
View on GitHub
A library for time series analysis on Apache Spark
☆1,197Oct 13, 2020Updated 5 years ago
sryza / aas
View on GitHub
Code to accompany Advanced Analytics with Spark from O'Reilly Media
☆1,523Sep 25, 2024Updated last year
filodb / FiloDB
View on GitHub
Distributed Prometheus time series database
☆1,468Updated this week
spark-notebook / spark-notebook
View on GitHub
Interactive and Reactive Data Science using Scala and Spark.
☆3,142May 16, 2023Updated 3 years ago
apache / mahout
View on GitHub
Apache Mahout - an environment for quickly creating scalable, performant machine learning applications.
☆2,298Updated this week
cloudera / livy
View on GitHub
Livy is an open source REST interface for interacting with Apache Spark from anywhere
☆1,008Oct 5, 2022Updated 3 years ago
Stratio / sparta
View on GitHub
Real Time Analytics and Data Pipelines based on Spark Streaming
☆530Oct 24, 2019Updated 6 years ago
apache / zeppelin
View on GitHub
Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.
☆6,644Updated this week
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
linkedin / photon-ml
View on GitHub
A scalable machine learning library on Apache Spark
☆797Aug 30, 2021Updated 4 years ago
actionml / universal-recommender
View on GitHub
Highly configurable recommender based on PredictionIO and Mahout's Correlated Cross-Occurrence algorithm
☆673Jun 18, 2019Updated 7 years ago
openscoring / openscoring
View on GitHub
REST web service for the true real-time scoring (<1 ms) of Scikit-Learn, R and Apache Spark models
☆588Feb 2, 2026Updated 5 months ago
airbnb / aerosolve
View on GitHub
A machine learning package built for humans.
☆4,809Nov 6, 2025Updated 8 months ago
deeplearning4j / deeplearning4j
View on GitHub
Suite of tools for deploying and training deep learning models using the JVM. Highlights include model import for keras, tensorflow, and …
☆14,243Updated this week
krasserm / akka-analytics
View on GitHub
Large-scale event processing with Akka Persistence and Apache Spark
☆271Jun 18, 2016Updated 10 years ago
databricks / spark-avro
View on GitHub
Avro Data Source for Apache Spark
☆537Dec 19, 2018Updated 7 years ago
myrrix / myrrix-recommender
View on GitHub
Stand-alone recommender system from Myrrix
☆111Dec 17, 2023Updated 2 years ago
yahoo / TensorFlowOnSpark
View on GitHub
TensorFlowOnSpark brings TensorFlow programs to Apache Spark clusters.
☆3,846Jul 10, 2023Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
apache / incubator-heron
View on GitHub
Apache Heron (Incubating) is a realtime, distributed, fault-tolerant stream processing engine from Twitter
☆3,629Mar 1, 2023Updated 3 years ago
TIBCOSoftware / snappydata
View on GitHub
Project SnappyData - memory optimized analytics database, based on Apache Spark™ and Apache Geode™. Stream, Transact, Analyze, Predict in…
☆1,032Nov 21, 2022Updated 3 years ago
scalanlp / breeze
View on GitHub
Breeze is/was a numerical processing library for Scala.
☆3,454Oct 4, 2025Updated 9 months ago
databricks / tensorframes
View on GitHub
[DEPRECATED] Tensorflow wrapper for DataFrames on Apache Spark
☆744Jul 30, 2024Updated last year
h2oai / h2o-3
View on GitHub
H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random F…
☆7,500Updated this week
twitter / scalding
View on GitHub
A Scala API for Cascading
☆3,522May 28, 2023Updated 3 years ago
h2oai / h2o-2
View on GitHub
Please visit https://github.com/h2oai/h2o-3 for latest H2O
☆2,254Oct 24, 2024Updated last year
lenskit / lenskit
View on GitHub
LensKit recommender toolkit.
☆972Aug 23, 2021Updated 4 years ago
databricks / spark-corenlp
View on GitHub
Stanford CoreNLP wrapper for Apache Spark
☆419Nov 15, 2018Updated 7 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
intel / ipex-llm
View on GitHub
Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, DeepSeek, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V,…
☆8,864Jan 28, 2026Updated 5 months ago
apache / incubator-toree
View on GitHub
Mirror of Apache Toree (Incubating)
☆751Updated this week
twitter / summingbird
View on GitHub
Streaming MapReduce with Scalding and Storm
☆2,123Jan 19, 2022Updated 4 years ago
databricks / spark-deep-learning
View on GitHub
Deep Learning Pipelines for Apache Spark
☆1,989Mar 30, 2023Updated 3 years ago
amplab / SparkNet
View on GitHub
Distributed Neural Networks for Spark
☆609Jul 23, 2020Updated 5 years ago
apache / druid
View on GitHub
Apache Druid: a high performance real-time analytics database.
☆14,033Updated this week
yahoo / CaffeOnSpark
View on GitHub
Distributed deep learning on Hadoop and Spark clusters.
☆1,261Nov 15, 2019Updated 6 years ago