Sparkling Water provides H2O functionality inside Spark cluster
☆977Nov 5, 2025Updated 4 months ago
Alternatives and similar repositories for sparkling-water
Users that are interested in sparkling-water are comparing it to the libraries listed below
Sorting:
- H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random F…☆7,506Updated this week
- DEPRECATED! Use https://github.com/h2oai/sparkling-water repository! H2O and Spark interoperability based on Tachyon.☆44Nov 25, 2014Updated 11 years ago
- Deep Learning in H2O using Native GPU Backends☆282Feb 20, 2018Updated 8 years ago
- Simplifying robust end-to-end machine learning on Apache Spark.☆475Apr 18, 2017Updated 8 years ago
- Tutorials and training material for the H2O Machine Learning Platform☆1,502Oct 24, 2024Updated last year
- Web based interactive computing environment for H2O☆143Oct 24, 2024Updated last year
- Interactive and Reactive Data Science using Scala and Spark.☆3,151May 16, 2023Updated 2 years ago
- A library for time series analysis on Apache Spark☆1,196Oct 13, 2020Updated 5 years ago
- DEPRECATED Build, manage and deploy H2O's high-speed machine learning models.☆61Apr 6, 2019Updated 6 years ago
- Oryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning☆1,783Aug 16, 2021Updated 4 years ago
- Distributed Neural Networks for Spark☆611Jul 23, 2020Updated 5 years ago
- Mirror of Apache Toree (Incubating)☆749Feb 27, 2026Updated last week
- Spark Extension : ML transformers, SQL aggregations, etc that are missing in Apache Spark☆146Jan 26, 2016Updated 10 years ago
- training material☆47Oct 24, 2024Updated last year
- H2Oai GPU Edition☆467Oct 24, 2024Updated last year
- Distributed deep learning on Hadoop and Spark clusters.☆1,262Nov 15, 2019Updated 6 years ago
- TensorFlowOnSpark brings TensorFlow programs to Apache Spark clusters.☆3,859Jul 10, 2023Updated 2 years ago
- REST job server for Apache Spark☆2,843Jul 8, 2025Updated 8 months ago
- Project SnappyData - memory optimized analytics database, based on Apache Spark™ and Apache Geode™. Stream, Transact, Analyze, Predict in…☆1,037Nov 21, 2022Updated 3 years ago
- Templates for projects based on top of H2O.☆38Mar 17, 2025Updated 11 months ago
- A scalable machine learning library on Apache Spark☆796Aug 30, 2021Updated 4 years ago
- Livy is an open source REST interface for interacting with Apache Spark from anywhere☆1,007Oct 5, 2022Updated 3 years ago
- MLeap: Deploy ML Pipelines to Production☆1,536Jan 12, 2026Updated last month
- RSparkling: Use H2O Sparkling Water from R (Spark + R + Machine Learning)☆62Jul 19, 2018Updated 7 years ago
- Presentations from H2O meetups & conferences by the H2O.ai team☆408Oct 29, 2025Updated 4 months ago
- Drizzle integration with Apache Spark☆120Sep 11, 2018Updated 7 years ago
- An open source ML system for the end-to-end data science lifecycle☆1,079Mar 2, 2026Updated last week
- Simple and Distributed Machine Learning☆5,207Updated this week
- Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.☆6,605Mar 1, 2026Updated last week
- A minimal benchmark for scalability, speed and accuracy of commonly used open source implementations (R packages, Python scikit-learn, H2…☆1,893Sep 16, 2022Updated 3 years ago
- Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, DeepSeek, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V,…☆8,705Jan 28, 2026Updated last month
- A collection of data science examples implemented across a variety of languages and libraries.☆34Jan 14, 2016Updated 10 years ago
- Base classes to use when writing tests with Spark☆1,550Dec 22, 2025Updated 2 months ago
- PredictionIO, a machine learning server for developers and ML engineers.☆12,528Jan 9, 2021Updated 5 years ago
- Breeze is/was a numerical processing library for Scala.☆3,458Oct 4, 2025Updated 5 months ago
- Mirror of Apache Apex core☆350Jun 7, 2021Updated 4 years ago
- Apache Spark to Apache Cassandra connector☆1,951Apr 29, 2025Updated 10 months ago
- Code to accompany Advanced Analytics with Spark from O'Reilly Media☆1,526Sep 25, 2024Updated last year
- Distributed Prometheus time series database☆1,461Mar 3, 2026Updated last week