Sparkling Water provides H2O functionality inside a Spark cluster
☆977 Nov 5, 2025 Updated 5 months ago
Alternatives and similar repositories for sparkling-water
Users that are interested in sparkling-water are comparing it to the libraries listed below.
- H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random F…☆7,493Updated this week
- DEPRECATED! Use https://github.com/h2oai/sparkling-water repository! H2O and Spark interoperability based on Tachyon.☆44Nov 25, 2014Updated 11 years ago
- Deep Learning in H2O using Native GPU Backends☆282Feb 20, 2018Updated 8 years ago
- Web based interactive computing environment for H2O☆145Oct 24, 2024Updated last year
- Tutorials and training material for the H2O Machine Learning Platform☆1,503Oct 24, 2024Updated last year
- Simplifying robust end-to-end machine learning on Apache Spark.☆475Apr 18, 2017Updated 9 years ago
- DEPRECATED Build, manage and deploy H2O's high-speed machine learning models.☆61Apr 6, 2019Updated 7 years ago
- A library for time series analysis on Apache Spark☆1,199Oct 13, 2020Updated 5 years ago
- Interactive and Reactive Data Science using Scala and Spark.☆3,150May 16, 2023Updated 2 years ago
- Spark Extension : ML transformers, SQL aggregations, etc that are missing in Apache Spark☆146Jan 26, 2016Updated 10 years ago
- Oryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning☆1,785Aug 16, 2021Updated 4 years ago
- training material☆47Oct 24, 2024Updated last year
- Distributed Neural Networks for Spark☆611Jul 23, 2020Updated 5 years ago
- Templates for projects based on top of H2O.☆39Mar 17, 2025Updated last year
- Mirror of Apache Toree (Incubating)☆749Apr 2, 2026Updated 2 weeks ago
- H2Oai GPU Edition☆466Oct 24, 2024Updated last year
- RSparkling: Use H2O Sparkling Water from R (Spark + R + Machine Learning)☆62Jul 19, 2018Updated 7 years ago
- Distributed deep learning on Hadoop and Spark clusters.☆1,262Nov 15, 2019Updated 6 years ago
- TensorFlowOnSpark brings TensorFlow programs to Apache Spark clusters.☆3,860Jul 10, 2023Updated 2 years ago
- REST job server for Apache Spark☆2,845Mar 3, 2026Updated last month
- A collection of data science examples implemented across a variety of languages and libraries.☆34Jan 14, 2016Updated 10 years ago
- Presentations from H2O meetups & conferences by the H2O.ai team☆406Oct 29, 2025Updated 5 months ago
- Project SnappyData - memory optimized analytics database, based on Apache Spark™ and Apache Geode™. Stream, Transact, Analyze, Predict in…☆1,035Nov 21, 2022Updated 3 years ago
- Livy is an open source REST interface for interacting with Apache Spark from anywhere☆1,009Oct 5, 2022Updated 3 years ago
- A scalable machine learning library on Apache Spark☆797Aug 30, 2021Updated 4 years ago
- MLeap: Deploy ML Pipelines to Production☆1,536Mar 10, 2026Updated last month
- Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.☆6,615Updated this week
- An open source ML system for the end-to-end data science lifecycle☆1,084Updated this week
- A minimal benchmark for scalability, speed and accuracy of commonly used open source implementations (R packages, Python scikit-learn, H2…☆1,894Sep 16, 2022Updated 3 years ago
- Starter project for building MemSQL Streamliner Pipelines☆32Apr 18, 2017Updated 9 years ago
- Simple and Distributed Machine Learning☆5,223Apr 10, 2026Updated last week
- Base classes to use when writing tests with Spark☆1,552Apr 12, 2026Updated last week
- Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, DeepSeek, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V,…☆8,763Jan 28, 2026Updated 2 months ago
- PredictionIO, a machine learning server for developers and ML engineers.☆12,530Jan 9, 2021Updated 5 years ago
- Breeze is/was a numerical processing library for Scala.☆3,455Oct 4, 2025Updated 6 months ago
- Drizzle integration with Apache Spark☆120Sep 11, 2018Updated 7 years ago
- R interface for Apache Spark☆967Feb 9, 2026Updated 2 months ago
- Apache Spark to Apache Cassandra connector☆1,951Apr 29, 2025Updated 11 months ago
- Distributed Deep Learning on Spark☆403Oct 8, 2016Updated 9 years ago