Apache Mahout - an environment for quickly creating scalable, performant machine learning applications.
☆2,208 · Updated this week
Alternatives and similar repositories for mahout
Users who are interested in mahout compare it to the libraries listed below.
- Apache Storm ☆6,671 · Feb 4, 2026 · Updated 3 weeks ago
- Apache Spark - A unified analytics engine for large-scale data processing ☆42,898 · Updated this week
- PredictionIO, a machine learning server for developers and ML engineers. ☆12,529 · Jan 9, 2021 · Updated 5 years ago
- LensKit recommender toolkit. ☆974 · Aug 23, 2021 · Updated 4 years ago
- Suite of tools for deploying and training deep learning models using the JVM. Highlights include model import for keras, tensorflow, and … ☆14,205 · Updated this week
- Apache HBase ☆5,588 · Updated this week
- Apache Hive ☆6,002 · Updated this week
- Apache Hadoop ☆15,487 · Updated this week
- Apache Flume is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of log-l… ☆2,559 · Oct 10, 2024 · Updated last year
- Oryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning ☆1,786 · Aug 16, 2021 · Updated 4 years ago
- Please visit https://github.com/h2oai/h2o-3 for latest H2O ☆2,324 · Oct 24, 2024 · Updated last year
- Mirror of Apache Sqoop ☆979 · Apr 8, 2021 · Updated 4 years ago
- Apache Kylin ☆3,766 · Dec 29, 2025 · Updated 2 months ago
- Real-time Query for Hadoop; mirror of Apache Impala ☆34 · Dec 27, 2022 · Updated 3 years ago
- Apache Flink ☆25,825 · Updated this week
- Mahout in Action Example Code ☆348 · Jun 15, 2021 · Updated 4 years ago
- Apache Lucene and Solr open-source search software ☆4,369 · Sep 25, 2024 · Updated last year
- Mirror of Apache Kafka ☆32,065 · Updated this week
- Datumbox is an open-source Machine Learning framework written in Java which allows the rapid development of Machine Learning and Statisti… ☆1,084 · Nov 30, 2023 · Updated 2 years ago
- LibRec: A Leading Java Library for Recommender Systems, see ☆3,267 · Jul 13, 2023 · Updated 2 years ago
- Apache Druid: a high performance real-time analytics database. ☆13,942 · Updated this week
- Mirror of Apache Oozie ☆728 · Jan 27, 2025 · Updated last year
- Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more. ☆6,605 · Updated this week
- Machine Learning Platform and Recommendation Engine built on Kubernetes ☆1,479 · Apr 12, 2020 · Updated 5 years ago
- Apache Drill is a distributed MPP query layer for self describing data ☆2,010 · Jan 29, 2026 · Updated last month
- Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Juli… ☆20,829 · Oct 25, 2023 · Updated 2 years ago
- The official home of the Presto distributed SQL query engine for big data ☆16,662 · Updated this week
- Apache Cassandra® ☆9,644 · Updated this week
- Apache ZooKeeper ☆12,731 · Feb 19, 2026 · Updated last week
- Apache Mesos ☆5,365 · Aug 23, 2024 · Updated last year
- ☆149 · Mar 4, 2014 · Updated 11 years ago
- Apache Ambari simplifies provisioning, managing, and monitoring of Apache Hadoop clusters. ☆2,291 · Feb 20, 2026 · Updated last week
- Azkaban workflow manager. ☆4,515 · Jul 3, 2024 · Updated last year
- Theano was a Python library that allows you to define, optimize, and evaluate mathematical expressions involving multi-dimensional arrays… ☆9,986 · Jan 15, 2024 · Updated 2 years ago
- Framework and Library for Distributed Online Machine Learning ☆708 · May 16, 2019 · Updated 6 years ago
- H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random F… ☆7,510 · Feb 21, 2026 · Updated last week
- A platform to build and run apps that are elastic, agile, and resilient. SDK, libraries, and hosted environments. ☆13,261 · Updated this week
- Apache Nutch is an extensible and scalable web crawler ☆3,135 · Updated this week
- Apache Ignite ☆5,044 · Updated this week