apache / mahoutLinks
Apache Mahout - an environment for quickly creating scalable, performant machine learning applications.
☆2,204Updated this week
Alternatives and similar repositories for mahout
Users that are interested in mahout are comparing it to the libraries listed below
Sorting:
- Oryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning☆1,785Updated 4 years ago
- LensKit recommender toolkit.☆973Updated 4 years ago
- Apache Storm☆6,672Updated this week
- Datumbox is an open-source Machine Learning framework written in Java which allows the rapid development of Machine Learning and Statisti…☆1,085Updated 2 years ago
- MapReduce, Spark, Java, and Scala for Data Algorithms Book☆1,084Updated last year
- An open source ML system for the end-to-end data science lifecycle☆1,079Updated this week
- Apache Flume is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of log-l…☆2,559Updated last year
- Mirror of Apache Sqoop☆978Updated 4 years ago
- Mirror of Apache Pig☆686Updated 4 months ago
- Real-time Query for Hadoop; mirror of Apache Impala☆34Updated 3 years ago
- Mirror of Apache Oozie☆727Updated last year
- Apache HBase☆5,581Updated this week
- A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, orga…☆2,260Updated 3 weeks ago
- Apache Phoenix☆1,049Updated this week
- Apache OpenNLP☆1,578Updated this week
- Apache Heron (Incubating) is a realtime, distributed, fault-tolerant stream processing engine from Twitter☆3,698Updated 2 years ago
- Elasticsearch real-time search and analytics natively integrated with Hadoop☆2,049Updated this week
- Apache Kylin☆3,767Updated last month
- Distributed deep learning on Hadoop and Spark clusters.☆1,262Updated 6 years ago
- TensorFlowOnSpark brings TensorFlow programs to Apache Spark clusters.☆3,859Updated 2 years ago
- Distributed Graph Database☆5,239Updated 3 years ago
- Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.☆6,606Updated this week
- Apache Hive☆6,005Updated this week
- Please visit https://github.com/h2oai/h2o-3 for latest H2O☆2,329Updated last year
- PredictionIO, a machine learning server for developers and ML engineers.☆12,534Updated 5 years ago
- Mirror of Apache Hadoop common☆161Updated 5 years ago
- Apache Drill is a distributed MPP query layer for self describing data☆2,006Updated last week
- Now redundant weka mirror. Visit https://github.com/Waikato/weka-trunk for the real deal☆329Updated 6 years ago
- Apache Geode☆2,353Updated 2 weeks ago
- Apache Lucene and Solr open-source search software☆4,370Updated last year