apache / mahout
Mirror of Apache Mahout
☆2,174 · Updated 3 weeks ago
Alternatives and similar repositories for mahout
Users who are interested in mahout are comparing it to the libraries listed below.
- Oryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning ☆1,786 · Updated 4 years ago
- LensKit recommender toolkit. ☆974 · Updated 4 years ago
- Apache Storm ☆6,648 · Updated this week
- Apache Flume is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of log-l… ☆2,557 · Updated 10 months ago
- Mirror of Apache Pig ☆686 · Updated this week
- Apache HBase ☆5,378 · Updated this week
- Mirror of Apache Sqoop ☆984 · Updated 4 years ago
- Real-time Query for Hadoop; mirror of Apache Impala ☆34 · Updated 2 years ago
- Please visit https://github.com/h2oai/h2o-3 for latest H2O ☆2,221 · Updated 10 months ago
- MapReduce, Spark, Java, and Scala for Data Algorithms Book ☆1,079 · Updated 10 months ago
- An open source ML system for the end-to-end data science lifecycle ☆1,062 · Updated this week
- Elasticsearch real-time search and analytics natively integrated with Hadoop ☆1,942 · Updated this week
- Apache Hive ☆5,780 · Updated this week
- Apache Kylin ☆3,740 · Updated last week
- Apache Phoenix ☆1,046 · Updated last week
- Mirror of Apache Oozie ☆725 · Updated 7 months ago
- Datumbox is an open-source Machine Learning framework written in Java which allows the rapid development of Machine Learning and Statisti… ☆1,088 · Updated last year
- TensorFlowOnSpark brings TensorFlow programs to Apache Spark clusters. ☆3,868 · Updated 2 years ago
- Apache Drill is a distributed MPP query layer for self describing data ☆1,990 · Updated 3 weeks ago
- Now redundant weka mirror. Visit https://github.com/Waikato/weka-trunk for the real deal ☆328 · Updated 6 years ago
- Code to accompany Advanced Analytics with Spark from O'Reilly Media ☆1,531 · Updated 11 months ago
- Apache Lucene and Solr open-source search software ☆4,376 · Updated 11 months ago
- Apache OpenNLP ☆1,535 · Updated this week
- Contains the code used in the HBase: The Definitive Guide book. ☆910 · Updated 2 years ago
- A connector for Spark that allows reading and writing to/from Redis cluster ☆946 · Updated 10 months ago
- Java Evaluator API for PMML ☆904 · Updated last month
- Distributed deep learning on Hadoop and Spark clusters. ☆1,259 · Updated 5 years ago
- Apache Hadoop ☆15,231 · Updated this week
- A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, orga… ☆2,244 · Updated last week
- Apache Heron (Incubating) is a realtime, distributed, fault-tolerant stream processing engine from Twitter ☆3,621 · Updated 2 years ago