apache / mahoutLinks
Mirror of Apache Mahout
☆2,169Updated last week
Alternatives and similar repositories for mahout
Users that are interested in mahout are comparing it to the libraries listed below
Sorting:
- Oryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning☆1,783Updated 3 years ago
- LensKit recommender toolkit.☆975Updated 3 years ago
- Apache Storm☆6,637Updated last week
- Mirror of Apache Pig☆688Updated 2 weeks ago
- Distributed deep learning on Hadoop and Spark clusters.☆1,260Updated 5 years ago
- Apache HBase☆5,351Updated this week
- Apache OpenNLP☆1,521Updated this week
- Apache Hive☆5,735Updated this week
- Datumbox is an open-source Machine Learning framework written in Java which allows the rapid development of Machine Learning and Statisti…☆1,087Updated last year
- TensorFlowOnSpark brings TensorFlow programs to Apache Spark clusters.☆3,872Updated last year
- Please visit https://github.com/h2oai/h2o-3 for latest H2O☆2,222Updated 8 months ago
- Real-time Query for Hadoop; mirror of Apache Impala☆33Updated 2 years ago
- Mirror of Apache Sqoop☆983Updated 4 years ago
- Mirror of Apache Oozie☆722Updated 5 months ago
- An open source ML system for the end-to-end data science lifecycle☆1,049Updated 2 weeks ago
- Apache Heron (Incubating) is a realtime, distributed, fault-tolerant stream processing engine from Twitter☆3,624Updated 2 years ago
- Apache Flume is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of log-l…☆2,555Updated 8 months ago
- Apache Drill is a distributed MPP query layer for self describing data☆1,978Updated this week
- PredictionIO, a machine learning server for developers and ML engineers.☆12,529Updated 4 years ago
- Apache Kylin☆3,718Updated 2 months ago
- Breeze is/was a numerical processing library for Scala.☆3,457Updated 10 months ago
- Apache Nutch is an extensible and scalable web crawler☆3,038Updated 3 months ago
- Machine Learning Platform and Recommendation Engine built on Kubernetes☆1,472Updated 5 years ago
- A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, orga…☆2,239Updated last month
- Code to accompany Advanced Analytics with Spark from O'Reilly Media☆1,530Updated 9 months ago
- Apache Phoenix☆1,041Updated this week
- Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.☆6,510Updated this week
- A connector for Spark that allows reading and writing to/from Redis cluster☆946Updated 8 months ago
- Elasticsearch real-time search and analytics natively integrated with Hadoop☆1,942Updated this week
- REST job server for Apache Spark☆2,839Updated 2 months ago