apache / mahout
Mirror of Apache Mahout
☆2,163Updated this week
Alternatives and similar repositories for mahout:
Users that are interested in mahout are comparing it to the libraries listed below
- Oryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning☆1,782Updated 3 years ago
- An open source ML system for the end-to-end data science lifecycle☆1,041Updated this week
- Apache Storm☆6,614Updated this week
- LensKit recommender toolkit.☆974Updated 3 years ago
- Please visit https://github.com/h2oai/h2o-3 for latest H2O☆2,223Updated 4 months ago
- Mirror of Apache Sqoop☆978Updated 3 years ago
- Datumbox is an open-source Machine Learning framework written in Java which allows the rapid development of Machine Learning and Statisti…☆1,084Updated last year
- Real-time Query for Hadoop; mirror of Apache Impala☆34Updated 2 years ago
- Apache HBase☆5,310Updated this week
- Apache Flume is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of log-l…☆2,545Updated 5 months ago
- PredictionIO, a machine learning server for developers and ML engineers.☆12,530Updated 4 years ago
- A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, orga…☆2,238Updated this week
- Interactive and Reactive Data Science using Scala and Spark.☆3,146Updated last year
- A scalable machine learning library on Apache Spark☆793Updated 3 years ago
- Apache Heron (Incubating) is a realtime, distributed, fault-tolerant stream processing engine from Twitter☆3,628Updated 2 years ago
- Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.☆6,471Updated this week
- REST job server for Apache Spark☆2,836Updated 2 months ago
- Machine Learning Platform and Recommendation Engine built on Kubernetes☆1,471Updated 4 years ago
- Apache Hive☆5,657Updated this week
- Elasticsearch real-time search and analytics natively integrated with Hadoop☆1,935Updated this week
- Mirror of Apache Oozie☆723Updated last month
- Sparkling Water provides H2O functionality inside Spark cluster☆966Updated 4 months ago
- Mirror of Apache Pig☆687Updated 5 months ago
- MapReduce, Spark, Java, and Scala for Data Algorithms Book☆1,071Updated 5 months ago
- Apache Phoenix☆1,037Updated this week
- Distributed Graph Database☆5,242Updated 2 years ago
- A connector for Spark that allows reading and writing to/from Redis cluster☆946Updated 5 months ago
- DataStax Connector for Apache Spark to Apache Cassandra☆1,944Updated this week
- Azkaban workflow manager.☆4,490Updated 8 months ago
- Dr. Elephant is a job and flow-level performance monitoring and tuning tool for Apache Hadoop and Apache Spark☆1,354Updated last year