apache / hadoop
Apache Hadoop
☆15,011Updated this week
Alternatives and similar repositories for hadoop:
Users that are interested in hadoop are comparing it to the libraries listed below
- Apache HBase☆5,312Updated this week
- Apache Hive☆5,667Updated this week
- Apache Spark - A unified analytics engine for large-scale data processing☆40,832Updated this week
- Apache Storm☆6,612Updated last week
- Mirror of Apache Kafka☆29,766Updated this week
- Apache ZooKeeper☆12,427Updated this week
- Apache Flume is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of log-l…☆2,547Updated 5 months ago
- Apache Kylin☆3,685Updated 2 weeks ago
- Apache Flink☆24,683Updated this week
- Alluxio, data orchestration for analytics and machine learning in the cloud☆6,964Updated this week
- Mirror of Apache Mahout☆2,163Updated this week
- Apache Cassandra®☆9,115Updated this week
- Apache Druid: a high performance real-time analytics database.☆13,652Updated this week
- Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.☆6,474Updated last week
- Apache Ambari simplifies provisioning, managing, and monitoring of Apache Hadoop clusters.☆2,189Updated this week
- Apache Mesos☆5,301Updated 7 months ago
- Apache Maven core☆4,558Updated this week
- Upserts, Deletes And Incremental Processing on Big Data.☆5,709Updated this week
- Apache Pulsar - distributed pub-sub messaging system☆14,514Updated this week
- Elasticsearch real-time search and analytics natively integrated with Hadoop☆1,935Updated this week
- The official home of the Presto distributed SQL query engine for big data☆16,288Updated this week
- Apache NiFi☆5,201Updated this week
- Apache Calcite☆4,772Updated this week
- PredictionIO, a machine learning server for developers and ML engineers.☆12,528Updated 4 years ago
- Example source code accompanying O'Reilly's "Hadoop: The Definitive Guide" by Tom White☆3,506Updated 5 years ago
- Apache Tomcat☆7,764Updated this week
- Suite of tools for deploying and training deep learning models using the JVM. Highlights include model import for keras, tensorflow, and …☆13,897Updated this week
- Apache Avro is a data serialization system.☆3,038Updated last week
- Apache Iceberg☆7,087Updated this week
- Apache Beam is a unified programming model for Batch and Streaming data processing.☆8,058Updated this week