apache / hadoop
Apache Hadoop
☆14,944 Updated this week
Alternatives and similar repositories for hadoop:
Users who are interested in hadoop compare it to the repositories listed below.
- Apache Hive ☆5,630 Updated this week
- Apache HBase ☆5,282 Updated this week
- Apache Spark - A unified analytics engine for large-scale data processing ☆40,565 Updated this week
- Apache Flink ☆24,545 Updated this week
- Apache ZooKeeper ☆12,384 Updated last week
- Apache Kylin ☆3,678 Updated this week
- Mirror of Apache Kafka ☆29,476 Updated this week
- Apache Flume is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of log-l… ☆2,547 Updated 4 months ago
- Apache Storm ☆6,613 Updated this week
- Apache Cassandra® ☆9,021 Updated this week
- The official home of the Presto distributed SQL query engine for big data ☆16,210 Updated this week
- Alluxio, data orchestration for analytics and machine learning in the cloud ☆6,939 Updated this week
- Mirror of Apache Mahout ☆2,158 Updated last week
- Apache Druid: a high performance real-time analytics database. ☆13,613 Updated this week
- Notes talking about the design and implementation of Apache Spark ☆5,300 Updated 10 months ago
- Apache Lucene open-source search software ☆2,862 Updated this week
- Example source code accompanying O'Reilly's "Hadoop: The Definitive Guide" by Tom White ☆3,506 Updated 4 years ago
- Azkaban workflow manager. ☆4,489 Updated 7 months ago
- Apache Ambari simplifies provisioning, managing, and monitoring of Apache Hadoop clusters. ☆2,173 Updated last week
- Metadata Comparison Toolkit. As of now, V-1.0.0 only supports comparing two DDL files (.sql) or two DDL statements. You can also pars… ☆10 Updated last year
- Apache Calcite ☆4,736 Updated this week
- Upserts, Deletes And Incremental Processing on Big Data. ☆5,653 Updated this week
- Apache Parquet Java ☆2,720 Updated this week
- MySQL Server, the world's most popular open source database, and MySQL Cluster, a real-time, open source transactional database. ☆11,137 Updated last month
- Enterprise Stream Process Engine ☆3,905 Updated last year
- Apache Beam is a unified programming model for Batch and Streaming data processing. ☆8,003 Updated this week
- Coolplay Spark: Spark source code analysis, Spark libraries, and more ☆3,470 Updated 2 years ago
- Apache ActiveMQ Classic ☆2,340 Updated 2 weeks ago
- Apache Impala ☆1,185 Updated this week
- High performance data store solution ☆1,435 Updated last month