Apache Hadoop
☆15,506 · Mar 13, 2026 · Updated last week
Alternatives and similar repositories for hadoop
Users who are interested in hadoop are comparing it to the libraries listed below.
- Apache Spark - A unified analytics engine for large-scale data processing ☆43,001 · Updated this week
- Apache Hive ☆6,014 · Updated this week
- Apache HBase ☆5,580 · Updated this week
- Apache Kafka - A distributed event streaming platform ☆32,158 · Updated this week
- Apache ZooKeeper ☆12,742 · Mar 13, 2026 · Updated last week
- Apache Flink ☆25,875 · Updated this week
- Apache Storm ☆6,676 · Updated this week
- An index of all open-source data ☆4,807 · Oct 6, 2025 · Updated 5 months ago
- An unofficial repository of National Park Service data. ☆1,253 · Mar 13, 2026 · Updated last week
- Assorted data from the General Services Administration. ☆2,255 · Apr 17, 2024 · Updated last year
- Apache Flume is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of log-l… ☆2,558 · Oct 10, 2024 · Updated last year
- Apache Cassandra® ☆9,656 · Updated this week
- Free and Open Source, Distributed, RESTful Search Engine ☆76,362 · Updated this week
- ID3-based implementation of the ML Decision Tree algorithm ☆1,474 · Oct 31, 2018 · Updated 7 years ago
- The java implementation of Apache Dubbo. An RPC and microservice framework. ☆41,708 · Updated this week
- Netty project - an event-driven asynchronous network application framework ☆34,844 · Mar 14, 2026 · Updated last week
- The official home of the Presto distributed SQL query engine for big data ☆16,668 · Updated this week
- Apache Tomcat ☆8,114 · Mar 13, 2026 · Updated last week
- Apache Kylin ☆3,770 · Mar 13, 2026 · Updated last week
- Principal Component Analysis on music loops ☆780 · May 11, 2017 · Updated 8 years ago
- For developers, who are building real-time data-driven applications, Redis is the preferred, fastest, and most feature-rich cache, data s… ☆73,460 · Updated this week
- Large-scale linear classification, regression and ranking in Python ☆1,769 · Jul 18, 2023 · Updated 2 years ago
- Ruby gem to calculate the similarity between texts using tf*idf ☆775 · Feb 26, 2024 · Updated 2 years ago
- Mirror of Apache Sqoop ☆977 · Apr 8, 2021 · Updated 4 years ago
- Google core libraries for Java ☆51,505 · Updated this week
- Apache Ambari simplifies provisioning, managing, and monitoring of Apache Hadoop clusters. ☆2,295 · Feb 20, 2026 · Updated last month
- Apache Druid: a high performance real-time analytics database. ☆13,962 · Updated this week
- Apache Mesos ☆5,362 · Aug 23, 2024 · Updated last year
- Apache Maven core ☆4,987 · Updated this week
- Apache Doris is an easy-to-use, high performance and unified analytics database. ☆15,114 · Updated this week
- Production-Grade Container Scheduling and Management ☆121,194 · Updated this week
- Apache Lucene and Solr open-source search software ☆4,367 · Sep 25, 2024 · Updated last year
- Spring Framework ☆59,763 · Updated this week
- Upserts, Deletes And Incremental Processing on Big Data. ☆6,121 · Mar 13, 2026 · Updated last week
- Apache Pulsar - distributed pub-sub messaging system ☆15,168 · Updated this week
- An Open Source Machine Learning Framework for Everyone ☆194,195 · Updated this week
- Apache RocketMQ is a cloud native messaging and streaming platform, making it simple to build event-driven applications. ☆22,374 · Updated this week
- Cool links & research papers related to Machine Learning applied to source code (MLonCode) ☆6,539 · Dec 3, 2020 · Updated 5 years ago
- Apache Thrift ☆10,908 · Updated this week