tomwhite / hadoop-bookView external linksLinks
Example source code accompanying O'Reilly's "Hadoop: The Definitive Guide" by Tom White
☆3,511Mar 17, 2020Updated 5 years ago
Alternatives and similar repositories for hadoop-book
Users that are interested in hadoop-book are comparing it to the libraries listed below
Sorting:
- Apache Hadoop☆15,473Feb 8, 2026Updated last week
- Contains the code used in the HBase: The Definitive Guide book.☆907Oct 4, 2022Updated 3 years ago
- Apache Spark - A unified analytics engine for large-scale data processing☆42,810Updated this week
- Code repository for O'Reilly Hadoop Application Architectures book☆163May 26, 2015Updated 10 years ago
- Notes talking about the design and implementation of Apache Spark☆5,357Apr 2, 2024Updated last year
- Source code to accompany the book "Hadoop in Practice", published by Manning.☆203Feb 11, 2020Updated 6 years ago
- Apache Flink☆25,781Updated this week
- Apache HBase☆5,584Updated this week
- eclipse plugin for hadoop 2.2.0 , 2.4.1☆558Jan 24, 2019Updated 7 years ago
- Code to accompany Advanced Analytics with Spark from O'Reilly Media☆1,527Sep 25, 2024Updated last year
- Apache Hive☆6,007Updated this week
- 酷玩 Spark: Spark 源代码解析、Spark 类库等☆3,485May 18, 2022Updated 3 years ago
- MapReduce, Spark, Java, and Scala for Data Algorithms Book☆1,084Oct 14, 2024Updated last year
- Mirror of Apache Kafka☆31,881Updated this week
- Apache Storm☆6,671Feb 4, 2026Updated last week
- Elasticsearch real-time search and analytics natively integrated with Hadoop☆2,049Updated this week
- Apache ZooKeeper☆12,720Jan 28, 2026Updated 2 weeks ago
- This is a code example that complements the material in the ZooKeeper O'Reilly book.☆400Dec 11, 2025Updated 2 months ago
- The java implementation of Apache Dubbo. An RPC and microservice framework.☆41,729Updated this week
- Google core libraries for Java☆51,465Updated this week
- Run Hadoop Custer within Docker Containers☆1,829Jul 1, 2024Updated last year
- Netty project - an event-driven asynchronous network application framework☆34,779Feb 6, 2026Updated last week
- 阿里云计算平台DataWorks(https://help.aliyun.com/document_detail/137663.html) 团队出品,为监控而生的数据库连接池☆28,221Updated this week
- The official home of the Presto distributed SQL query engine for big data☆16,653Updated this week
- flink learning blog. http://www.54tianzhisheng.cn/ 含 Flink 入门、概念、原理、实战、性能调优、源码解析等内容。涉及 Flink Connector、Metrics、Library、DataStream API、Ta…☆15,042Mar 12, 2025Updated 11 months ago
- A curated list of awesome big data frameworks, ressources and other awesomeness.☆14,225Feb 5, 2026Updated last week
- Enterprise Stream Process Engine☆3,889Jun 16, 2023Updated 2 years ago
- A curated list of amazingly awesome Hadoop and Hadoop ecosystem resources☆1,115May 7, 2024Updated last year
- CMAK is a tool for managing Apache Kafka clusters☆11,951Aug 2, 2023Updated 2 years ago
- Open source SQL Query Assistant service for Databases/Warehouses☆1,466Updated this week
- Hadoop (Utilities, Patches and Examples)☆243Jun 21, 2016Updated 9 years ago
- Hadoop docker image☆1,208Jun 25, 2020Updated 5 years ago
- 阿里巴巴 MySQL binlog 增量订阅&消费组件☆29,616Jan 28, 2026Updated 2 weeks ago
- Free and Open Source, Distributed, RESTful Search Engine☆76,061Updated this week
- Azkaban workflow manager.☆4,517Jul 3, 2024Updated last year
- Deep Learning Book Chinese Translation☆37,190Dec 3, 2019Updated 6 years ago
- Spring Framework☆59,601Updated this week
- Empowering Data Intelligence with Distributed SQL for Sharding, Scalability, and Security Across All Databases.☆20,674Updated this week
- Machine Learning、Deep Learning、PostgreSQL、Distributed System、Node.Js、Golang☆15,056Jul 4, 2024Updated last year