tomwhite / hadoop-book
Example source code accompanying O'Reilly's "Hadoop: The Definitive Guide" by Tom White
☆3,507Updated 5 years ago
Alternatives and similar repositories for hadoop-book:
Users that are interested in hadoop-book are comparing it to the libraries listed below
- Contains the code used in the HBase: The Definitive Guide book.☆909Updated 2 years ago
- Apache HBase☆5,312Updated this week
- MapReduce, Spark, Java, and Scala for Data Algorithms Book☆1,072Updated 5 months ago
- Apache Phoenix☆1,037Updated this week
- Notes talking about the design and implementation of Apache Spark☆5,309Updated 11 months ago
- Apache Ambari simplifies provisioning, managing, and monitoring of Apache Hadoop clusters.☆2,186Updated this week
- Apache Flume is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of log-l…☆2,547Updated 5 months ago
- Apache Storm☆6,614Updated this week
- Real-time Query for Hadoop; mirror of Apache Impala☆34Updated 2 years ago
- Coolplay Spark: Spark source code analysis, Spark libraries, and more☆3,471Updated 2 years ago
- Code to accompany Advanced Analytics with Spark from O'Reilly Media☆1,531Updated 6 months ago
- Eclipse plugin for Hadoop 2.2.0, 2.4.1☆557Updated 6 years ago
- Elasticsearch real-time search and analytics natively integrated with Hadoop☆1,935Updated last week
- Apache Hive☆5,664Updated this week
- Apache Kylin☆3,685Updated last week
- Apache Hadoop☆15,002Updated this week
- Apache Parquet Java☆2,761Updated last week
- Learning Apache Spark, including code and data. Most of it can be run locally.☆602Updated 3 years ago
- Mirror of Apache Sqoop☆978Updated 3 years ago
- High performance data store solution☆1,436Updated 3 weeks ago
- Byzer (former MLSQL): A low-code open-source programming language for data pipeline, analytics and AI.☆1,839Updated 9 months ago
- Run Hadoop Cluster within Docker Containers☆1,808Updated 8 months ago
- Chinese edition of the official Apache Spark documentation☆1,186Updated last year
- A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, orga…☆2,238Updated last week
- HiBench is a big data benchmark suite.☆1,475Updated 3 months ago
- Azkaban workflow manager.☆4,490Updated 8 months ago
- REST job server for Apache Spark☆2,836Updated 2 months ago
- A collection of test cases and related materials gathered while using Scala and Spark☆1,086Updated 6 years ago
- Windows binaries for Hadoop versions (built from the git commit ID used for the ASF release)☆2,583Updated last year
- Digging holes and filling them in☆693Updated 8 years ago