tomwhite / hadoop-book
Example source code accompanying O'Reilly's "Hadoop: The Definitive Guide" by Tom White
☆3,507Updated 5 years ago
Alternatives and similar repositories for hadoop-book:
Users who are interested in hadoop-book are comparing it to the repositories listed below
- Code to accompany Advanced Analytics with Spark from O'Reilly Media☆1,527Updated 7 months ago
- Notes talking about the design and implementation of Apache Spark☆5,312Updated last year
- Contains the code used in the HBase: The Definitive Guide book.☆909Updated 2 years ago
- Apache Hadoop☆15,057Updated this week
- Apache Hive☆5,689Updated this week
- Mirror of Apache Mahout☆2,162Updated last week
- Apache Storm☆6,617Updated this week
- Apache Flume is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of log-l…☆2,550Updated 6 months ago
- Apache HBase☆5,328Updated this week
- Coolplay Spark (酷玩 Spark): Spark source code analysis, Spark libraries, and more☆3,481Updated 2 years ago
- Eclipse plugin for Hadoop 2.2.0, 2.4.1☆558Updated 6 years ago
- Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.☆6,476Updated this week
- Elasticsearch real-time search and analytics natively integrated with Hadoop☆1,938Updated last week
- REST job server for Apache Spark☆2,836Updated 3 months ago
- Apache Kylin☆3,701Updated 2 weeks ago
- A collection of test cases and related material gathered while using Scala and Spark☆1,086Updated 6 years ago
- HiBench is a big data benchmark suite.☆1,474Updated 4 months ago
- Real-time Query for Hadoop; mirror of Apache Impala☆34Updated 2 years ago
- Apache Spark - A unified analytics engine for large-scale data processing☆40,989Updated this week
- Apache Phoenix☆1,037Updated last week
- MapReduce, Spark, Java, and Scala for Data Algorithms Book☆1,074Updated 6 months ago
- Windows binaries for Hadoop versions (built from the git commit ID used for the ASF release)☆2,590Updated last year
- Learning Apache Spark, including code and data. Most parts can run locally.☆602Updated 3 years ago
- High performance data store solution☆1,435Updated 3 weeks ago
- Enterprise Stream Process Engine☆3,895Updated last year
- MongoDB Connector for Hadoop☆1,518Updated 3 years ago
- Mirror of Apache Sqoop☆978Updated 4 years ago
- Benchmarks for Low Latency (Streaming) solutions including Apache Storm, Apache Spark, Apache Flink, ...☆639Updated last year
- The Internals of Apache Spark☆1,497Updated 7 months ago
- Livy is an open source REST interface for interacting with Apache Spark from anywhere☆1,007Updated 2 years ago