Big-Data-Manning / big-data-code
Source code for Big Data: Principles and best practices of scalable realtime data systems
☆333Updated 7 months ago
Alternatives and similar repositories for big-data-code:
Users that are interested in big-data-code are comparing it to the libraries listed below
- Data and example code for Programming Pig, by Alan F. Gates☆188Updated 8 years ago
- Code repository for O'Reilly Hadoop Application Architectures book☆166Updated 9 years ago
- [PROJECT IS NO LONGER MAINTAINED] Code examples that show to integrate Apache Kafka 0.8+ with Apache Storm 0.9+ and Apache Spark Streamin…☆724Updated 2 years ago
- This page is a summary to keep the track of Hadoop related projects, and relevant projects around Big Data scene focused on the open sour…☆691Updated 3 years ago
- Source, data and turotials of the blog post video series of Hue, the Web UI for Hadoop.☆237Updated 7 years ago
- Examples for High Performance Spark☆506Updated 2 months ago
- High Performance Kafka Connector for Spark Streaming.Supports Multi Topic Fetch, Kafka Security. Reliable offset management in Zookeeper.…☆633Updated 2 years ago
- Example application for analyzing Twitter data using CDH - Flume, Oozie, Hive☆287Updated 8 years ago
- Examples for learning spark☆332Updated 9 years ago
- Training materials for Strata, AMP Camp, etc☆150Updated 9 years ago
- Diagrams describing Apache Hadoop internals (2.3.0 or later).☆431Updated 5 years ago
- ☆77Updated 9 years ago
- The book's repo☆272Updated 7 years ago
- Spark app that demonstrates reading and writing data to from MongoDB and BSON files☆44Updated 10 years ago
- ReAir is a collection of easy-to-use tools for replicating tables and partitions between Hive data warehouses.☆281Updated 5 years ago
- A repository of information, examples and good practices around the Lambda Architecture☆368Updated 7 years ago
- Source code to accompany the book "Hadoop in Practice", published by Manning.☆203Updated 4 years ago
- REST job server for Spark. Note that this is *not* the mainline open source version. For that, go to https://github.com/spark-jobserver…☆344Updated 7 years ago
- LinkedIn's previous generation Kafka to HDFS pipeline.☆876Updated 4 years ago
- Contains the code used in the HBase: The Definitive Guide book.☆908Updated 2 years ago
- Sample programs for the Kafka 0.9 API☆149Updated 2 years ago
- Kite SDK Examples☆99Updated 3 years ago
- MapReduce, Spark, Java, and Scala for Data Algorithms Book☆1,071Updated 3 months ago
- Tools for reading data from Solr as a Spark RDD and indexing objects from Spark into Solr using SolrJ.☆445Updated last year
- Examples on how to use the command line tools in Avro Tools to read and write Avro files☆153Updated 8 months ago
- Sparkline BI Accelerator provides fast ad-hoc query capability over Logical Cubes. This has been folded into our SNAP Platform(http://bit…☆283Updated 6 years ago
- Repository for MapReduce Design Patterns (O'Reilly 2012) example source code☆235Updated 9 years ago
- Kite SDK☆393Updated 2 years ago
- KillrWeather is a reference application (work in progress) showing how to easily integrate streaming and batch data processing with Apach…☆1,183Updated 8 years ago
- Practical examples of using Apache Spark in several different use cases☆103Updated 8 years ago