Big-Data-Manning/big-data-code

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Big-Data-Manning/big-data-code)

Big-Data-Manning / big-data-code

Source code for Big Data: Principles and best practices of scalable realtime data systems

☆332

Alternatives and similar repositories for big-data-code

Users that are interested in big-data-code are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

nathanmarz / dfs-datastores
View on GitHub
Dead-simple vertical partitioning, compression, appends, and consolidation of data on a distributed filesystem.
☆215Jun 29, 2016Updated 10 years ago
mhausenblas / lambda-architecture.net
View on GitHub
A repository of information, examples and good practices around the Lambda Architecture
☆369Oct 26, 2017Updated 8 years ago
dataArtisans / cascading-flink
View on GitHub
Cascading on Apache Flink®
☆54Feb 5, 2024Updated 2 years ago
tmatyashovsky / lambda-architecture-jeeconf-kyiv
View on GitHub
Simple Lambda Architecture implementation based on Apache Spark (Core, SQL, Streaming)
☆40Feb 19, 2017Updated 9 years ago
mingfang / docker-druid
View on GitHub
☆34May 24, 2014Updated 12 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
learning-spring-boot / learning-spring-boot-code-1.2
View on GitHub
This version is frozen. See the link for all versions of this book's code
☆10May 3, 2016Updated 10 years ago
spark-in-action / first-edition
View on GitHub
The book's repo
☆270Jul 1, 2017Updated 9 years ago
nathanmarz / elephantdb
View on GitHub
Distributed database specialized in exporting key/value data from Hadoop
☆558Jun 27, 2014Updated 12 years ago
ezbz / jmxtrans-lib
View on GitHub
JMXTrans configuration for hadoop/cassandra/zookeeper
☆31Dec 3, 2015Updated 10 years ago
OryxProject / oryx
View on GitHub
Oryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning
☆1,783Aug 16, 2021Updated 4 years ago
ExNexu / hdfs-scala-example
View on GitHub
☆13Sep 16, 2013Updated 12 years ago
mahmoudparsian / data-algorithms-book
View on GitHub
MapReduce, Spark, Java, and Scala for Data Algorithms Book
☆1,081Oct 14, 2024Updated last year
amoAHCP / vert.x-microservice
View on GitHub
A Vert.x based micro service framework
☆12Apr 28, 2016Updated 10 years ago
mayanhui / storm-in-action
View on GitHub
book 《storm in action》 source code
☆24Mar 16, 2015Updated 11 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Cascading / tutorials
View on GitHub
Tutorials for Cascading, Lingual, Pattern and other projects
☆18Aug 30, 2016Updated 9 years ago
apache / cassandra-spark-connector
View on GitHub
Apache Spark to Apache Cassandra connector
☆1,949Apr 29, 2025Updated last year
apache / harmony-drlvm
View on GitHub
Mirror of Apache Harmony DRLVM
☆14May 15, 2026Updated last month
mattflax / dropwizard-tika-server
View on GitHub
A DropWizard wrapper around Apache Tika.
☆10Dec 22, 2016Updated 9 years ago
jinfengr / time-series-compression
View on GitHub
Compressing and Decoding Term Statistics Time Series -- ECIR 2016
☆10Dec 11, 2015Updated 10 years ago
jbarrez / spring-boot-activiti-example
View on GitHub
☆12Mar 20, 2015Updated 11 years ago
lucidworks / simple-category-extraction-component
View on GitHub
Simple FieldCache based query introspection Solr Search Component - solves the 'red sofa' problem
☆11Jan 27, 2025Updated last year
Azure-Samples / hdinsight-spark-scala-kafka
View on GitHub
A basic example of how to read and write streaming data using Apache Spark and Kafka on HDInsight
☆13Mar 2, 2023Updated 3 years ago
rapportive-oss / storm-amqp-spout
View on GitHub
Allows a Storm topology to consume an AMQP exchange as an input source.
☆55Oct 3, 2012Updated 13 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
alexholmes / hiped2
View on GitHub
Source code that accompanies the book "Hadoop in Practice, Second Edition".
☆80Sep 10, 2014Updated 11 years ago
ArchitectingHBase / examples
View on GitHub
Will come later...
☆20Jul 1, 2022Updated 4 years ago
amollenkopf / dcos-iot-demo
View on GitHub
This project demonstrates how to configure a full stack geo-enabled Internet of Things (IoT) solution using Mesosphere's open sourced Dat…
☆112May 3, 2018Updated 8 years ago
softwaremill / activator-reactive-kafka-scala
View on GitHub
Activator template for Reactive Kafka
☆20Nov 22, 2016Updated 9 years ago
alexanderdean / Unified-Log-Processing
View on GitHub
Supporting material (code, schemas etc) for Unified Log Processing (Manning Publications)
☆98Jul 22, 2022Updated 3 years ago
nraychaudhuri / scalainaction
View on GitHub
Code examples from scala in action book
☆117Feb 9, 2022Updated 4 years ago
swenson / python-xr
View on GitHub
Python source code cross reference
☆17Aug 10, 2016Updated 9 years ago
tomwhite / hadoop-book
View on GitHub
Example source code accompanying O'Reilly's "Hadoop: The Definitive Guide" by Tom White
☆3,502Mar 17, 2020Updated 6 years ago
PacktPublishing / Fundamentals-of-Apache-Flink
View on GitHub
Fundamentals of Apache Flink [video], published by Packt
☆12Jan 30, 2023Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
leowhitehead / MontiLang
View on GitHub
A Stack-Oriented Imperative Programming language
☆11Sep 22, 2019Updated 6 years ago
schleyfox / storm-test
View on GitHub
Testing utilities for storm
☆40Jan 5, 2012Updated 14 years ago
thammegowda / tika-ner-corenlp
View on GitHub
Stanford CoreNLP NER addon for Apache Tika's NamerEntityParser
☆13Feb 26, 2022Updated 4 years ago
ad-tech-group / openssp-docs
View on GitHub
☆12Jan 4, 2021Updated 5 years ago
peter-lawrey / Java-Chronicle-OLD
View on GitHub
☆34Feb 17, 2016Updated 10 years ago
memsql / streamliner-starter
View on GitHub
Starter project for building MemSQL Streamliner Pipelines
☆32Apr 18, 2017Updated 9 years ago
pereferrera / storm-feeds-example
View on GitHub
This is a toy example for illustrating the usefulness of Storm in two use cases: stream processing and continuous computation.
☆41Oct 12, 2020Updated 5 years ago