Source code for Big Data: Principles and best practices of scalable realtime data systems
☆333Jun 8, 2024Updated last year
Alternatives and similar repositories for big-data-code
Users that are interested in big-data-code are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Dead-simple vertical partitioning, compression, appends, and consolidation of data on a distributed filesystem.☆216Jun 29, 2016Updated 9 years ago
- A repository of information, examples and good practices around the Lambda Architecture☆369Oct 26, 2017Updated 8 years ago
- Cascading on Apache Flink®☆54Feb 5, 2024Updated 2 years ago
- Simple Lambda Architecture implementation based on Apache Spark (Core, SQL, Streaming)☆40Feb 19, 2017Updated 9 years ago
- Code files uploaded by Packt publishing☆33Jan 14, 2021Updated 5 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆33May 24, 2014Updated 11 years ago
- SentiStorm - Real-time Twitter Sentiment Classification based on Apache Storm☆10May 22, 2018Updated 7 years ago
- The book's repo☆274Jul 1, 2017Updated 8 years ago
- JMXTrans configuration for hadoop/cassandra/zookeeper☆31Dec 3, 2015Updated 10 years ago
- A toy example of a "Lambda architecture" using Storm's Trident as real-time layer and Splout SQL as batch layer.☆76Feb 11, 2020Updated 6 years ago
- UNRELEASED. An opinionated framework for analytics-on-write on event streams using key-value storage☆14Sep 10, 2015Updated 10 years ago
- Oryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning☆1,783Aug 16, 2021Updated 4 years ago
- ☆14Sep 16, 2013Updated 12 years ago
- MapReduce, Spark, Java, and Scala for Data Algorithms Book☆1,080Oct 14, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- A Vert.x based micro service framework☆12Apr 28, 2016Updated 9 years ago
- Apache Spark to Apache Cassandra connector☆1,951Apr 29, 2025Updated 10 months ago
- Mirror of Apache Harmony DRLVM☆14Mar 21, 2010Updated 16 years ago
- Compressing and Decoding Term Statistics Time Series -- ECIR 2016☆10Dec 11, 2015Updated 10 years ago
- mirror of google code version for git updates☆25Apr 5, 2014Updated 11 years ago
- Simple FieldCache based query introspection Solr Search Component - solves the 'red sofa' problem☆11Jan 27, 2025Updated last year
- ☆12Mar 20, 2015Updated 11 years ago
- A basic example of how to read and write streaming data using Apache Spark and Kafka on HDInsight☆13Mar 2, 2023Updated 3 years ago
- Reports the resource usage of Docker containers to InfluxDB☆39Dec 10, 2014Updated 11 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Source code that accompanies the book "Hadoop in Practice, Second Edition".☆80Sep 10, 2014Updated 11 years ago
- Will come later...☆20Jul 1, 2022Updated 3 years ago
- This project demonstrates how to configure a full stack geo-enabled Internet of Things (IoT) solution using Mesosphere's open sourced Dat…☆113May 3, 2018Updated 7 years ago
- Activator template for Reactive Kafka☆20Nov 22, 2016Updated 9 years ago
- Code examples from scala in action book☆119Feb 9, 2022Updated 4 years ago
- Supporting material (code, schemas etc) for Unified Log Processing (Manning Publications)☆98Jul 22, 2022Updated 3 years ago
- Python source code cross reference☆17Aug 10, 2016Updated 9 years ago
- Example source code accompanying O'Reilly's "Hadoop: The Definitive Guide" by Tom White☆3,505Mar 17, 2020Updated 6 years ago
- Fundamentals of Apache Flink [video], published by Packt☆12Jan 30, 2023Updated 3 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- A Stack-Oriented Imperative Programming language☆11Sep 22, 2019Updated 6 years ago
- Testing utilities for storm☆40Jan 5, 2012Updated 14 years ago
- Starter project for building MemSQL Streamliner Pipelines☆32Apr 18, 2017Updated 8 years ago
- Storm JMS Integration☆80Dec 15, 2022Updated 3 years ago
- Lightweight real-time big data streaming engine over Akka☆758Mar 1, 2022Updated 4 years ago
- DEEP BERLIN AI for Good Hackathon 2020☆14Apr 21, 2020Updated 5 years ago
- Assets used in Apress -- Scalable Big Data Architecture -- book☆20Dec 11, 2015Updated 10 years ago