apache/samza

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/apache/samza)

apache / samza

Mirror of Apache Samza

☆846

Alternatives and similar repositories for samza

Users that are interested in samza are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

apache / samza-hello-samza
View on GitHub
Mirror of Apache Samza
☆111May 15, 2026Updated 2 months ago
apache / incubator-heron
View on GitHub
Apache Heron (Incubating) is a realtime, distributed, fault-tolerant stream processing engine from Twitter
☆3,630Mar 1, 2023Updated 3 years ago
apache / storm
View on GitHub
Apache Storm
☆6,688Updated this week
apache / beam
View on GitHub
Apache Beam is a unified programming model for Batch and Streaming data processing.
☆8,629Updated this week
apache / drill
View on GitHub
Apache Drill is a distributed MPP query layer for self describing data
☆2,019Jun 26, 2026Updated 2 weeks ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
apache / helix
View on GitHub
Mirror of Apache Helix
☆503Jul 9, 2026Updated last week
apache / gobblin
View on GitHub
A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, orga…
☆2,270Jun 24, 2026Updated 3 weeks ago
apache / druid
View on GitHub
Apache Druid: a high performance real-time analytics database.
☆14,026Updated this week
apache / pinot
View on GitHub
Apache Pinot - A realtime distributed OLAP datastore
☆6,107Updated this week
apache / flink
View on GitHub
Apache Flink
☆26,178Updated this week
apache / mesos
View on GitHub
Apache Mesos
☆5,369May 15, 2026Updated 2 months ago
apache / avro
View on GitHub
Apache Avro is a data serialization system.
☆3,285Updated this week
apache / bookkeeper
View on GitHub
Apache BookKeeper - a scalable, fault tolerant and low latency storage service optimized for append-only workloads
☆2,006Updated this week
apache / kafka
View on GitHub
Apache Kafka - A distributed event streaming platform
☆33,225Updated this week
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
apache / apex-core
View on GitHub
Mirror of Apache Apex core
☆350Jun 7, 2021Updated 5 years ago
apache / logging-flume
View on GitHub
Apache Flume is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of log-l…
☆2,565Jul 10, 2026Updated last week
prestodb / presto
View on GitHub
The official home of the Presto distributed SQL query engine for big data
☆16,716Updated this week
apache / carbondata
View on GitHub
High performance data store solution
☆1,447Jul 4, 2026Updated last week
apache / hbase
View on GitHub
Apache HBase
☆5,541Updated this week
apache / cassandra
View on GitHub
Open source transactional distributed database. Linear scalability and proven fault-tolerance on commodity hardware or cloud infrastructu…
☆9,914Updated this week
apache / oozie
View on GitHub
Mirror of Apache Oozie
☆729Jan 27, 2025Updated last year
apache / tez
View on GitHub
Apache Tez
☆516Updated this week
apache / phoenix
View on GitHub
Apache Phoenix
☆1,060Updated this week
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
apache / kylin
View on GitHub
Apache Kylin
☆3,767Updated this week
apache / parquet-java
View on GitHub
Apache Parquet Java
☆3,065Updated this week
apache / zeppelin
View on GitHub
Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.
☆6,639Updated this week
apache / geode
View on GitHub
Apache Geode
☆2,369Updated this week
confluentinc / ksql
View on GitHub
The database purpose-built for stream processing applications.
☆310Updated this week
gearpump / gearpump
View on GitHub
Lightweight real-time big data streaming engine over Akka
☆756Updated this week
apache / calcite
View on GitHub
Apache Calcite
☆5,154Updated this week
apache / spark
View on GitHub
Apache Spark - A unified analytics engine for large-scale data processing
☆43,617Updated this week
apache / aurora
View on GitHub
Apache Aurora - A Mesos framework for long-running services, cron jobs, and ad-hoc jobs
☆636Feb 23, 2020Updated 6 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
linkedin / ambry
View on GitHub
Distributed object store
☆1,787Jun 23, 2026Updated 3 weeks ago
apache / accumulo
View on GitHub
Apache Accumulo
☆1,155Updated this week
twitter / summingbird
View on GitHub
Streaming MapReduce with Scalding and Storm
☆2,123Jan 19, 2022Updated 4 years ago
akka / akka-core
View on GitHub
A platform to build and run apps that are elastic, agile, and resilient. SDK, libraries, and hosted environments.
☆13,282Updated this week
apache / kudu
View on GitHub
Mirror of Apache Kudu
☆1,904Updated this week
Alluxio / alluxio
View on GitHub
Alluxio, data orchestration for analytics and machine learning in the cloud
☆7,211Apr 29, 2025Updated last year
apache / ignite
View on GitHub
Apache Ignite
☆5,071Updated this week