jeoffreylim / maelstromLinks

Maelstrom is an open source Kafka integration with Spark that is designed to be developer friendly, high performance (millisecond stream processing), scalable (consumes messges at Spark worker nodes), and is extremely reliable.

☆22

Alternatives and similar repositories for maelstrom

Users that are interested in maelstrom are comparing it to the libraries listed below

Sorting:

phatak-dev / flink-examples
Flink Examples
☆39Updated 9 years ago
ottogroup / SPQR
Spooker is a dynamic framework for processing high volume data streams via processing pipelines
☆30Updated 9 years ago
ottogroup / flink-operator-library
Collection of generic Apache Flink operators
☆17Updated 8 years ago
blackberry / KaBoom
A High Performance Cluster Consumer for Kafka that creates Avro (boom) files in Hadoop in time based directory paths
☆42Updated 9 years ago
project-flink / flink-perf
Flink performance tests
☆28Updated 5 years ago
RedisLabs / spark-timeseries
A library for financial and time series calculations on Apache Spark
☆28Updated 9 years ago
hortonworks-spark / spark-schema-registry
Schema Registry integration for Apache Spark
☆40Updated 2 years ago
bluejoe2008 / spark-http-stream
spark structured streaming via HTTP communication
☆18Updated 3 years ago
maropu / spark-sql-server
Yet Another Spark SQL JDBC/ODBC server based on the PostgreSQL V3 protocol
☆34Updated 2 years ago
flipkart-incubator / kafka-filtering
Very fast & efficient grep for Kafka stream
☆41Updated 9 years ago
uber / uberscriptquery
UberScriptQuery, a SQL-like DSL to make writing Spark jobs super easy
☆61Updated last year
wushujames / kafka-utilities
☆26Updated 5 years ago
yamrcraft / etl-light
A light Kafka to HDFS/S3 ETL library based on Apache Spark
☆41Updated 8 years ago
dataArtisans / cascading-flink
Cascading on Apache Flink®
☆54Updated last year
cerner / common-kafka
Common utilities for Apache Kafka
☆36Updated last year
streaming-analytics / Styx
Streaming Analytics platform, built with Apache Flink and Kafka
☆35Updated last year
jinyeluo / smarthbasecompactor
a smart, automated non-intrusive driver for hbase region-level major-compact
☆8Updated 9 years ago
bomeng / Heracles
High performance HBase / Spark SQL engine
☆28Updated 3 years ago
aljoscha / flink-fault-tolerant-stream-example
An example of using Flink for Fault-Tolerant Stream Processing
☆12Updated 6 years ago
ftrossbach / kiqr
A distributed generic query layer for Apache Kafka Interactive Queries
☆26Updated 7 years ago
ExpediaGroup / jasvorno
A library for strong, schema based conversion between 'natural' JSON documents and Avro
☆18Updated last year
tuplejump / snackfs
HDFS compatible Distributed Filesystem backed Cassandra
☆25Updated 9 years ago
jkorab / ameliant-tools
A set of tools to ease working with Zookeeper and Kafka.
☆23Updated 9 years ago
allegro / camus-compressor
Camus Compressor merges files created by Camus and saves them in a compressed format.
☆13Updated 2 years ago
ndolgov / experiments
Code examples for my blog posts
☆22Updated 6 years ago
memsql / streamliner-starter
Starter project for building MemSQL Streamliner Pipelines
☆32Updated 8 years ago
milinda / samza-sql
SamzaSQL: Streaming SQL implementation on top of Apache Samza and Apache Kafka
☆29Updated 9 years ago
quantiply / grafana-druid-wikipedia
Example using Grafana with Druid
☆11Updated 10 years ago
Samsung / spark-cep
Spark CEP is an extension of Spark Streaming to support SQL-based query processing
☆56Updated 8 years ago
srikalyc / Sql4D
Sql interface to druid.
☆77Updated 9 years ago