kafka-connect-s3 : Ingest data from Kafka to Object Stores(s3)
☆95Apr 4, 2019Updated 6 years ago
Alternatives and similar repositories for streamx
Users that are interested in streamx are comparing it to the libraries listed below
Sorting:
- A Kafka-Connect Sink for S3 with no Hadoop dependencies.☆57Mar 19, 2023Updated 3 years ago
- Kafka Source and Sink Connectors☆19Jan 18, 2018Updated 8 years ago
- Quark is a data virtualization engine over analytic databases.☆100Jul 13, 2017Updated 8 years ago
- A logstash codec plugin for decoding and encoding Avro records☆26Feb 22, 2024Updated 2 years ago
- Secor is a service implementing Kafka log persistence☆1,857Mar 10, 2026Updated last week
- ☆11Oct 29, 2018Updated 7 years ago
- Java and Scala client libraries for Concord☆13Feb 15, 2017Updated 9 years ago
- Graphite integration for Kafka☆15Jun 5, 2018Updated 7 years ago
- NuCypher for Kafka. Start building from this module (it fetches the appropriate branch from Kafka repository)☆17Oct 13, 2017Updated 8 years ago
- Kafka Connect Connector for Jenkins Open Source Continuous Integration Tool☆31Nov 15, 2022Updated 3 years ago
- Herd is a managed data lake for the cloud. The Herd unified data catalog helps separate storage from compute in the cloud. Manage petabyt…☆138Oct 1, 2022Updated 3 years ago
- Unix tee, but for Kinesis streams☆12Oct 19, 2021Updated 4 years ago
- Real-time aggregation of metrics from large distributed systems.☆107Nov 6, 2018Updated 7 years ago
- docker image to deploy rabbitmq cluster on mesos with one marathon app☆10Oct 12, 2017Updated 8 years ago
- Run templatable playbooks of Hadoop/Spark/et al jobs on Amazon EMR☆19Jan 20, 2026Updated 2 months ago
- HDFS and MapReduce example source code accompanying wikibooks "Beginning Hadoop Programming" by Jaehwa Jung☆12Nov 30, 2014Updated 11 years ago
- Spark Extension : ML transformers, SQL aggregations, etc that are missing in Apache Spark☆146Jan 26, 2016Updated 10 years ago
- Extensible Python Framework for Apache Mesos☆33Oct 19, 2017Updated 8 years ago
- ☆12Jun 10, 2023Updated 2 years ago
- Plugin to integrate consul health checks into nagios☆24Jul 11, 2018Updated 7 years ago
- A big data platform monitoring tool based on ELK stack☆20Feb 16, 2020Updated 6 years ago
- Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds.☆93Mar 5, 2024Updated 2 years ago
- Kafka Sink Connector for Amazon Redshift☆23Jan 16, 2019Updated 7 years ago
- Cache File System optimized for columnar formats and object stores☆187Aug 11, 2022Updated 3 years ago
- ☆22Apr 24, 2016Updated 9 years ago
- stable, high-throughput journalling to S3☆101Dec 15, 2015Updated 10 years ago
- Selenium on Mesos☆12Sep 28, 2015Updated 10 years ago
- Consumes Kafka topics specified in the config, and outputs them in chunks as desired in an S3 Bucket. Keeps track of offsets via S3.☆15Sep 6, 2013Updated 12 years ago
- Usage examples for Divolte collector☆17Nov 8, 2017Updated 8 years ago
- JSONs -> JSON Schema☆153Sep 14, 2020Updated 5 years ago
- ☆76May 19, 2015Updated 10 years ago
- C++ utility library☆24Jan 3, 2014Updated 12 years ago
- Fork of Cloudera Impala separated from Hadoop☆42Jul 13, 2016Updated 9 years ago
- How to use Parquet in Flink☆32May 2, 2017Updated 8 years ago
- Scala library for Reactive streaming Microservices, CQRS, Event Sourcing, Event Logging, & message-driven apps.☆22Feb 19, 2018Updated 8 years ago
- A mesos plugin for Relay that lets you auto-scale the number of currently running instances of a bash command☆38May 16, 2021Updated 4 years ago
- MLeap allows for easily putting Spark ML pipelines into production☆78Oct 27, 2016Updated 9 years ago
- Lightweight mesos stats collector for influxdb☆16Jan 4, 2022Updated 4 years ago
- Kafka Streams DSL vs Processor API☆16Nov 26, 2017Updated 8 years ago