qubole / streamxLinks

kafka-connect-s3 : Ingest data from Kafka to Object Stores(s3)

☆95

Alternatives and similar repositories for streamx

Users that are interested in streamx are comparing it to the libraries listed below

Sorting:

wix-incubator / kafka-connect-s3
A Kafka-Connect Sink for S3 with no Hadoop dependencies.
☆57Updated 2 years ago
schema-repo / schema-repo
The Schema Repo is a RESTful web service for storing and serving mappings between schema identifiers and schema definitions.
☆154Updated 3 years ago
SinghAsDev / pankh
☆76Updated 10 years ago
wushujames / kafka-utilities
☆26Updated 5 years ago
rdblue / s3committer
Hadoop output committers for S3
☆111Updated 5 years ago
stripe-archive / herringbone
Tools for working with parquet, impala, and hive
☆134Updated 4 years ago
ottogroup / schedoscope
Schedoscope is a scheduling framework for painfree agile development, testing, (re)loading, and monitoring of your datahub, lake, or what…
☆96Updated 6 years ago
tuplejump / kafka-connect-cassandra
Kafka Connect Cassandra Connector. This project includes source/sink connectors for Cassandra to/from Kafka.
☆78Updated 9 years ago
snowplow-archive / spark-example-project
A Spark WordCountJob example as a standalone SBT project with Specs2 tests, runnable on Amazon EMR
☆119Updated 9 years ago
ZEPL / z-manager
Simplify getting Zeppelin up and running
☆56Updated 9 years ago
memsql / singlestore-spark-connector
A connector for SingleStore and Spark
☆162Updated 2 months ago
hammerlab / grafana-spark-dashboards
Scripts for generating Grafana dashboards for monitoring Spark jobs
☆240Updated 10 years ago
miguno / avro-cli-examples
Examples on how to use the command line tools in Avro Tools to read and write Avro files
☆153Updated last year
snowplow-archive / spark-streaming-example-project
A Spark Streaming job reading events from Amazon Kinesis and writing event counts to DynamoDB
☆93Updated 5 years ago
spotify / spydra
Ephemeral Hadoop clusters using Google Compute Platform
☆134Updated 3 years ago
tresata / spark-kafka
Low level integration of Spark and Kafka
☆130Updated 7 years ago
dataArtisans / flink-dataflow
Google Dataflow Runner for Apache Flink™ (deprecated; please use the up-to-date Beam Runner)
☆88Updated 9 years ago
lensesio / kafka-connect-query-language
SQL for Kafka Connectors
☆99Updated last year
stealthly / metrics-kafka
Metrics produced to Kafka and consumers for monitoring them
☆101Updated 10 years ago
wepay / kafka-connect-bigquery
DEPRECATED. PLEASE USE https://github.com/confluentinc/kafka-connect-bigquery. A Kafka Connect BigQuery sink connector
☆152Updated last year
lensesio / kafka-connect-tools
Kafka Connect Tooling
☆117Updated 4 years ago
blackberry / KaBoom
A High Performance Cluster Consumer for Kafka that creates Avro (boom) files in Hadoop in time based directory paths
☆42Updated 9 years ago
KeithSSmith / spark-compaction
File compaction tool that runs on top of the Spark framework.
☆59Updated 6 years ago
VeritoneAlpha / jaws-spark-sql-rest
☆92Updated 8 years ago
wushujames / kafka-connector-skeleton
A fork of the Apache Kafka "connect-file" Kafka Connect, to use as a starting point to write your own Kafka connectors.
☆37Updated 7 years ago
hbutani / spark-datetime
functionstest
☆33Updated 9 years ago
ImpalaToGo / ImpalaToGo
Fork of Cloudera Impala separated from Hadoop
☆42Updated 9 years ago
saurfang / sparksql-protobuf
Read SparkSQL parquet file as RDD[Protobuf]
☆93Updated 7 years ago
dataArtisans / cascading-flink
Cascading on Apache Flink®
☆54Updated last year
brightcove-archive / ooyala_metrics-storm
Easy metrics collection for Storm topologies using Coda Hale Metrics
☆101Updated 12 years ago