snowplow-archive / kinesis-teeLinks
Unix tee, but for Kinesis streams
☆12Updated 3 years ago
Alternatives and similar repositories for kinesis-tee
Users that are interested in kinesis-tee are comparing it to the libraries listed below
Sorting:
- kafka-connect-s3 : Ingest data from Kafka to Object Stores(s3)☆94Updated 6 years ago
- A Kafka-Connect Sink for S3 with no Hadoop dependencies.☆57Updated 2 years ago
- Run templatable playbooks of Hadoop/Spark/et al jobs on Amazon EMR☆19Updated last year
- Autoscaling EMR clusters and Kinesis streams on Amazon Web Services (AWS)☆47Updated last year
- Apache Spark AWS Lambda Executor (SAMBA)☆44Updated 7 years ago
- Simple Samza Job Using Confluent Platform☆14Updated 9 years ago
- Mirrors a Kinesis stream to Amazon S3 using the KCL☆42Updated this week
- Hive Storage Handler for Kinesis.☆11Updated 10 years ago
- Fork of Cloudera Impala separated from Hadoop☆42Updated 9 years ago
- Tool for exploring data on an Apache Kafka cluster☆42Updated 4 years ago
- Automatically loads new partitions in AWS Athena☆19Updated 5 years ago
- An application that records stats about consumer group offset commits and reports them as prometheus metrics☆14Updated 6 years ago
- ☆21Updated 2 years ago
- Scala SDK for working with Snowplow enriched events in Spark, AWS Lambda, Flink et al.☆21Updated 8 months ago
- UNRELEASED. An opinionated framework for analytics-on-write on event streams using key-value storage☆14Updated 9 years ago
- Helpful tools for monitoring Kafka Connect☆20Updated 7 years ago
- Tools for working with parquet, impala, and hive☆134Updated 4 years ago
- Run templatable playbooks of SQL scripts in series and parallel on Redshift, PostgreSQL, BigQuery and Snowflake☆81Updated 2 months ago
- Kafka Connect Cassandra Connector. This project includes source/sink connectors for Cassandra to/from Kafka.☆78Updated 8 years ago
- Schema registry for CSV, TSV, JSON, AVRO and Parquet schema. Supports schema inference and GraphQL API.☆112Updated 5 years ago
- Bullet is a streaming query engine that can be plugged into any singular data stream using a Stream Processing framework like Apache Stor…☆41Updated 2 years ago
- kinesis-kafka-connector is connector based on Kafka Connect to publish messages to Amazon Kinesis streams or Amazon Kinesis Firehose.☆155Updated last year
- ☆22Updated 6 years ago
- DynamoDB data source for Apache Spark☆95Updated 3 years ago
- A bunch of utility classes for Java, Hadoop, HBase, Pig, etc.☆76Updated 11 years ago
- PySpark for ETL jobs including lineage to Apache Atlas in one script via code inspection☆18Updated 8 years ago
- Collect local Mesos slave, underlying operating system and machine metrics and produce to Apache Kafka☆20Updated 9 years ago
- Spark stream from kafka(json) to s3(parquet)☆15Updated 6 years ago
- Spark UDFs to deserialize Avro messages with schemas stored in Schema Registry.☆20Updated 7 years ago
- Cantor provides utilities for estimating the cardinality of large sets.☆83Updated 3 years ago