libin / s3distcpLinks
☆20Updated 4 years ago
Alternatives and similar repositories for s3distcp
Users that are interested in s3distcp are comparing it to the libraries listed below
Sorting:
- UNRELEASED. An opinionated framework for analytics-on-write on event streams using key-value storage☆14Updated 9 years ago
- Kinesis spout for Storm☆107Updated 7 years ago
- Amazon Elastic MapReduce code samples☆63Updated 9 years ago
- Open Source Cloud Formation☆59Updated 10 years ago
- Integrating AWS Lambda with EC2 hosted Relational Databases☆43Updated 9 years ago
- Simple Samza Job Using Confluent Platform☆14Updated 9 years ago
- Amazon Kinesis Aggregators provides a simple way to create real time aggregations of data on Amazon Kinesis.☆151Updated 4 years ago
- A Terraform module to create an Amazon Web Services (AWS) Elastic MapReduce (EMR) cluster.☆39Updated 5 years ago
- Tools for working with parquet, impala, and hive☆134Updated 4 years ago
- Streaming left joins in Kafka for change data capture☆52Updated last year
- On demand presto cluster with mesos, marathon and docker.☆30Updated 7 years ago
- Fork of Cloudera Impala separated from Hadoop☆42Updated 9 years ago
- s3mper - Consistent Listing for S3☆229Updated 2 years ago
- Place ASGs on the right Spot Market☆39Updated 8 years ago
- Concatenate Amazon S3 files remotely using flexible patterns☆38Updated 4 years ago
- A Cascading Workflow Visualizer☆83Updated 2 years ago
- Cantor provides utilities for estimating the cardinality of large sets.☆83Updated 3 years ago
- Automatically loads new partitions in AWS Athena☆19Updated 5 years ago
- ARCHIVED: Log4J Appender for writing data into a Kinesis Stream☆62Updated 7 years ago
- Sample Apache Beam pipeline that can be deployed to Amazon Managed Service for Apache Flink. It reads taxi events from a Kinesis data str…☆47Updated last year
- Compare eventual consistency of object stores☆174Updated last year
- Ephemeral Hadoop clusters using Google Compute Platform☆136Updated 3 years ago
- Bullet is a streaming query engine that can be plugged into any singular data stream using a Stream Processing framework like Apache Stor…☆41Updated 2 years ago
- Paper: A Zero-rename committer for object stores☆20Updated 4 years ago
- Redshift Ops Console☆92Updated 9 years ago
- Mirrors a Kinesis stream to Amazon S3 using the KCL☆42Updated last month
- Integration of Samza and Luwak☆100Updated 10 years ago
- Herd is a managed data lake for the cloud. The Herd unified data catalog helps separate storage from compute in the cloud. Manage petabyt…☆138Updated 2 years ago
- Automated deploy for Kafka on AWS☆123Updated 13 years ago
- A Kafka-Connect Sink for S3 with no Hadoop dependencies.☆57Updated 2 years ago