Netflix / s3mper
s3mper - Consistent Listing for S3
☆228 Updated 2 years ago
Alternatives and similar repositories for s3mper:
Users who are interested in s3mper are comparing it to the libraries listed below:
- kafka-connect-s3: Ingest data from Kafka to Object Stores (s3) ☆95 Updated 6 years ago
- Compare eventual consistency of object stores ☆172 Updated last year
- Tools for working with parquet, impala, and hive ☆134 Updated 4 years ago
- Timberlake is a Job Tracker for Hadoop. ☆177 Updated 5 years ago
- The Schema Repo is a RESTful web service for storing and serving mappings between schema identifiers and schema definitions. ☆156 Updated 2 years ago
- Hadoop output committers for S3 ☆109 Updated 4 years ago
- DEPRECATED: Open source Apache Cassandra running on DC/OS is now replaced by mesosphere/dcos-commons/frameworks/cassandra. This repositor… ☆116 Updated 6 years ago
- A Bulk Data Pipeline out of Cassandra ☆323 Updated 5 years ago
- Fork of Cloudera Impala separated from Hadoop ☆42 Updated 8 years ago
- Failure inducer framework ☆190 Updated 7 years ago
- Cantor provides utilities for estimating the cardinality of large sets. ☆83 Updated 3 years ago
- A library to implement asynchronous dependency graphs for services in Java ☆258 Updated 2 years ago
- A Cascading Workflow Visualizer ☆83 Updated last year
- ARCHIVED: Log4J Appender for writing data into a Kinesis Stream ☆62 Updated 6 years ago
- Storehaus is a library that makes it easy to work with asynchronous key value stores ☆466 Updated 4 years ago
- Amazon ECS Scheduler Driver ☆168 Updated 5 years ago
- The Amazon DynamoDB Streams Adapter implements the Amazon Kinesis interface so that your application can use KCL to consume and process d… ☆99 Updated 6 months ago
- a flexible metric forwarding agent ☆80 Updated 4 years ago
- An AWS SDK-backed FileSystem driver for Hadoop ☆64 Updated 4 years ago
- Cache File System optimized for columnar formats and object stores ☆182 Updated 2 years ago
- Storm on Mesos! ☆138 Updated 3 years ago
- AWS bootstrap scripts for Mozilla's flavoured Spark setup. ☆47 Updated 5 years ago
- type-checked dictionary templating library for python ☆93 Updated last year
- Herd is a managed data lake for the cloud. The Herd unified data catalog helps separate storage from compute in the cloud. Manage petabyt… ☆135 Updated 2 years ago
- Hadoop mapreduce job to bulk load data into Cassandra ☆75 Updated 3 years ago
- Next-generation web analytics processing with Scala, Spark, and Parquet. ☆331 Updated 10 years ago
- https://github.com/apache/incubator-myriad is our new home. See ☆253 Updated 9 years ago
- ☆204 Updated last year
- DynamoDB data source for Apache Spark ☆95 Updated 3 years ago
- Ephemeral Hadoop clusters using Google Compute Platform ☆135 Updated 3 years ago