whitfin / s3-concat
Concatenate Amazon S3 files remotely using flexible patterns
☆39Updated 4 years ago
Alternatives and similar repositories for s3-concat:
Users that are interested in s3-concat are comparing it to the libraries listed below
- Gather metadata about your S3 buckets☆49Updated 4 years ago
- Example code for building your own MemSQL Streamliner Pipelines☆23Updated 7 years ago
- UNRELEASED. An opinionated framework for analytics-on-write on event streams using key-value storage☆14Updated 9 years ago
- An HFile-backed Key-Value Server☆42Updated 5 years ago
- Last-seen sketch implementation in Go☆16Updated 4 years ago
- An opinionated Kafka producer/consumer built on top of confluent-kafka-python/librdkafka☆27Updated last month
- Rovers is a service to retrieve repository URLs from multiple repository hosting providers.☆14Updated 5 years ago
- AWS bootstrap scripts for Mozilla's flavoured Spark setup.☆47Updated 5 years ago
- Unix tee, but for Kinesis streams☆12Updated 3 years ago
- Mirrors a Kinesis stream to Amazon S3 using the KCL☆42Updated 6 months ago
- Luigi Plugin for Hubot☆35Updated 8 years ago
- Walk an Amazon s3 path hierarchy☆33Updated this week
- Library and worker to handle transfer of data in s3 into redshift. Includes table creation and manipulation, as well as time-based insert…☆61Updated 2 years ago
- Presto connector to Amazon Kinesis service.☆14Updated 5 years ago
- Python bindings for TrailDB☆39Updated 5 years ago
- Spark Streaming ETL jobs for Mozilla Telemetry☆18Updated 5 years ago
- PREVIEW - Run Bonobo data processing graphs in docker containers.☆13Updated 2 years ago
- ☆19Updated 7 years ago
- Pilosa Dev Kit - implementation tooling and use case examples are here!☆31Updated 2 years ago
- Run commands on your ECS container instances.☆10Updated 9 years ago
- dynamically parse protobuf message then convert to avro☆25Updated 9 years ago
- Run templatable playbooks of Hadoop/Spark/et al jobs on Amazon EMR☆19Updated last year
- Docker image for Apache Hive running on Tez☆7Updated 10 years ago
- logq - Analyzing log files in PartiQL with command-line toolkit, implemented in Rust☆46Updated 2 years ago
- A small library for using unfiltered as a finagle frontend.☆22Updated 11 years ago
- Parquet Command-line Tools☆18Updated 8 years ago
- Hive Storage Handler for Kinesis.☆11Updated 9 years ago
- Deprecated, use https://github.com/mozilla-services/iprepd☆15Updated 6 years ago
- A lightweight daemon for counting unique events using Redis and PostgreSQL☆35Updated 7 years ago
- Bullet is a streaming query engine that can be plugged into any singular data stream using a Stream Processing framework like Apache Stor…☆41Updated 2 years ago