whitfin / s3-concat
Concatenate Amazon S3 files remotely using flexible patterns
☆39Updated 3 years ago
Related projects ⓘ
Alternatives and complementary repositories for s3-concat
- Utilities and tools based around Amazon S3 to provide convenience APIs in a CLI☆53Updated 3 years ago
- Example code for building your own MemSQL Streamliner Pipelines☆23Updated 7 years ago
- Gather metadata about your S3 buckets☆48Updated 3 years ago
- Run commands on your ECS container instances.☆10Updated 8 years ago
- UNRELEASED. An opinionated framework for analytics-on-write on event streams using key-value storage☆14Updated 9 years ago
- Proximity searches using Redis as backend☆12Updated 8 years ago
- Unix tee, but for Kinesis streams☆12Updated 3 years ago
- Compile JSON Schema into Avro and BigQuery schemas☆43Updated 9 months ago
- Forwards syslog messages to Kafka☆16Updated 9 years ago
- Spark Streaming ETL jobs for Mozilla Telemetry☆18Updated 4 years ago
- AWS bootstrap scripts for Mozilla's flavoured Spark setup.☆47Updated 4 years ago
- Deprecated, use https://github.com/mozilla-services/iprepd☆15Updated 6 years ago
- Run templatable playbooks of Hadoop/Spark/et al jobs on Amazon EMR☆19Updated 7 months ago
- Walk an Amazon s3 path hierarchy☆33Updated this week
- Mirrors a Kinesis stream to Amazon S3 using the KCL☆42Updated 2 months ago
- Web UI for Cassandra Reaper☆22Updated 7 years ago
- Paper: A Zero-rename committer for object stores☆20Updated 3 years ago
- Example project which simulates an interesting analytics use case using MemSQL Pipelines.☆14Updated 7 years ago
- ☆20Updated last year
- dynamically parse protobuf message then convert to avro☆25Updated 9 years ago
- Real-time flowchart visualisation for Kafka-based distributed systems.☆121Updated 7 years ago
- A Scala framework to build derived datasets, aka batch views, of Telemetry data.☆34Updated 2 years ago
- Deploy Presto on the cloud easily, using Terraform and Packer☆44Updated last year
- A highly scalable collector for tricorder applications☆10Updated 6 years ago
- Library and worker to handle transfer of data in s3 into redshift. Includes table creation and manipulation, as well as time-based insert…☆61Updated last year
- A Go implementation of the ReactiveSocket Protocol☆13Updated 7 years ago
- Terraform samples to sync and use IAM users ssh keys to connect to EC2 instances☆13Updated 5 years ago
- Autoscaling EMR clusters and Kinesis streams on Amazon Web Services (AWS)☆47Updated 10 months ago
- Code and architecture diagrams for performance testing a few API approaches on AWS☆11Updated 5 years ago