whitfin / s3-concatLinks
Concatenate Amazon S3 files remotely using flexible patterns
☆38Updated 4 years ago
Alternatives and similar repositories for s3-concat
Users that are interested in s3-concat are comparing it to the libraries listed below
Sorting:
- Gather metadata about your S3 buckets☆49Updated 4 years ago
- Compile JSON Schema into Avro and BigQuery schemas☆44Updated last year
- Mirrors a Kinesis stream to Amazon S3 using the KCL☆42Updated this week
- Unix tee, but for Kinesis streams☆12Updated 3 years ago
- AWS bootstrap scripts for Mozilla's flavoured Spark setup.☆47Updated 5 years ago
- UNRELEASED. An opinionated framework for analytics-on-write on event streams using key-value storage☆14Updated 9 years ago
- Spark Streaming ETL jobs for Mozilla Telemetry☆18Updated 5 years ago
- Example code for building your own MemSQL Streamliner Pipelines☆23Updated 8 years ago
- ☆20Updated 3 years ago
- Last-seen sketch implementation in Go☆16Updated 4 years ago
- Python bindings for TrailDB☆39Updated 5 years ago
- JSONCDC is now maintained at,☆90Updated 7 years ago
- dynamically parse protobuf message then convert to avro☆25Updated 10 years ago
- An HFile-backed Key-Value Server☆42Updated 6 years ago
- Scala implementation of a credstash client☆10Updated 8 years ago
- Twitter Streaming API Example with Kafka Streams in Scala☆49Updated 8 years ago
- Find out what's going on in your services environment!☆11Updated 8 years ago
- Luigi Plugin for Hubot☆36Updated 8 years ago
- Weighted linear regression☆15Updated 4 years ago
- Parquet Command-line Tools☆18Updated 8 years ago
- Example Scala/SBT event consumer for Amazon Kinesis☆22Updated 10 years ago
- Collection of AWS Lambdas for creating and managing Delta tables☆38Updated this week
- Autoscaling EMR clusters and Kinesis streams on Amazon Web Services (AWS)☆47Updated last year
- Tools for working with parquet, impala, and hive☆134Updated 4 years ago
- Paper: A Zero-rename committer for object stores☆20Updated 4 years ago
- An opinionated Kafka producer/consumer built on top of confluent-kafka-python/librdkafka☆27Updated last month
- This is an introduction of Apache Spark DataFrames.☆41Updated 10 years ago
- Accurate counters with Kafka & RocksDB.☆16Updated 4 years ago
- Hive Storage Handler for Kinesis.☆11Updated 10 years ago
- Run templatable playbooks of Hadoop/Spark/et al jobs on Amazon EMR☆19Updated last year