full360 / glue-sneaql-demo
☆12Updated 3 years ago
Alternatives and similar repositories for glue-sneaql-demo:
Users that are interested in glue-sneaql-demo are comparing it to the libraries listed below
- Unix tee, but for Kinesis streams☆12Updated 3 years ago
- Autoscaling EMR clusters and Kinesis streams on Amazon Web Services (AWS)☆46Updated last year
- DynamoDB data source for Apache Spark☆95Updated 3 years ago
- ARCHIVED: Log4J Appender for writing data into a Kinesis Stream☆62Updated 6 years ago
- A library for reading data from Amzon S3 with optimised listing using Amazon SQS using Spark SQL Streaming ( or Structured streaming).☆17Updated 9 months ago
- Paper: A Zero-rename committer for object stores☆20Updated 3 years ago
- kafka-connect-s3 : Ingest data from Kafka to Object Stores(s3)☆95Updated 5 years ago
- Mirrors a Kinesis stream to Amazon S3 using the KCL☆42Updated 4 months ago
- Hive Storage Handler for Kinesis.☆11Updated 9 years ago
- AWS bootstrap scripts for Mozilla's flavoured Spark setup.☆47Updated 4 years ago
- Apache Spark AWS Lambda Executor (SAMBA)☆44Updated 6 years ago
- Apiary provides modules which can be combined to create a federated cloud data lake☆36Updated 9 months ago
- On demand presto cluster with mesos, marathon and docker.☆30Updated 6 years ago
- An application that records stats about consumer group offset commits and reports them as prometheus metrics☆14Updated 5 years ago
- HDFS compatible Distributed Filesystem backed Cassandra☆25Updated 9 years ago
- A Hivemall wrapper for Spark☆31Updated 8 years ago
- Kinesis spout for Storm☆106Updated 6 years ago
- Spark package to "plug" holes in data using SQL based rules ⚡️ 🔌☆28Updated 4 years ago
- Data Profiler for AWS Glue Data Catalog application as described in the AWS Big Data Blog post "Build an automatic data profiling and rep…☆19Updated 4 years ago
- Integrating AWS Lambda with EC2 hosted Relational Databases☆43Updated 8 years ago
- Terraform provider for kafka☆32Updated 5 years ago
- ☆22Updated 5 years ago
- Java and Scala client libraries for Concord☆13Updated 7 years ago
- A Spark WordCountJob example as a standalone SBT project with Specs2 tests, runnable on Amazon EMR☆118Updated 8 years ago
- Starter project for building MemSQL Streamliner Pipelines☆32Updated 7 years ago
- Spark UDFs to deserialize Avro messages with schemas stored in Schema Registry.☆18Updated 7 years ago
- s3mper - Consistent Listing for S3☆226Updated last year
- Examples on how to use the command line tools in Avro Tools to read and write Avro files☆153Updated 8 months ago
- Generate a Redshift .manifest file for a given S3 bucket☆21Updated 7 years ago
- Optimizing downstream data processing with Amazon Kinesis Data Firehose and Amazon EMR running Apache Spark☆13Updated last year