full360 / glue-sneaql-demo
☆12Updated 4 years ago
Alternatives and similar repositories for glue-sneaql-demo:
Users who are interested in glue-sneaql-demo are comparing it to the libraries listed below
- Hive Storage Handler for Kinesis.☆11Updated 9 years ago
- Unix tee, but for Kinesis streams☆12Updated 3 years ago
- ☆22Updated 5 years ago
- On-demand Presto cluster with Mesos, Marathon and Docker.☆30Updated 7 years ago
- Google Spreadsheets datasource for SparkSQL and DataFrames☆57Updated last year
- Paper: A Zero-rename committer for object stores☆20Updated 3 years ago
- Spark UDFs to deserialize Avro messages with schemas stored in Schema Registry.☆19Updated 7 years ago
- Example code for building your own MemSQL Streamliner Pipelines☆23Updated 8 years ago
- Starter project for building MemSQL Streamliner Pipelines☆32Updated 8 years ago
- DynamoDB data source for Apache Spark☆95Updated 3 years ago
- Mirrors a Kinesis stream to Amazon S3 using the KCL☆42Updated 7 months ago
- A Hivemall wrapper for Spark☆31Updated 9 years ago
- Optimizing downstream data processing with Amazon Kinesis Data Firehose and Amazon EMR running Apache Spark☆13Updated 2 years ago
- Apache Spark AWS Lambda Executor (SAMBA)☆44Updated 6 years ago
- ☆54Updated 7 years ago
- ARCHIVED: Log4J Appender for writing data into a Kinesis Stream☆62Updated 6 years ago
- kafka-connect-s3: Ingest data from Kafka to object stores (S3)☆95Updated 6 years ago
- A JDBC driver that emulates Redshift-specific commands.☆61Updated 2 years ago
- AWS bootstrap scripts for Mozilla's flavoured Spark setup.☆47Updated 5 years ago
- A Spark-based data comparison tool at scale which facilitates software development engineers to compare a plethora of pair combinations o…☆50Updated last year
- A library for reading data from Amazon S3 with optimised listing via Amazon SQS, using Spark SQL Streaming (or Structured Streaming).☆17Updated last year
- Spark cloud integration: tests, cloud committers and more☆19Updated 2 months ago
- This is an introduction of Apache Spark DataFrames.☆41Updated 10 years ago
- Ephemeral Hadoop clusters using Google Compute Platform☆135Updated 3 years ago
- Bulletproof Apache Spark jobs with fast root cause analysis of failures.☆72Updated 4 years ago
- Tools for working with parquet, impala, and hive☆134Updated 4 years ago
- Better, container friendly big-data images for Docker☆39Updated 8 years ago
- Autoscaling EMR clusters and Kinesis streams on Amazon Web Services (AWS)☆47Updated last year
- UNRELEASED. An opinionated framework for analytics-on-write on event streams using key-value storage☆14Updated 9 years ago
- Random implementation notes☆33Updated 11 years ago