amazon-ion / ion-hive-serde
A Apache Hive SerDe (short for serializer/deserializer) for the Ion file format.
☆30Updated last week
Alternatives and similar repositories for ion-hive-serde:
Users that are interested in ion-hive-serde are comparing it to the libraries listed below
- Ion Path Extraction API aims to combine the convenience of a DOM API with the speed of a streaming API.☆16Updated 2 months ago
- ☆15Updated 2 months ago
- Use SQL to transform your avro schema/records☆28Updated 7 years ago
- Optimizing downstream data processing with Amazon Kinesis Data Firehose and Amazon EMR running Apache Spark☆13Updated last year
- A solution describing data-processing design pattern for streaming data through Kinesis and Spark Streaming at real-time.☆38Updated 9 months ago
- Data Profiler for AWS Glue Data Catalog application as described in the AWS Big Data Blog post "Build an automatic data profiling and rep…☆19Updated 4 years ago
- Source for the GitHub Pages for Ion.☆23Updated 2 weeks ago
- Amundsen Gremlin☆21Updated 2 years ago
- A Kotlin reference implementation of the Ion Schema Specification.☆26Updated last year
- Bullet is a streaming query engine that can be plugged into any singular data stream using a Stream Processing framework like Apache Stor…☆41Updated 2 years ago
- Apiary provides modules which can be combined to create a federated cloud data lake☆36Updated last year
- Support for Ion in Intellij IDEA.☆29Updated 5 months ago
- ☆19Updated 5 months ago
- Dremio Flight connector. Access Dremio using Arrow flight☆40Updated 4 years ago
- ☆14Updated last month
- Hive Storage Handler for Kinesis.☆11Updated 9 years ago
- Source for PartiQL-related documents.☆16Updated 9 months ago
- The language specification of PartiQL.☆150Updated last year
- A tool for describing pure data pipelines that enables avoiding repeating work (incrementality) and keeping old data around (provenance)☆71Updated 4 years ago
- Automatically loads new partitions in AWS Athena☆18Updated 4 years ago
- Amazon EMR on EKS Custom Image CLI☆28Updated 6 months ago
- Java JDBC Driver for easy access of remote SQL databases managed with AceQL HTTP☆27Updated 6 months ago
- This library contains the Kinesis Analytics stream processing runtime configuration classes.☆12Updated 2 months ago
- ☆22Updated 5 years ago
- A new generation of project generators☆9Updated 2 years ago
- Amazon CloudWatch Embedded Metric Format Client Library☆45Updated last month
- The ZetaSQL Toolkit is a library that helps users use ZetaSQL Java API to perform SQL analysis for multiple query engines, including BigQ…☆41Updated last week
- Shunting Yard is a real-time data replication tool that copies data between Hive Metastores.☆20Updated 3 years ago
- ☆15Updated 4 years ago
- A library for reading data from Amzon S3 with optimised listing using Amazon SQS using Spark SQL Streaming ( or Structured streaming).☆17Updated 11 months ago