strelec / hive-serde-schema-gen
Generate Hive SerDe schema from a .json file.
☆35Updated 8 years ago
Alternatives and similar repositories for hive-serde-schema-gen
Users that are interested in hive-serde-schema-gen are comparing it to the libraries listed below
Sorting:
- functionstest☆33Updated 8 years ago
- A rough prototype of a tool for discovering Apache Hive schemas from JSON documents.☆42Updated last year
- Kite SDK Examples☆98Updated 4 years ago
- Offline Hadoop Elasticsearch Index Building and Tools For Lambda Architectures☆31Updated last year
- Example project to show how to use Spark to read and write Avro/Parquet files☆50Updated 11 years ago
- Monitor Twitter stream for S&P 500 companies to identify & act on unexpected increases in tweet volume☆38Updated 9 years ago
- A super simple utility for testing Apache Hive scripts locally for non-Java developers.☆72Updated 8 years ago
- Single view demo☆14Updated 9 years ago
- This is an introduction of Apache Spark DataFrames.☆41Updated 10 years ago
- Vagrant files creating multi-node virtual Hadoop clusters with or without security.☆67Updated 5 years ago
- kafka-connect-s3 : Ingest data from Kafka to Object Stores(s3)☆95Updated 6 years ago
- Sample custom Nifi processor to process tcpdump☆18Updated 9 years ago
- Starter project for building MemSQL Streamliner Pipelines☆32Updated 8 years ago
- Complete Pipeline Training at Big Data Scala By the Bay☆71Updated 9 years ago
- File compaction tool that runs on top of the Spark framework.☆59Updated 6 years ago
- An example of using Avro and Parquet in Spark SQL☆60Updated 9 years ago
- NiFi provenance reporting tasks☆14Updated last year
- Apache Spark AWS Lambda Executor (SAMBA)☆44Updated 6 years ago
- Avro Schema Shredder is a REST API that enables storage of Avro Schemas in Apache Atlas. This API enables an organization to use Apache A…☆13Updated 8 years ago
- Workshop for Hadoop Operations Best Practices☆10Updated 10 years ago
- Traverse HDFS without jvm startup delays and directory context!! Supports multiple HDFS hosts, command line history and tab completion.☆17Updated 8 years ago
- Low level integration of Spark and Kafka☆130Updated 7 years ago
- NRT Sessionization with Spark Streaming landing on HDFS and putting live stats in HBase☆51Updated 10 years ago
- ☆92Updated 8 years ago
- Hadoop Data Pipeline using Falcon☆15Updated 9 years ago
- Scripts for parsing / making sense of yarn logs☆52Updated 8 years ago
- Reference Architectures for Apache Spark☆38Updated 8 years ago
- ☆10Updated 10 years ago
- An Apache Spark-shell backend for IPython☆105Updated 3 years ago
- ☆14Updated 8 years ago