grisha / json2avro
Fast JSON to Avro converter
☆61 Updated 6 years ago
Alternatives and similar repositories for json2avro
Users who are interested in json2avro are comparing it to the libraries listed below.
- The Schema Repo is a RESTful web service for storing and serving mappings between schema identifiers and schema definitions. ☆155 Updated 3 years ago
- kafka-connect-s3 : Ingest data from Kafka to Object Stores(s3) ☆95 Updated 6 years ago
- ☆68 Updated 9 years ago
- Tools for working with parquet, impala, and hive ☆134 Updated 4 years ago
- A rough prototype of a tool for discovering Apache Hive schemas from JSON documents. ☆42 Updated last year
- [PROJECT IS NO LONGER MAINTAINED] Wirbelsturm is a Vagrant and Puppet based tool to perform 1-click local and remote deployments, with a … ☆328 Updated 3 years ago
- A connector for SingleStore and Spark ☆162 Updated 2 weeks ago
- A Spark WordCountJob example as a standalone SBT project with Specs2 tests, runnable on Amazon EMR ☆118 Updated 9 years ago
- An Apache Storm IMetricsConsumer that forwards Storm's built-in metrics to a Graphite server for real-time graphing, visualization, and o… ☆76 Updated 2 years ago
- Hadoop mapreduce job to bulk load data into Cassandra ☆76 Updated 3 years ago
- Avro to JSON Schema, and back ☆134 Updated last year
- Fork of Cloudera Impala separated from Hadoop ☆42 Updated 9 years ago
- Delimited file loader for Cassandra ☆198 Updated 6 years ago
- Schedoscope is a scheduling framework for painfree agile development, testing, (re)loading, and monitoring of your datahub, lake, or what… ☆97 Updated 5 years ago
- ☆76 Updated 10 years ago
- Examples on how to use the command line tools in Avro Tools to read and write Avro files ☆154 Updated last year
- A Bulk Data Pipeline out of Cassandra ☆323 Updated 6 years ago
- Herd is a managed data lake for the cloud. The Herd unified data catalog helps separate storage from compute in the cloud. Manage petabyt… ☆138 Updated 2 years ago
- Hadoop output committers for S3 ☆111 Updated 5 years ago
- Low level integration of Spark and Kafka ☆130 Updated 7 years ago
- Read SparkSQL parquet file as RDD[Protobuf] ☆93 Updated 6 years ago
- Trifecta is a web-based and CLI tool that simplifies inspecting Kafka messages and Zookeeper data. Additionally, the CLI tool provides th… ☆215 Updated 6 years ago
- Metrics produced to Kafka and consumers for monitoring them ☆101 Updated 10 years ago
- s3mper - Consistent Listing for S3 ☆229 Updated 2 years ago
- recordbus: mysql binlog to apache kafka ☆80 Updated 10 years ago
- Generates more or less realistic log data for testing simple aggregation queries. ☆260 Updated last year
- A Kafka-Connect Sink for S3 with no Hadoop dependencies. ☆57 Updated 2 years ago
- PostgreSQL protocol gateway for Presto distributed SQL query engine ☆292 Updated 2 years ago
- Simplify getting Zeppelin up and running ☆56 Updated 9 years ago
- A library to expose more of Apache Spark's metrics system ☆146 Updated 5 years ago