AzimoLabs / kafka-to-avro-writer
Kafka to Avro Writer based on Apache Beam. It's a generic solution that reads data from multiple Kafka topics and stores it in cloud storage in Avro format.
☆25 Updated 4 years ago
Alternatives and similar repositories for kafka-to-avro-writer:
Users who are interested in kafka-to-avro-writer are comparing it to the libraries listed below.
- Camus Compressor merges files created by Camus and saves them in a compressed format. ☆12 Updated 2 years ago
- ☆26 Updated 5 years ago
- Spark stream from Kafka (JSON) to S3 (Parquet) ☆15 Updated 6 years ago
- A fork of the Apache Kafka "connect-file" Kafka Connect, to use as a starting point to write your own Kafka connectors. ☆37 Updated 7 years ago
- ☆81 Updated last year
- Bulletproof Apache Spark jobs with fast root cause analysis of failures. ☆72 Updated 4 years ago
- A library for strong, schema-based conversion between 'natural' JSON documents and Avro ☆18 Updated last year
- Used to generate mock Avro data ☆15 Updated 6 years ago
- File compaction tool that runs on top of the Spark framework. ☆59 Updated 6 years ago
- Schema Registry integration for Apache Spark ☆40 Updated 2 years ago
- A collection of examples and use-cases for Kafka Streams ☆64 Updated 8 years ago
- An application that records stats about consumer group offset commits and reports them as Prometheus metrics ☆14 Updated 6 years ago
- functionstest ☆33 Updated 8 years ago
- kafka-connect-s3: Ingest data from Kafka to object stores (S3) ☆95 Updated 6 years ago
- Example projects for using Spark and Cassandra with DSE Analytics ☆58 Updated last year
- Cascading on Apache Flink® ☆54 Updated last year
- Utility project for working with Kafka Connect. ☆34 Updated 9 months ago
- ☆67 Updated 6 years ago
- ☆22 Updated 6 years ago
- Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds. ☆88 Updated last year
- ☆21 Updated 2 years ago
- Spark cloud integration: tests, cloud committers and more ☆19 Updated 3 months ago
- Fast Apache Avro serialization/deserialization library ☆43 Updated 4 years ago
- A user-friendly API for checking for and reporting on Avro schema incompatibilities. ☆59 Updated last year
- Spooker is a dynamic framework for processing high volume data streams via processing pipelines ☆29 Updated 9 years ago
- This will help you to generate an Avro schema from a JSON schema. ☆34 Updated 2 years ago
- Simple Lambda Architecture implementation based on Apache Spark (Core, SQL, Streaming) ☆40 Updated 8 years ago
- Jumbune, an open source BigData APM & Data Quality Management Platform for Data Clouds. Enterprise feature offering is available at http:… ☆70 Updated 2 years ago
- Web Based Kafka Consumer and Producer ☆69 Updated 5 years ago
- These are some code examples ☆55 Updated 5 years ago