tubular / confluent-spark-avro
Spark UDFs to deserialize Avro messages with schemas stored in Schema Registry.
☆18Updated 7 years ago
Alternatives and similar repositories for confluent-spark-avro:
Users that are interested in confluent-spark-avro are comparing it to the libraries listed below
- POC: Spark consumer for bottledwater-pg Kafka Avro topics☆17Updated 4 years ago
- ☆26Updated 5 years ago
- Schema Registry integration for Apache Spark☆40Updated 2 years ago
- Flink Examples☆39Updated 8 years ago
- Cascading on Apache Flink®☆54Updated last year
- functionstest☆33Updated 8 years ago
- Starter project for building MemSQL Streamliner Pipelines☆32Updated 7 years ago
- Spark stream from kafka(json) to s3(parquet)☆15Updated 6 years ago
- Use cases built on SnappyData. Use cases contained here: 1. Ad Analytics 2. Streaming data ingestion from RabbitMQ.☆32Updated 2 years ago
- Spark cloud integration: tests, cloud committers and more☆19Updated 2 weeks ago
- Docker Image and Kubernetes Configurations for Spark 2.x☆41Updated 5 years ago
- Common utilities for Apache Kafka☆36Updated last year
- HDFS compatible Distributed Filesystem backed Cassandra☆25Updated 9 years ago
- Utilities for writing tests that use Apache Spark.☆24Updated 6 years ago
- ☆21Updated 9 years ago
- Embedded Kafka for testing and quick prototyping.☆14Updated 8 years ago
- Library offering http based query on top of Kafka Streams Interactive Queries☆69Updated last year
- Bullet is a streaming query engine that can be plugged into any singular data stream using a Stream Processing framework like Apache Stor…☆41Updated 2 years ago
- Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds.☆88Updated 11 months ago
- A basic example of how to read and write streaming data using Apache Spark and Kafka on HDInsight☆13Updated last year
- Experiments with the GDELT dataset and Cassandra schemas.☆25Updated 9 years ago
- ## Auto-archived due to inactivity. ## Simple JVM Profiler Using StatsD and Other Metrics Backends☆15Updated last year
- Example code for building your own MemSQL Streamliner Pipelines☆23Updated 7 years ago
- Spark structured streaming with Kafka data source and writing to Cassandra☆63Updated 5 years ago
- A Kafka Streams process to convert __consumer_offsets to a JSON-readable topic☆13Updated 5 years ago
- Cloudbreak Deployer Tool☆34Updated last year
- Common components used across the datamountaineer kafka connect connectors☆21Updated 4 years ago
- A library for strong, schema based conversion between 'natural' JSON documents and Avro☆18Updated 11 months ago
- kafka-connect-s3 : Ingest data from Kafka to Object Stores(s3)☆95Updated 5 years ago
- Spooker is a dynamic framework for processing high volume data streams via processing pipelines☆29Updated 9 years ago