dvryaboy / idl_storage_guidelines
This document attempts to capture useful patterns and warn about subtle gotchas when it comes to designing and evolving schemas for long-term serialized data. It is not intended as a guide for how to best represent a particular dataset or process.
☆13Updated 7 years ago
Related projects: ⓘ
- Simple Samza Job Using Confluent Platform☆15Updated 8 years ago
- ☆26Updated 4 years ago
- ☆21Updated last year
- An application that records stats about consumer group offset commits and reports them as prometheus metrics☆14Updated 5 years ago
- ☆27Updated this week
- Kafka Connect Connector for Jenkins Open Source Continuous Integration Tool☆30Updated last year
- Simple JVM Profiler Using StatsD and Other Metrics Backends☆15Updated 11 months ago
- HDFS compatible Distributed Filesystem backed Cassandra☆25Updated 9 years ago
- Bullet is a streaming query engine that can be plugged into any singular data stream using a Stream Processing framework like Apache Stor…☆41Updated last year
- CLI and Go Clients to manage Kafka components (Kafka Connect & SchemaRegistry)☆29Updated 7 years ago
- A Kafka-Connect Sink for S3 with no Hadoop dependencies.☆57Updated last year
- No longer maintained. Use https://eventsizer.io instead.☆62Updated 5 years ago
- ☆14Updated this week
- Bash completion for Kafka command line utilities.☆34Updated 6 years ago
- Playbook to provision a Confluent Cluster☆10Updated 6 years ago
- Repository for advanced unit-testing with embedded kafka services☆25Updated 5 years ago
- Spark job for compacting avro files together☆12Updated 6 years ago
- Shunting Yard is a real-time data replication tool that copies data between Hive Metastores.☆20Updated 2 years ago
- Test suite for Kafka Connect connectors based on Landoop's Coyote and docker.☆32Updated 4 years ago
- ☆22Updated 5 years ago
- Paper: A Zero-rename committer for object stores☆20Updated 3 years ago
- Spooker is a dynamic framework for processing high volume data streams via processing pipelines☆29Updated 8 years ago
- Spark stream from kafka(json) to s3(parquet)☆15Updated 5 years ago
- Library offering http based query on top of Kafka Streams Interactive Queries☆69Updated last year
- Terraform Modules for Setting up the Confluent Platform in AWS☆12Updated 2 years ago
- Use SQL to transform your avro schema/records☆28Updated 6 years ago
- SQL for Kafka Connectors☆96Updated 8 months ago
- Utility project for working with Kafka Connect.☆34Updated last month
- machine learning playground☆12Updated 7 years ago
- A native Kafka protocol proxy for Apache Kafka☆21Updated 6 years ago