dvryaboy / idl_storage_guidelinesLinks

This document attempts to capture useful patterns and warn about subtle gotchas when it comes to designing and evolving schemas for long-term serialized data. It is not intended as a guide for how to best represent a particular dataset or process.

☆13

Alternatives and similar repositories for idl_storage_guidelines

Users that are interested in idl_storage_guidelines are comparing it to the libraries listed below

Sorting:

rayokota / kafka-graphs
Graph Analytics with Apache Kafka
☆106Updated this week
liquidm / druid-dumbo
☆21Updated 2 years ago
farmdawgnation / kafka-hawk
An application that records stats about consumer group offset commits and reports them as prometheus metrics
☆14Updated 6 years ago
bullet-db / bullet-core
Bullet is a streaming query engine that can be plugged into any singular data stream using a Stream Processing framework like Apache Stor…
☆41Updated 2 years ago
nezihyigitbasi / FlinkParquet
Using the Parquet file format (with Avro) to process data with Apache Flink
☆14Updated 10 years ago
QuickSign / kafka-encryption
Kafka End to End Encryption
☆53Updated 2 years ago
nevillelyh / scio-deep-dive
Building Scio from scratch step by step
☆20Updated 6 years ago
DataDog / spark-jvm-profiler
## Auto-archived due to inactivity. ## Simple JVM Profiler Using StatsD and Other Metrics Backends
☆15Updated 2 years ago
smartcat-labs / ranger
Ranger is contextual data generator used to make sensible data for integration tests or to play with it in the database
☆61Updated 5 years ago
indix / schemer
Schema registry for CSV, TSV, JSON, AVRO and Parquet schema. Supports schema inference and GraphQL API.
☆112Updated 5 years ago
theduderog / hello-samza-confluent
Simple Samza Job Using Confluent Platform
☆14Updated 9 years ago
wushujames / kafka-utilities
☆26Updated 5 years ago
jkorab / kafka-cloud-calculator
No longer maintained. Use https://eventsizer.io instead.
☆62Updated 6 years ago
lensesio / kafka-connect-query-language
SQL for Kafka Connectors
☆99Updated last year
dataArtisans / cascading-flink
Cascading on Apache Flink®
☆54Updated last year
lensesio / kafka-connect-tools
Kafka Connect Tooling
☆117Updated 4 years ago
hbutani / spark-datetime
functionstest
☆33Updated 9 years ago
steveloughran / zero-rename-committer
Paper: A Zero-rename committer for object stores
☆20Updated 2 weeks ago
conduktor / conduktor-gateway-demos
Demos using Conduktor Gateway
☆18Updated last year
AbsaOSS / atum
A dynamic data completeness and accuracy library at enterprise scale for Apache Spark
☆29Updated 11 months ago
TIBCOSoftware / snappy-examples
Use cases built on SnappyData. Use cases contained here: 1. Ad Analytics 2. Streaming data ingestion from RabbitMQ.
☆32Updated 3 years ago
schema-repo / schema-repo
The Schema Repo is a RESTful web service for storing and serving mappings between schema identifiers and schema definitions.
☆154Updated 3 years ago
spotify / spydra
Ephemeral Hadoop clusters using Google Compute Platform
☆134Updated 3 years ago
salesforce / mirus
Mirus is a cross data-center data replication tool for Apache Kafka
☆206Updated 4 months ago
rmoff / vsc-ksql
KSQL Syntax Highlighting for VSCode
☆17Updated 2 years ago
jcustenborder / connect-utils
Utility project for working with Kafka Connect.
☆34Updated last year
PBWebMedia / airflow-prometheus-exporter
Export Airflow metrics (from mysql) in prometheus format
☆29Updated 6 months ago
AzimoLabs / kafka-to-avro-writer
Kafka to Avro Writer based on Apache Beam. It's a generic solution that reads data from multiple kafka topics and stores it on in cloud s…
☆25Updated 4 years ago
AdRoll / cantor
Cantor provides utilities for estimating the cardinality of large sets.
☆83Updated 3 years ago
lensesio / coyote
Environment, operations and runtime-meta testing tool.
☆90Updated 4 years ago