anemos-io / protobeam
☆22Updated 5 years ago
Alternatives and similar repositories for protobeam:
Users that are interested in protobeam are comparing it to the libraries listed below
- Demonstration of a Hive Input Format for Iceberg☆26Updated 3 years ago
- Bullet is a streaming query engine that can be plugged into any singular data stream using a Stream Processing framework like Apache Stor…☆41Updated 2 years ago
- Bulletproof Apache Spark jobs with fast root cause analysis of failures.☆72Updated 3 years ago
- Use SQL to transform your avro schema/records☆28Updated 7 years ago
- A highly available and infinitely scalable, drop-in replacement for Kafka Streams☆16Updated this week
- A protobuf schema registry on steroids. It will keep track of the contracts throughout your organization, making sure no contract is brok…☆43Updated 4 years ago
- Spark package to "plug" holes in data using SQL based rules ⚡️ 🔌☆28Updated 4 years ago
- Using the Parquet file format (with Avro) to process data with Apache Flink☆14Updated 9 years ago
- An Operator for scheduling and executing NiFi Flows as Jobs on Kubernetes☆53Updated 4 years ago
- Fast Apache Avro serialization/deserialization library☆43Updated 4 years ago
- Lenses.io JDBC driver for Apache Kafka☆20Updated 3 years ago
- Cloud Spanner Connector for Apache Spark☆17Updated 3 weeks ago
- Extensible streaming ingestion pipeline on top of Apache Spark☆44Updated 10 months ago
- Scalable CDC Pattern Implemented using PySpark☆18Updated 5 years ago
- Kafka to Avro Writer based on Apache Beam. It's a generic solution that reads data from multiple kafka topics and stores it on in cloud s…☆25Updated 3 years ago
- Apache Amaterasu☆56Updated 5 years ago
- ☆50Updated 4 years ago
- An application that records stats about consumer group offset commits and reports them as prometheus metrics☆14Updated 5 years ago
- Example: Convert Protobuf to Parquet using parquet-avro and avro-protobuf☆30Updated 9 years ago
- ☆36Updated 2 years ago
- A Transactional Metadata Store Backed by Apache Kafka☆22Updated last week
- Java/Scala library for easily authoring Flyte tasks and workflows☆44Updated 4 months ago
- ☆23Updated 3 weeks ago
- Scala API for Apache Spark SQL high-order functions☆14Updated last year
- Schema registry for CSV, TSV, JSON, AVRO and Parquet schema. Supports schema inference and GraphQL API.☆111Updated 4 years ago
- A dynamic data completeness and accuracy library at enterprise scale for Apache Spark☆30Updated 2 months ago
- Kubernetes Operator for the Ververica Platform☆35Updated 2 years ago
- GCS support for avro-tools, parquet-tools and protobuf☆74Updated this week
- Sample processing code using Spark 2.1+ and Scala☆51Updated 4 years ago
- Kafka Streams + Memcached (e.g. AWS ElasticCache) for low-latency in-memory lookups☆13Updated 5 years ago