DBeam exports SQL tables into Avro files using JDBC and Apache Beam
☆193 Oct 28, 2025 Updated 4 months ago
Alternatives and similar repositories for dbeam
Users who are interested in dbeam are comparing it to the repositories listed below
- "The path to execution", Styx is a service that schedules batch data processing jobs in Docker containers on Kubernetes. ☆269 Jul 12, 2023 Updated 2 years ago
- Export PostgreSQL tables to Google BigQuery ☆37 Jun 14, 2021 Updated 4 years ago
- A Scala API for Apache Beam and Google Cloud Dataflow. ☆2,615 Feb 12, 2026 Updated 2 weeks ago
- Ephemeral Hadoop clusters using Google Compute Platform ☆135 Mar 31, 2022 Updated 3 years ago
- GCS support for avro-tools, parquet-tools and protobuf ☆79 May 5, 2025 Updated 9 months ago
- gRPC Kotlin template project for getting started building clients and services using Kotlin Coroutines and kroto-plus code generation. ☆12 Sep 1, 2019 Updated 6 years ago
- A PyPI compatible server running on App Engine ☆11 Nov 13, 2017 Updated 8 years ago
- ☆54 Aug 3, 2017 Updated 8 years ago
- A Scala feature transformation library for data science and machine learning ☆474 Feb 7, 2025 Updated last year
- Useful Cloud Dataflow custom templates. ☆16 Dec 14, 2022 Updated 3 years ago
- Open source tools for Google Cloud Storage and Databases. ☆63 May 1, 2024 Updated last year
- Repository with examples and smoke tests for the GCP Airflow operators and hooks ☆152 Jan 15, 2017 Updated 9 years ago
- AngularJS directives for dojo widgets ☆33 Apr 18, 2014 Updated 11 years ago
- A unified way of launching Dataflow jobs ☆13 Apr 17, 2023 Updated 2 years ago
- Axon Framework extension for Spring Cloud's Discovery mechanism integration to distribute Command messages. ☆27 Feb 12, 2026 Updated 2 weeks ago
- Processing Logs at Scale using Cloud Dataflow ☆62 Mar 18, 2019 Updated 6 years ago
- Building Scio from scratch step by step ☆20 May 20, 2019 Updated 6 years ago
- Powerful framework providing many useful utilities and features on top of the Scala language. ☆15 Feb 8, 2017 Updated 9 years ago
- OpenCLIP photo index and search application ☆10 May 11, 2023 Updated 2 years ago
- Apache flink ☆16 Jul 12, 2025 Updated 7 months ago
- Example project to show how to use Kafka from Spark Streaming with the Confluent schema registry ☆11 Aug 17, 2016 Updated 9 years ago
- A custom watcher plugin for Elasticsearch that feeds Apache Kafka ☆11 Mar 9, 2018 Updated 7 years ago
- This repository contains a simple demo application showing the usage of Micrometer Tracing with Kotlin and Spring Boot WebFlux. ☆11 Jan 7, 2023 Updated 3 years ago
- Quark is a data virtualization engine over analytic databases. ☆101 Jul 13, 2017 Updated 8 years ago
- HDFS inotify Example ☆22 Feb 8, 2023 Updated 3 years ago
- Schedoscope is a scheduling framework for painfree agile development, testing, (re)loading, and monitoring of your datahub, lake, or what… ☆96 Nov 14, 2019 Updated 6 years ago
- Helm Chart for lyft/flinkk8soperator ☆11 Mar 10, 2020 Updated 5 years ago
- syslog module for nginx ☆18 Sep 19, 2010 Updated 15 years ago
- Experimentation around LLM and MicroProfile ☆17 Nov 20, 2025 Updated 3 months ago
- Stream Avro SpecificRecord objects in BigQuery using Cloud Dataflow ☆13 Jan 4, 2022 Updated 4 years ago
- A storage format that is 5 to 100 times faster than Spark-Parquet ☆12 Feb 22, 2016 Updated 10 years ago
- Demo Spring Boot application illustrating usage of an externalized configuration ☆14 Feb 19, 2026 Updated last week
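- A migration tool targeting one-way migration from Oracle, MySQL, and SqlServer to PostgreSQL, and two-way migration between PostgreSQL and big data platforms such as Hive, Hbase, and Impala. ☆10 Dec 3, 2014 Updated 11 years ago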
- Code snippets for solving common big data problems in various platforms. Inspired by Rosetta Code ☆296 Jan 31, 2025 Updated last year
- SamzaSQL: Streaming SQL implementation on top of Apache Samza and Apache Kafka ☆29 Jun 8, 2016 Updated 9 years ago
- ☆11 Jun 10, 2016 Updated 9 years ago
- ☆10 Oct 15, 2019 Updated 6 years ago
- Fork of Cloudera Impala separated from Hadoop ☆42 Jul 13, 2016 Updated 9 years ago
- Scala Aggregators used for ML Model metrics monitoring ☆91 Sep 13, 2023 Updated 2 years ago