DBeam exports SQL tables into Avro files using JDBC and Apache Beam
☆194Oct 28, 2025Updated 4 months ago
Alternatives and similar repositories for dbeam
Users that are interested in dbeam are comparing it to the libraries listed below
Sorting:
- A Scala API for Apache Beam and Google Cloud Dataflow.☆2,620Feb 27, 2026Updated 3 weeks ago
- GCS support for avro-tools, parquet-tools and protobuf☆79May 5, 2025Updated 10 months ago
- "The path to execution", Styx is a service that schedules batch data processing jobs in Docker containers on Kubernetes.☆270Jul 12, 2023Updated 2 years ago
- Export PostgreSQL tables to Google BigQuery☆37Jun 14, 2021Updated 4 years ago
- A unified way of launching Dataflow jobs☆13Apr 17, 2023Updated 2 years ago
- ☆54Aug 3, 2017Updated 8 years ago
- A Scala feature transformation library for data science and machine learning☆473Feb 7, 2025Updated last year
- Open source tools for Google Cloud Storage and Databases.☆63May 1, 2024Updated last year
- gRPC Kotlin template project for getting started building clients and services using Kotlin Coroutines and kroto-plus code generation.☆12Sep 1, 2019Updated 6 years ago
- ☆67Aug 16, 2024Updated last year
- Scala Aggregators used for ML Model metrics monitoring☆91Sep 13, 2023Updated 2 years ago
- Processing Logs at Scale using Cloud Dataflow☆62Mar 18, 2019Updated 7 years ago
- protoc-gen-bq-schema helps you to send your Protocol Buffer messages to BigQuery.☆263Oct 29, 2025Updated 4 months ago
- Repository with examples and smoke tests for the GCP Airflow operators and hooks☆152Jan 15, 2017Updated 9 years ago
- A PyPI compatible server running on App Engine☆11Nov 13, 2017Updated 8 years ago
- A command-line tool for managing permissions and dependencies for BigQuery authorized views☆92May 21, 2022Updated 3 years ago
- ☆81Nov 10, 2023Updated 2 years ago
- ☆15May 2, 2019Updated 6 years ago
- Provides different code samples for Apache Beam and DataFlow☆14Sep 29, 2023Updated 2 years ago
- A wrapper for Hadoop in Scala☆42Jul 18, 2010Updated 15 years ago
- Building Scio from scratch step by step☆20May 20, 2019Updated 6 years ago
- A custom watcher plugin for Elasticsearch that feeds Apache Kafka☆11Mar 9, 2018Updated 8 years ago
- A tool for data sampling, data generation, and data diffing☆346Jan 8, 2026Updated 2 months ago
- ☆35Mar 22, 2023Updated 2 years ago
- Iceberg is a table format for large, slow-moving tabular data☆490Apr 10, 2023Updated 2 years ago
- TensorFlow TFRecord reader CLI tool☆61Dec 15, 2025Updated 3 months ago
- A lightweight workflow definition library☆155Jul 15, 2022Updated 3 years ago
- Powerful framework providing many useful utilities and features on top of the Scala language.☆15Feb 8, 2017Updated 9 years ago
- AngularJS directives for dojo widgets☆33Apr 18, 2014Updated 11 years ago
- Mirror of Apache livy (Incubating)☆14Feb 11, 2026Updated last month
- Cloud Dataflow Google-provided templates for solving in-Cloud data tasks☆1,288Updated this week
- Quark is a data virtualization engine over analytic databases.☆100Jul 13, 2017Updated 8 years ago
- A collection of Magnolia add-on modules☆182Feb 12, 2026Updated last month
- 一个比Spark-Parquet还快5~100倍的存储格式☆12Feb 22, 2016Updated 10 years ago
- Helm Chart for lyft/flinkk8soperator☆11Mar 10, 2020Updated 6 years ago
- Apache flink☆16Jul 12, 2025Updated 8 months ago
- In-deprecation. For Lenses please check lensesio/lenses-helm-charts. Soon Stream Reactor will also get its own Helm repository.☆70Aug 2, 2020Updated 5 years ago
- HDFS inotify Example☆22Feb 8, 2023Updated 3 years ago
- DEPRECATED. PLEASE USE https://github.com/confluentinc/kafka-connect-bigquery. A Kafka Connect BigQuery sink connector☆152Mar 4, 2024Updated 2 years ago