DBeam exports SQL tables into Avro files using JDBC and Apache Beam
☆196Apr 17, 2026Updated 2 weeks ago
Alternatives and similar repositories for dbeam
Users that are interested in dbeam are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A Scala API for Apache Beam and Google Cloud Dataflow.☆2,623Apr 13, 2026Updated 2 weeks ago
- "The path to execution", Styx is a service that schedules batch data processing jobs in Docker containers on Kubernetes.☆271Jul 12, 2023Updated 2 years ago
- Export PostgreSQL tables to Google BigQuery☆37Jun 14, 2021Updated 4 years ago
- Useful Cloud Dataflow custom templates.☆16Dec 14, 2022Updated 3 years ago
- A Scala feature transformation library for data science and machine learning☆474Feb 7, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- gRPC Kotlin template project for getting started building clients and services using Kotlin Coroutines and kroto-plus code generation.☆12Sep 1, 2019Updated 6 years ago
- Processing Logs at Scale using Cloud Dataflow☆62Mar 18, 2019Updated 7 years ago
- protoc-gen-bq-schema helps you to send your Protocol Buffer messages to BigQuery.☆264Oct 29, 2025Updated 6 months ago
- Repository with examples and smoke tests for the GCP Airflow operators and hooks☆152Jan 15, 2017Updated 9 years ago
- Code snippets for solving common big data problems in various platforms. Inspired by Rosetta Code☆297Jan 31, 2025Updated last year
- ☆49Apr 20, 2026Updated last week
- An example Dataform project to load and transform the publicly available dataset from IMDB.☆10Apr 27, 2024Updated 2 years ago
- Apache Beam is a unified programming model for Batch and Streaming data processing.☆8,558Updated this week
- A command-line tool for managing permissions and dependencies for BigQuery authorized views☆91May 21, 2022Updated 3 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆80Nov 10, 2023Updated 2 years ago
- Provides different code samples for Apache Beam and DataFlow☆14Sep 29, 2023Updated 2 years ago
- Building Scio from scratch step by step☆20May 20, 2019Updated 6 years ago
- Flyte Flink k8s plugin.☆20Apr 23, 2026Updated last week
- A custom watcher plugin for Elasticsearch that feeds Apache Kafka☆11Mar 9, 2018Updated 8 years ago
- A tool for data sampling, data generation, and data diffing☆346Mar 31, 2026Updated last month
- Yet-Another-Rules-Engine -- A easy-to-understand Business Readable DSL for defining production rules.☆14Mar 24, 2021Updated 5 years ago
- App Engine TCK☆48Dec 20, 2021Updated 4 years ago
- Iceberg is a table format for large, slow-moving tabular data☆492Apr 10, 2023Updated 3 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Replicates data between Google Cloud BigQuery projects☆22Jul 13, 2016Updated 9 years ago
- Powerful framework providing many useful utilities and features on top of the Scala language.☆15Feb 8, 2017Updated 9 years ago
- Airflow-Salesforce connector☆16Jul 5, 2017Updated 8 years ago
- The Spark::Form Perl module for effortlessly handling forms.☆14Aug 16, 2011Updated 14 years ago
- This repo contains the LookML for the model and dashboards used with the FHIR healthcare dataset to showcase how Looker can add value to …☆14Jan 5, 2023Updated 3 years ago
- Mirror of Apache livy (Incubating)☆13Feb 11, 2026Updated 2 months ago
- Cloud Dataflow Google-provided templates for solving in-Cloud data tasks☆1,289Updated this week
- Quark is a data virtualization engine over analytic databases.☆101Jul 13, 2017Updated 8 years ago
- A collection of Magnolia add-on modules☆182Feb 12, 2026Updated 2 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Helm Chart for lyft/flinkk8soperator☆11Mar 10, 2020Updated 6 years ago
- Apache flink☆17Jul 12, 2025Updated 9 months ago
- Experiments in Streaming☆60Aug 27, 2016Updated 9 years ago
- 一个比Spark-Parquet还快5~100倍的存储格式☆12Feb 22, 2016Updated 10 years ago
- HDFS inotify Example☆22Feb 8, 2023Updated 3 years ago
- In-deprecation. For Lenses please check lensesio/lenses-helm-charts. Soon Stream Reactor will also get its own Helm repository.☆70Aug 2, 2020Updated 5 years ago
- DEPRECATED. PLEASE USE https://github.com/confluentinc/kafka-connect-bigquery. A Kafka Connect BigQuery sink connector☆152Mar 4, 2024Updated 2 years ago