DBeam exports SQL tables into Avro files using JDBC and Apache Beam
☆195 · Oct 28, 2025 · Updated 5 months ago
Alternatives and similar repositories for dbeam
Users that are interested in dbeam are comparing it to the libraries listed below.
- A Scala API for Apache Beam and Google Cloud Dataflow. ☆2,621 · Apr 1, 2026 · Updated last week
- GCS support for avro-tools, parquet-tools and protobuf ☆80 · May 5, 2025 · Updated 11 months ago
- Ephemeral Hadoop clusters using Google Compute Platform ☆136 · Mar 31, 2022 · Updated 4 years ago
- "The path to execution", Styx is a service that schedules batch data processing jobs in Docker containers on Kubernetes. ☆271 · Jul 12, 2023 · Updated 2 years ago
- Export PostgreSQL tables to Google BigQuery ☆37 · Jun 14, 2021 · Updated 4 years ago
- A unified way of launching Dataflow jobs ☆13 · Apr 17, 2023 · Updated 2 years ago
- ☆54 · Aug 3, 2017 · Updated 8 years ago
- A Scala feature transformation library for data science and machine learning ☆473 · Feb 7, 2025 · Updated last year
- Open source tools for Google Cloud Storage and Databases. ☆63 · May 1, 2024 · Updated last year
- Scala Aggregators used for ML Model metrics monitoring ☆92 · Sep 13, 2023 · Updated 2 years ago
- Repository with examples and smoke tests for the GCP Airflow operators and hooks ☆152 · Jan 15, 2017 · Updated 9 years ago
- ☆48 · Mar 28, 2026 · Updated 2 weeks ago
- An example Dataform project to load and transform the publicly available dataset from IMDB. ☆10 · Apr 27, 2024 · Updated last year
- Apache Beam is a unified programming model for Batch and Streaming data processing. ☆8,545 · Updated this week
- A command-line tool for managing permissions and dependencies for BigQuery authorized views ☆91 · May 21, 2022 · Updated 3 years ago
- ☆80 · Nov 10, 2023 · Updated 2 years ago
- ☆15 · May 2, 2019 · Updated 6 years ago
- Provides different code samples for Apache Beam and DataFlow ☆14 · Sep 29, 2023 · Updated 2 years ago
- A custom watcher plugin for Elasticsearch that feeds Apache Kafka ☆11 · Mar 9, 2018 · Updated 8 years ago
- A tool for data sampling, data generation, and data diffing ☆346 · Mar 31, 2026 · Updated last week
- ☆35 · Mar 22, 2023 · Updated 3 years ago
- Iceberg is a table format for large, slow-moving tabular data ☆490 · Apr 10, 2023 · Updated 3 years ago
- TensorFlow TFRecord reader CLI tool ☆61 · Dec 15, 2025 · Updated 3 months ago
- A lightweight workflow definition library ☆156 · Jul 15, 2022 · Updated 3 years ago
- Powerful framework providing many useful utilities and features on top of the Scala language. ☆15 · Feb 8, 2017 · Updated 9 years ago
- Airflow-Salesforce connector ☆16 · Jul 5, 2017 · Updated 8 years ago
- AngularJS directives for dojo widgets ☆33 · Apr 18, 2014 · Updated 11 years ago
- Mirror of Apache livy (Incubating) ☆14 · Feb 11, 2026 · Updated last month
- Quark is a data virtualization engine over analytic databases. ☆101 · Jul 13, 2017 · Updated 8 years ago
- A collection of Magnolia add-on modules ☆182 · Feb 12, 2026 · Updated last month
- Helm Chart for lyft/flinkk8soperator ☆11 · Mar 10, 2020 · Updated 6 years ago
- Apache flink ☆16 · Jul 12, 2025 · Updated 8 months ago
- Experiments in Streaming ☆60 · Aug 27, 2016 · Updated 9 years ago
- In-deprecation. For Lenses please check lensesio/lenses-helm-charts. Soon Stream Reactor will also get its own Helm repository. ☆70 · Aug 2, 2020 · Updated 5 years ago
- HDFS inotify Example ☆22 · Feb 8, 2023 · Updated 3 years ago
- DEPRECATED. PLEASE USE https://github.com/confluentinc/kafka-connect-bigquery. A Kafka Connect BigQuery sink connector ☆152 · Mar 4, 2024 · Updated 2 years ago
- ☆10 · Oct 15, 2019 · Updated 6 years ago
- Google Cloud Dataflow provides a simple, powerful model for building both batch and streaming parallel data processing pipelines. This re… ☆167 · Jul 25, 2018 · Updated 7 years ago
- syslog module for nginx ☆18 · Sep 19, 2010 · Updated 15 years ago