spotify/dbeam

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/spotify/dbeam)

spotify / dbeam

DBeam exports SQL tables into Avro files using JDBC and Apache Beam

☆197

Alternatives and similar repositories for dbeam

Users that are interested in dbeam are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

spotify / scio
View on GitHub
A Scala API for Apache Beam and Google Cloud Dataflow.
☆2,626Jul 14, 2026Updated last week
spotify / spydra
View on GitHub
Ephemeral Hadoop clusters using Google Compute Platform
☆136Mar 31, 2022Updated 4 years ago
spotify / gcs-tools
View on GitHub
GCS support for avro-tools, parquet-tools and protobuf
☆79Jul 14, 2026Updated last week
Powerspace / pg2bq
View on GitHub
Export PostgreSQL tables to Google BigQuery
☆37Jun 14, 2021Updated 5 years ago
spotify / styx
View on GitHub
"The path to execution", Styx is a service that schedules batch data processing jobs in Docker containers on Kubernetes.
☆271Jul 12, 2023Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
QubitProducts / dataflow_launcher
View on GitHub
A unified way of launching Dataflow jobs
☆13Apr 17, 2023Updated 3 years ago
spotify / featran
View on GitHub
A Scala feature transformation library for data science and machine learning
☆475Feb 7, 2025Updated last year
spotify / java-locales
View on GitHub
This library was created to ensure a consistent and culturally relevant localized end-user experience, by leveraging Unicode locale data …
☆11May 20, 2026Updated 2 months ago
marcoferrer / kotlin-coroutines-gRPC-template
View on GitHub
gRPC Kotlin template project for getting started building clients and services using Kotlin Coroutines and kroto-plus code generation.
☆12Sep 1, 2019Updated 6 years ago
GoogleCloudPlatform / pontem
View on GitHub
Open source tools for Google Cloud Storage and Databases.
☆65May 1, 2024Updated 2 years ago
spotify / noether
View on GitHub
Scala Aggregators used for ML Model metrics monitoring
☆93Sep 13, 2023Updated 2 years ago
GoogleCloudPlatform / protoc-gen-bq-schema
View on GitHub
protoc-gen-bq-schema helps you to send your Protocol Buffer messages to BigQuery.
☆264Oct 29, 2025Updated 8 months ago
alexvanboxel / airflow-gcp-examples
View on GitHub
Repository with examples and smoke tests for the GCP Airflow operators and hooks
☆152Jan 15, 2017Updated 9 years ago
googleapis / java-pubsub-group-kafka-connector
View on GitHub
☆49Jun 26, 2026Updated 3 weeks ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
GoogleCloudPlatform / processing-logs-using-dataflow
View on GitHub
Processing Logs at Scale using Cloud Dataflow
☆60Mar 18, 2019Updated 7 years ago
macroadster / HMS
View on GitHub
Hadoop Management System
☆16Dec 2, 2025Updated 7 months ago
vendasta / cloudpypi
View on GitHub
A PyPI compatible server running on App Engine
☆12Nov 13, 2017Updated 8 years ago
st3fan / departures-board
View on GitHub
Clojure Transit Tracker
☆20Aug 5, 2022Updated 3 years ago
apache / beam
View on GitHub
Apache Beam is a unified programming model for Batch and Streaming data processing.
☆8,636Updated this week
Talend / beam-samples
View on GitHub
☆80Nov 10, 2023Updated 2 years ago
servian / bigquery-view-analyzer
View on GitHub
A command-line tool for managing permissions and dependencies for BigQuery authorized views
☆92May 21, 2022Updated 4 years ago
rishisinghal / BeamPipelineSamples
View on GitHub
Provides different code samples for Apache Beam and DataFlow
☆14Sep 29, 2023Updated 2 years ago
nevillelyh / scio-deep-dive
View on GitHub
Building Scio from scratch step by step
☆20May 20, 2019Updated 7 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
spotify / ratatool
View on GitHub
A tool for data sampling, data generation, and data diffing
☆349Mar 31, 2026Updated 3 months ago
malike / elasticsearch-kafka-watch
View on GitHub
A custom watcher plugin for Elasticsearch that feeds Apache Kafka
☆11Mar 9, 2018Updated 8 years ago
mkocikowski / kafkaclient
View on GitHub
Golang kafka client based on libkafka
☆11Mar 4, 2026Updated 4 months ago
Netflix / iceberg
View on GitHub
Iceberg is a table format for large, slow-moving tabular data
☆494Apr 10, 2023Updated 3 years ago
ewhauser / flume-kafka-plugin
View on GitHub
☆23Oct 17, 2011Updated 14 years ago
Cascading / cascading-hive
View on GitHub
Integration for Cascading and Apache Hive
☆25Oct 31, 2017Updated 8 years ago
GoogleCloudPlatform / DataflowTemplates
View on GitHub
Cloud Dataflow Google-provided templates for solving in-Cloud data tasks
☆1,308Updated this week
apache / livy-website
View on GitHub
Mirror of Apache livy (Incubating)
☆13Jul 7, 2026Updated last week
spotify / magnolify
View on GitHub
A collection of Magnolia add-on modules
☆180Updated this week
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
qubole / quark
View on GitHub
Quark is a data virtualization engine over analytic databases.
☆101Jul 13, 2017Updated 9 years ago
onefoursix / hdfs-inotify-example
View on GitHub
HDFS inotify Example
☆22Feb 8, 2023Updated 3 years ago
apache / flink-connector-prometheus
View on GitHub
Apache flink
☆16May 15, 2026Updated 2 months ago
donos-community / kind-sir
View on GitHub
Kind sir will merge your merge requests. Properly.
☆13Sep 18, 2020Updated 5 years ago
JimRottinger / looker-vis-builder
View on GitHub
A browser-based IDE for developing Looker custom visualizations
☆10Nov 11, 2022Updated 3 years ago
wepay / kafka-connect-bigquery
View on GitHub
DEPRECATED. PLEASE USE https://github.com/confluentinc/kafka-connect-bigquery. A Kafka Connect BigQuery sink connector
☆151Mar 4, 2024Updated 2 years ago
JordanEC / RestApiSpringBootExample
View on GitHub
REST API using: Spring Boot + Hibernate + MySQL + Jackson + Retrofit
☆10Jan 22, 2016Updated 10 years ago