pabloem / awesome-beamView external linksLinks
A curated list of awesome resources for Apache Beam
☆145Nov 11, 2022Updated 3 years ago
Alternatives and similar repositories for awesome-beam
Users that are interested in awesome-beam are comparing it to the libraries listed below
Sorting:
- Repository for Beam College sessions☆112Apr 20, 2021Updated 4 years ago
- ☆81Nov 10, 2023Updated 2 years ago
- Collection of transforms for the Apache beam python SDK.☆90Dec 7, 2023Updated 2 years ago
- Apache Beam is a unified programming model for Batch and Streaming data processing.☆8,475Updated this week
- A Scala API for Apache Beam and Google Cloud Dataflow.☆2,613Updated this week
- Python dataflow pipeline☆10May 12, 2019Updated 6 years ago
- Apache Beam Site☆30Jan 22, 2026Updated 3 weeks ago
- Combination of Dockerized Hortonworks projects and other Hadoop ecosystem components☆10Oct 11, 2019Updated 6 years ago
- ☆42Jun 25, 2020Updated 5 years ago
- ☆47May 3, 2024Updated last year
- Kafka to Avro Writer based on Apache Beam. It's a generic solution that reads data from multiple kafka topics and stores it on in cloud s…☆25Apr 7, 2021Updated 4 years ago
- Kafka Streams + Memcached (e.g. AWS ElasticCache) for low-latency in-memory lookups☆13Nov 4, 2019Updated 6 years ago
- Config for monitoring Cassandra using Telegraf's "Jolokia" plugin☆12Dec 27, 2018Updated 7 years ago
- Let's learn Beam, processing Movie Lens 20m datas. Get top three genres for each user☆14Aug 26, 2018Updated 7 years ago
- A tool for data sampling, data generation, and data diffing☆344Jan 8, 2026Updated last month
- Asgarde allows simplifying error handling with Apache Beam Java, with less code, more concise and expressive code.☆88Dec 19, 2025Updated last month
- Camus Compressor merges files created by Camus and saves them in a compressed format.☆13Mar 20, 2023Updated 2 years ago
- An application that records stats about consumer group offset commits and reports them as prometheus metrics☆14Apr 27, 2019Updated 6 years ago
- Cloud Dataflow Google-provided templates for solving in-Cloud data tasks☆1,268Updated this week
- An open source framework for building data analytic applications.☆784Feb 2, 2026Updated last week
- hello-streams :: Introducing the stream-first mindset☆16Mar 5, 2024Updated last year
- A kubernetes operator to deploy and auto-scale KafkaConnect Application.☆20Feb 25, 2023Updated 2 years ago
- OGM☆19May 25, 2017Updated 8 years ago
- Stellate and urql like GraphQL Cache in Go☆23Oct 2, 2024Updated last year
- Yet Another (Spark) ETL Framework☆21Oct 21, 2023Updated 2 years ago
- Ephemeral Hadoop clusters using Google Compute Platform☆134Mar 31, 2022Updated 3 years ago
- Common solutions and tools developed by Google Cloud's Professional Services team. This repository and its contents are not an officially…☆2,993Updated this week
- ☆24Feb 4, 2021Updated 5 years ago
- Apache Beam Example 中国开源社区☆21Jul 1, 2022Updated 3 years ago
- Use Vagrant and Ambari Blueprint API to install PivotalHD 3.0 (or Hortonworks HDP2.x) Hadoop cluster with HAWQ 1.3 (SQL on Hadoop) and Sp…☆23Jul 20, 2016Updated 9 years ago
- Data Catalog is a service for indexing parameterized, strongly-typed data artifacts across revisions. It also powers Flytes memoization s…☆53Oct 9, 2023Updated 2 years ago
- Kafka Connect Transform to copy Avro schemas between Schema Registries☆61Oct 6, 2023Updated 2 years ago
- A RDBMS with Immutable Schema feature☆35Oct 13, 2021Updated 4 years ago
- It's me!☆10Nov 12, 2021Updated 4 years ago
- ETLy is an add-on dashboard service on top of Apache Airflow.☆68Jul 21, 2023Updated 2 years ago
- A template for writing a Nomad driver plugin.☆33Jan 5, 2026Updated last month
- Scala + Druid: Scruid. A library that allows you to compose queries in Scala, and parse the result back into typesafe classes.☆117Jul 4, 2021Updated 4 years ago
- Examples using Google Cloud Dataflow - Apache Beam☆35Dec 8, 2022Updated 3 years ago
- This is the support code and solutions for the NYC Taxi Tycoon Dataflow Codelab☆63Oct 25, 2019Updated 6 years ago