Apache Beam starter repo for Python
☆22Jan 26, 2026Updated last month
Alternatives and similar repositories for beam-starter-python
Users that are interested in beam-starter-python are comparing it to the libraries listed below
Sorting:
- An ETL Pipeline built over GCP and orchestrated by Mage, which involves Extracting Data from GCS Bucket, building Dimensional Model (Star…☆13Aug 26, 2023Updated 2 years ago
- Showcase application for Cassandra database usage with Spring framework and DataStax Java driver☆10Jul 18, 2016Updated 9 years ago
- An ETL framework in Scala for Data Engineers☆23Aug 30, 2022Updated 3 years ago
- Manages Cloud Composer v1 and v2 along with option to manage networking☆56Updated this week
- Let's learn Beam, processing Movie Lens 20m datas. Get top three genres for each user☆14Aug 26, 2018Updated 7 years ago
- Link between Slack and another service with link buttons☆10Jun 1, 2018Updated 7 years ago
- MISeD (Meeting Information Seeking Dialogs dataset) is an information-seeking dialog dataset focused on meeting transcripts. It includes …☆14Nov 20, 2024Updated last year
- Ops files for https//github.com/meta-llama/llama-stack☆17Jun 28, 2025Updated 8 months ago
- Kubernetes object tree editor☆16Oct 25, 2025Updated 4 months ago
- Connecting to Cloud SQL from Dataflow/Apache Beam in Python☆11Oct 31, 2021Updated 4 years ago
- universal-datalakehouse-postgres-ingestion-deltastreamer☆11Apr 7, 2024Updated last year
- CLI tool to manage Kafka connectors☆10Mar 2, 2024Updated 2 years ago
- Playing with different packages of the Apache Spark☆30Feb 8, 2026Updated last month
- learning-by-doing data model built with dbt-core☆16Mar 9, 2026Updated last week
- Pry extension for Sorbet☆14Jul 6, 2020Updated 5 years ago
- Example GitHub Actions for Apache Kafka client application development for local and Confluent Cloud☆15Aug 1, 2022Updated 3 years ago
- Deploy Kafka pipelines to Kubernetes☆14Mar 12, 2026Updated last week
- PDF to JSON, JSON to PDF and etc.☆12Apr 18, 2018Updated 7 years ago
- ☆21Jan 20, 2026Updated 2 months ago
- End-to-end Machine Learning Pipeline demo using Delta Lake, MLflow and AzureML in Azure Databricks☆18Nov 9, 2019Updated 6 years ago
- A chrome extension to control Youtube™ videos from any web page☆15Mar 4, 2023Updated 3 years ago
- Looker extension designed to give business users access to BigQuery and Vertex AI's machine learning capabilities.☆18Jun 17, 2025Updated 9 months ago
- How do tech companies rank amongst themselves when it comes to github.com activity?☆17May 2, 2021Updated 4 years ago
- ☆18May 6, 2024Updated last year
- ☆15Sep 29, 2022Updated 3 years ago
- metamask chrome extension☆18Mar 30, 2017Updated 8 years ago
- This sample demonstrates how to make a use of modules provided by Microsoft Azure Blob Service in Python.☆14Aug 30, 2019Updated 6 years ago
- Terraform module to provision a scheduled Lambda function which will delete old AWS ElasticSearch indices☆13Feb 17, 2026Updated last month
- This is building a container from scratch☆30Feb 28, 2022Updated 4 years ago
- Implemention based on lightrag and nano-graphrag to connect with psql☆15Oct 28, 2024Updated last year
- A nicer UI for AWS Glue Data Catalog☆10Jun 27, 2022Updated 3 years ago
- In which I implement some applications of machine learning techniques.☆32May 10, 2016Updated 9 years ago
- Plugin for the music library manager Beets (http://beets.io/).☆13Dec 23, 2023Updated 2 years ago
- Tools for Analyzing Popularity and Semantic Diversity of a Playlist Dataset☆10Jun 17, 2024Updated last year
- ☆23Jun 24, 2021Updated 4 years ago
- This web scraper is intended to extract data from The Home Depot Website, it could be run locally or in the Apify platform, the latter is…☆10Oct 13, 2022Updated 3 years ago
- Unofficial Google Cloud Workflow emulator☆17Feb 16, 2026Updated last month
- Minimalistic, high-powered Preact boilerplate☆18Jul 7, 2023Updated 2 years ago
- Beets plugin to fetch and store popularity values as flexible item attributes☆11Apr 16, 2018Updated 7 years ago