Batch Processing , orchestration using Apache Airflow and Google Workflows, spark structured Streaming and a lot more
☆18Jun 21, 2022Updated 3 years ago
Alternatives and similar repositories for gcp-data-engineering
Users that are interested in gcp-data-engineering are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆20Aug 23, 2020Updated 5 years ago
- Collection of utilities for working with BigQuery in Apache Beam☆10Nov 13, 2025Updated 7 months ago
- O'Reilly Scala Programming Fundamentals: Methods, Classes, Traits☆13Jul 16, 2018Updated 7 years ago
- Desarrollé un proyecto de ETL sobre archivos de diferentes orígenes (CSV, JSON). Luego, utilicé FastAPI para crear una API que permita re…☆10Dec 9, 2022Updated 3 years ago
- Elasticsearch Terraform module for Google Cloud Platform☆11Feb 14, 2021Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Packt courseware source code for "Beginning Data Science with Jupyter"☆15Jan 5, 2020Updated 6 years ago
- This is part of the Artificial Intelligence live course, hosted by Packtpub. In this repository, you can find information to build your e…☆15Feb 19, 2019Updated 7 years ago
- S2X (SPARQL on Spark with GraphX) is a SPARQL query processor for Hadoop based on Spark GraphX. It combines graph-parallel abstraction of…☆15Jun 12, 2018Updated 8 years ago
- ⚡ An Augmented Reality real-world length measuring web application built by the modification of the example being provided by babylonjs -…☆12Sep 24, 2020Updated 5 years ago
- Supplementary material for Building a Modern Data Platform with Snowflake, from Pearson.☆20Dec 28, 2021Updated 4 years ago
- Docker compose and Google Colab demo to build a CDC with Delta Lake☆15Sep 7, 2022Updated 3 years ago
- This Repo contain details related to Data Engineering tech stacks in GCP☆58Apr 18, 2026Updated last month
- Content related to Mastering Postgresql along with videos.☆19Aug 18, 2021Updated 4 years ago
- Using Apache Airflow to author, run and monitor complex data pipelines.☆12Oct 24, 2018Updated 7 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Repo that will help you explore how to build a hybrid workflow using Apache Airflow and Amazon ECS Anywhere☆11Jul 12, 2022Updated 3 years ago
- Proyecto de juguete para mostrar cómo realizar el setup de un proyecto de data science☆11Nov 24, 2022Updated 3 years ago
- A sample project for KSQL along with debezium and kafka connect☆15Aug 18, 2022Updated 3 years ago
- markup to create labs for courses from the Google Cloud training catalog.☆49Sep 15, 2025Updated 8 months ago
- ☆16Jun 27, 2020Updated 5 years ago
- A simple project that trains an OpenNLP Named Entity Recognition model to identify ingredients in a recipe.☆14Oct 30, 2016Updated 9 years ago
- ☆16Apr 18, 2025Updated last year
- ☆12Feb 11, 2022Updated 4 years ago
- Content for O'Reilly Live Training☆25Sep 22, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Instagram Clone - NestJs + Typescript and ReactJs☆14Mar 18, 2021Updated 5 years ago
- Exercises and examples developed for the Hadoop with Python tutorial☆21Sep 19, 2019Updated 6 years ago
- Basic REPL for SwiftyLISP☆13Feb 16, 2017Updated 9 years ago
- Reddit Data Science Project Ideas☆11Dec 28, 2019Updated 6 years ago
- CSD for Apache Airflow☆19Aug 20, 2019Updated 6 years ago
- Airbyte deployment and configuration management tool☆12Feb 5, 2022Updated 4 years ago
- A simulator for single molecule FRET experiments of freely diffusing particles. Moved to:☆18Jun 13, 2019Updated 7 years ago
- EfectoBolo Model #MeLiDataChallenge 2020☆22May 24, 2021Updated 5 years ago
- Repositorio utilizado para el Curso de Apache Spark en Platzi☆20Feb 20, 2021Updated 5 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆25Jan 23, 2019Updated 7 years ago
- Slides for "Feature engineering for time series forecasting" talk☆68Nov 11, 2022Updated 3 years ago
- ☆17Oct 26, 2020Updated 5 years ago
- This repo is mostly created for pyspark and hive related interview questions.☆63Jan 6, 2026Updated 5 months ago
- ☆19Jan 18, 2021Updated 5 years ago
- ☆36Aug 24, 2022Updated 3 years ago
- Demo code for PD Tech Fest 2019 showcasing event driven autoscaling using KEDA☆24Feb 15, 2024Updated 2 years ago