Data Pipeline from the Global Historical Climatology Network DataSet
☆27Dec 20, 2022Updated 3 years ago
Alternatives and similar repositories for ghcn-d
Users that are interested in ghcn-d are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Solutions for Data Engineering Zoomcamp, Winter 2022.☆16Apr 22, 2022Updated 4 years ago
- This is the final project that after participated the Data Engineering Zoomcamp☆11Apr 4, 2022Updated 4 years ago
- My first attempt at a rough ETL pipeline; technologies include spark, GCS, prefect orchestration, and terraform☆14Oct 12, 2022Updated 3 years ago
- velib-v2: An ETL pipeline that employs batch and streaming jobs using Spark, Kafka, Airflow, and other tools, all orchestrated with Docke…☆20Aug 12, 2025Updated 9 months ago
- A data engineering project with Airflow, dbt, Terrafrom, GCP and much more!☆26Nov 8, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆13Mar 30, 2024Updated 2 years ago
- Free High-Quality Financial Data in Azure☆11Jun 15, 2024Updated last year
- ☆20Apr 3, 2024Updated 2 years ago
- Repository containing example solutions for the Data Engineering Career Path Portfolio Projects☆18Sep 16, 2022Updated 3 years ago
- ☆20Jan 23, 2023Updated 3 years ago
- Data Engineering Project to Extract and Process Solana Reddit Data☆40Feb 3, 2024Updated 2 years ago
- Code Repository for my 3rd Data Project.☆16Jun 13, 2023Updated 2 years ago
- Final Project for Data Engineering Zoomcamp Course 2024 🧙🔥☆11Apr 17, 2024Updated 2 years ago
- In this project I used apache airflow to scrape website periodically. This is for the tutorials I do on youtube.☆10Nov 21, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Exploratory Data Analysis and Data Visualisation of All Space Missions from 1957 Dataset.☆12Jun 15, 2021Updated 4 years ago
- Get started setting up infrastructure as code on Google Cloud Platform☆11Jun 13, 2021Updated 4 years ago
- An end-to-end data engineering pipeline to create a dashboard for the latest content on the r/Stocks subreddit☆20Aug 5, 2022Updated 3 years ago
- Built a real-time streaming pipeline to extract stock data, using Apache Nifi, Debezium, Kafka, and Spark Streaming. Loaded the transform…☆28Oct 13, 2023Updated 2 years ago
- How to Automate SQL: dbt(data build tool) tutorial on bigquery with extensive NOTES☆33Apr 13, 2026Updated last month
- The Christmas Project is a festive-themed data engineering initiative designed to integrate and analyze diverse datasets, creating a comp…☆19Jan 11, 2025Updated last year
- Portfolio Site☆19Dec 28, 2025Updated 4 months ago
- Processing TfL data for bike usage with Google Cloud Platform.☆45Jul 15, 2022Updated 3 years ago
- Self-improving AI agents using Agentic Context Engineering - A starter implementation with Google ADK☆21Oct 23, 2025Updated 7 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Resources and projects from Udacity Data Engineering with AWS nano degree programme☆29Apr 12, 2023Updated 3 years ago
- Streamlit-Powered DataTalksClub Project Analyzer: Interactive Insights at Your Fingertips☆22May 5, 2026Updated 2 weeks ago
- ☆12Jul 15, 2024Updated last year
- Simple ETL pipeline using Python☆29May 22, 2023Updated 3 years ago
- ☆11Nov 18, 2022Updated 3 years ago
- My masters thesis work: predicting a river's natural flow using a variety of machine learning techniques☆10Sep 19, 2016Updated 9 years ago
- This course introduced me to three cutting-edge technologies for privacy-preserving AI: Federated Learning, Differential Privacy, and Enc…☆11Sep 2, 2019Updated 6 years ago
- This repository is a production dbt pipeline example that model the profitability of an e-commerce business. Data is extracted and loaded…☆30Jun 14, 2024Updated last year
- API/Data Platform for Ingesting, Storing, and Serving Data through Postgres, and Litestar☆11Apr 25, 2026Updated last month
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- This repository contains code and configuration files for an Extract, Transform, Load (ETL) project using Google Cloud Data Fusion for da…☆20Feb 23, 2024Updated 2 years ago
- DataTalks.Club's Data Engineering Zoomcamp Project☆24Jul 14, 2022Updated 3 years ago
- Create a Machine Learning model which suggests a location to open a Cafe in Melbourne, Australia☆16Aug 8, 2021Updated 4 years ago
- End to end data pipeline to extract and analyze submissions from any subreddit using Pushshift, python, dbt and BigQuery.☆12Jul 17, 2023Updated 2 years ago
- A Python Snowpark CLI for loading the TPC-DI dataset into Snowflake. Additional dbt models for building the data warehouse.☆11Sep 4, 2025Updated 8 months ago
- Cardano mainchain data on BigQuery☆11Aug 3, 2023Updated 2 years ago
- 👧 Greta is an agile voice assistant to help reduce your carbon footprint.☆13Apr 24, 2023Updated 3 years ago