Data lake, data warehouse on GCP
☆58Dec 28, 2021Updated 4 years ago
Alternatives and similar repositories for cloud-data-lake
Users that are interested in cloud-data-lake are comparing it to the libraries listed below.
- Udacity Data Engineering Nanodegree Project 3☆12Jul 14, 2019Updated 6 years ago
- This repo helps bootstrap the infrastructures with a modern data stack on Google Cloud Platform using Terraform.☆123Mar 11, 2022Updated 4 years ago
- ☆42Jun 18, 2020Updated 5 years ago
- Python files for working with the pNEUMA dataset☆22May 18, 2021Updated 5 years ago
- StarSnow: HTTP Client for Snowflake database (HTTP get/post from SQL)☆26Oct 6, 2022Updated 3 years ago
- Talk and demonstration of how to integrate TensorFlow with R☆16May 23, 2019Updated 7 years ago
- ☆29Aug 17, 2018Updated 7 years ago
- Property Casualty Data Model Specification☆35Jun 22, 2022Updated 3 years ago
- ☆16Jan 8, 2022Updated 4 years ago
- Python based Wikidata framework for easy dataframe extraction☆45Feb 21, 2026Updated 3 months ago
- Collect NBA injuries report, organize them in an elegant table, then send it via mail☆10Jan 12, 2021Updated 5 years ago
- A comprehensive set of calendar table value functions, for use in calendar dimensions or other applications.☆13Sep 10, 2020Updated 5 years ago
- This repo contains DAGs demonstrating a variety of ELT patterns using Airflow along with dbt.☆12Jan 12, 2023Updated 3 years ago
- Web Scraping Tutorial with Scrapy and Python for Beginners, published by Packt☆37Jan 18, 2023Updated 3 years ago
- merge two sorted lists fast☆12Nov 15, 2023Updated 2 years ago
- SCD Merge Wizard is an application which will help you generate T-SQL statement for merging data from two tables into one table in minute …☆44Sep 4, 2024Updated last year
- ☆12Jun 3, 2023Updated 2 years ago
- dbt-generator - Generate and transform base models for dbt project☆50Dec 15, 2022Updated 3 years ago
- ☆10Apr 16, 2021Updated 5 years ago
- Kaggle machine learning competition submission for March Madness 2018☆10Jul 12, 2018Updated 7 years ago
- ☆10Jan 28, 2025Updated last year
- Analysing World bank Data☆14Apr 14, 2019Updated 7 years ago
- All sorts of things supporting blog posts... Sub folders per blog post title.☆40Jan 30, 2023Updated 3 years ago
- ☆10Oct 20, 2022Updated 3 years ago
- This repo demonstrates how to capture any incoming request and write it as JSON to nginx log using Nginx and Lua. For more details refer …☆12May 22, 2017Updated 9 years ago
- List of customized dotfile configs.☆11May 14, 2025Updated last year
- Limit Order Book Convolutional Neural Network trading bot☆14Jul 24, 2022Updated 3 years ago
- Matching messy Pandas columns with FuzzyWuzzy (Medium Article)☆13Sep 29, 2019Updated 6 years ago
- Cloud Dataproc: Samples and Utils☆11Sep 23, 2020Updated 5 years ago
- Source code for 'BigQuery for Data Warehousing' by Mark Mucchetti☆16Sep 28, 2020Updated 5 years ago
- Analysis of open data from Prova Brasil 2011 with Airflow, S3, Redshift, and Metabase☆15Jun 28, 2023Updated 2 years ago
- [MOVED to Data Engine Thinking] A library for data warehouse and data integration pattern and architecture documentation.☆51Jan 30, 2026Updated 3 months ago
- A linter for Scrapy projects.☆22Feb 25, 2026Updated 2 months ago
- This is a public repository that the dbt proserv team uses for collective demos.☆15Mar 20, 2026Updated 2 months ago
- mtcars dataset☆13Nov 27, 2018Updated 7 years ago
- This repo contains Python implementations of various message queue features.☆13Aug 13, 2020Updated 5 years ago
- Your Top Spotify Listening Habits, Favorite Artists, and Song Recommendations in a Playlist🎧🎶☆19May 9, 2026Updated 2 weeks ago
- Showcase app for Theming (Custom Theme, Blue)☆12Apr 26, 2022Updated 4 years ago
- Tutorial for building a POC Kafka + Spark Streaming pipeline from scratch☆35Dec 23, 2019Updated 6 years ago