lelouvincx/goodreads-elt-pipeline

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/lelouvincx/goodreads-elt-pipeline)

lelouvincx / goodreads-elt-pipeline

This project implements an ELT (Extract - Load - Transform) data pipeline with the goodreads dataset, using dagster (orchestration), spark (calculation) and dbt (transformation)

☆44

Alternatives and similar repositories for goodreads-elt-pipeline

Users that are interested in goodreads-elt-pipeline are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

longNguyen010203 / Youtube-Recommend-Master-ETL-Pipeline
View on GitHub
A Data Engineering Project that implements an ETL data pipeline using Dagster, Apache Spark, Streamlit, MinIO, Metabase, Dbt, Polars, Doc…
☆25Nov 19, 2024Updated last year
dagster-io / hooli-data-eng-pipelines
View on GitHub
Example Dagster Cloud code for the Hooli Data Engineering organization.
☆26Jul 23, 2026Updated last week
petehunt / dagster-github-stars-example
View on GitHub
☆17Dec 9, 2022Updated 3 years ago
danhphan / trusted-data-pipeline
View on GitHub
Building 3D Trusted Data Pipelines With Dagster, Dbt, and Duckdb
☆24Aug 17, 2023Updated 2 years ago
klimpie94 / pyspark-etl-analytics
View on GitHub
This repo contains code examples of processing and analysing data with Apache Spark and Python
☆10Oct 21, 2020Updated 5 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
luchonaveiro / real-estate-prices-etl
View on GitHub
ETL to scrape a real estate website, process house prices and data, and build an ML model of the house prices.
☆16Jul 11, 2022Updated 4 years ago
ychantit / airflow_aws_utils
View on GitHub
A collection of airflow sample workflows for data processing on aws
☆12Dec 1, 2017Updated 8 years ago
HwaiTengTeoh / Flight-Delays-Prediction-Using-Machine-Learning-Approach
View on GitHub
Flight delays prediction and analysis: Machine Learning Approach
☆14Oct 7, 2022Updated 3 years ago
luongphambao / mlops-diabetes-prediction
View on GitHub
☆51Aug 14, 2024Updated last year
VNOpenAI / vn-accent
View on GitHub
Add accent for Vietnamese. N-Grams + Beam search, LSTM, Transformer, Evolved Transformer
☆18Feb 3, 2021Updated 5 years ago
aws-samples / serverless-datalake
View on GitHub
Serverless Datalake architecture
☆16Updated this week
SemyonSinchenko / flake8-pyspark-with-column
View on GitHub
A flake8 plugin that detects of usage withColumn in a loop or inside reduce
☆28Jun 20, 2025Updated last year
christianhujer / jcardmock
View on GitHub
Mock implementation of the Java Card API 3.0.4 in order to test Java Card applet code without a card or simulator.
☆15May 6, 2019Updated 7 years ago
AntonFriberg / dagster-project-example
View on GitHub
An example of a Dagster project with a possible folder structure to organize the assets, jobs, repositories, schedules, and ops. Also has…
☆101Nov 3, 2024Updated last year
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
ssp-data / practical-data-engineering
View on GitHub
Practical Data Engineering: A Hands-On Real-Estate Project Guide
☆817Jun 25, 2026Updated last month
stewartbryson / dbt-tpcdi
View on GitHub
A Python Snowpark CLI for loading the TPC-DI dataset into Snowflake. Additional dbt models for building the data warehouse.
☆11Sep 4, 2025Updated 10 months ago
kzzzr / mybi-dbt-core
View on GitHub
dbt module for myBI connect
☆13Jan 31, 2023Updated 3 years ago
fivetran / dbt_smart_run
View on GitHub
Run your dbt models efficiently using dbt_smart_run
☆17Mar 5, 2025Updated last year
katiehuangx / Learn-SQL
View on GitHub
☆20Apr 20, 2026Updated 3 months ago
cseeyangchen / Neuron
View on GitHub
【CVPR 25】This is an official PyTorch code of "Neuron: Learning Context-Aware Evolving Representations for Zero-Shot Skeleton Action Recog…
☆20Dec 4, 2025Updated 7 months ago
z3z1ma / dbt-feature-flags
View on GitHub
Feature Flags in dbt models
☆35Jul 5, 2026Updated 3 weeks ago
prakashdontaraju / google-cloud-ecommerce
View on GitHub
ecommerce GCP Streaming pipeline ― Cloud Storage, Compute Engine, Pub/Sub, Dataflow, Apache Beam, BigQuery and Tableau; GCP Batch pipelin…
☆11Mar 9, 2022Updated 4 years ago
DorsaRoh / LungAI
View on GitHub
Deep learning model to detect lung cancer & classify between 4 types of tumors. 4x Hackathon Winner
☆20Oct 27, 2024Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
pracdata / duckdb-pipeline
View on GitHub
Demonstrating the capabilities of DuckDB as a transformation engine for data lakes
☆34Oct 8, 2024Updated last year
ongxuanhong / de03-trino-dbt-spark-everything-everywhere-all-at-once
View on GitHub
☆16Updated this week
cube-js / cube_dbt
View on GitHub
dbt integration for Cube
☆17Oct 22, 2025Updated 9 months ago
slomo / ndef-javacard
View on GitHub
JavaCard applet for speaking NDEF
☆19Jun 13, 2014Updated 12 years ago
gtancev / Medical-Language-Model-Learner
View on GitHub
This application guides you through the development of a language model that classifies clinical documents according to their medical spe…
☆12Aug 12, 2024Updated last year
anikethjr / promoter_models
View on GitHub
Code to build models that effectively predict promoter-driven gene expression
☆12May 15, 2025Updated last year
gbdev / virens
View on GitHub
Homebrew Hub web frontend, in Nuxt 3. Powered by webassembly builds of binjgb and mGBA.
☆21Jun 3, 2026Updated last month
henryzhao5852 / DistDR
View on GitHub
☆12Oct 10, 2021Updated 4 years ago
yilunzhao / RobuT
View on GitHub
Data and code for ACL 2023 paper "RobuT: A Systematic Study of Table QA Robustness Against Human-Annotated Adversarial Perturbations"
☆15Feb 8, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Xnhyacinth / NesyCD
View on GitHub
[AAAI 2025] Neural-Symbolic Collaborative Distillation: Advancing Small Language Models for Complex Reasoning Tasks
☆12Jun 19, 2025Updated last year
sonbaoharryson / Data_Engineer_JobPulse_Project
View on GitHub
This project is built for personal purposes. Currently, I am enahancing it with an interactive bot that can recommend and help you improv…
☆26Updated this week
tunguyenn99 / son-tung-mtp-analytics
View on GitHub
An end-2-end project about Son Tung M-TP
☆27Sep 25, 2025Updated 10 months ago
dowhiledev / nutshell
View on GitHub
Nutshell is an enhanced Unix shell that provides a simplified command language, package management, and AI-powered assistance.
☆24Mar 20, 2025Updated last year
neo-project / neo-debugger
View on GitHub
Neo Smart Contract Debugger for Visual Studio Code
☆23Nov 3, 2023Updated 2 years ago
freemansoft / docker-scripts
View on GitHub
Some docker-compose and other scripts for a couple COTS / open source products
☆11Updated this week
rohitrsp898 / Basic_ETL_PySpark
View on GitHub
☆21Mar 26, 2023Updated 3 years ago