This project implements an ELT (Extract - Load - Transform) data pipeline with the goodreads dataset, using dagster (orchestration), spark (calculation) and dbt (transformation)
☆43Apr 22, 2023Updated 3 years ago
Alternatives and similar repositories for goodreads-elt-pipeline
Users that are interested in goodreads-elt-pipeline are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A Data Engineering Project that implements an ETL data pipeline using Dagster, Apache Spark, Streamlit, MinIO, Metabase, Dbt, Polars, Doc…☆23Nov 19, 2024Updated last year
- Creation of a data lakehouse and an ELT pipeline to enable the efficient analysis and use of data☆52Dec 2, 2023Updated 2 years ago
- ☆44Mar 9, 2025Updated last year
- ☆17Dec 9, 2022Updated 3 years ago
- Simple Tab Sorter++☆16May 28, 2025Updated 11 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- A dbt adapter for Apache Impala & Cloudera Data Platform☆24Mar 30, 2026Updated last month
- NFC Bitcoin Smartcard☆13Nov 4, 2013Updated 12 years ago
- Open Data Stack Platform: a collection of projects and pipelines built with open data stack tools for scalable, observable data platform…☆22Mar 29, 2026Updated last month
- managed debugger for IronPython☆31Apr 8, 2009Updated 17 years ago
- A collection of airflow sample workflows for data processing on aws☆12Dec 1, 2017Updated 8 years ago
- ScaleDP is an Open-Source extension of Apache Spark for Document Processing☆18Dec 2, 2025Updated 5 months ago
- dbt integration for Cube☆16Oct 22, 2025Updated 6 months ago
- API/Data Platform for Ingesting, Storing, and Serving Data through Postgres, and Litestar☆11Apr 25, 2026Updated 2 weeks ago
- Performant, highly available distributed storage using SeaweedFS in Docker Swarm☆16Jan 10, 2023Updated 3 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Ensemble PhoBERT with FastText Embedding to improve performance on Vietnamese Sentiment Analysis tasks.☆16Jun 29, 2023Updated 2 years ago
- Docktor is a Web App that deploys an easy-to-use kit of analysis and scanning tools.☆13Nov 1, 2023Updated 2 years ago
- dbt module for myBI connect☆13Jan 31, 2023Updated 3 years ago
- Telegram bot using GPT4 API☆15Aug 26, 2024Updated last year
- End to end data pipeline to extract and analyze submissions from any subreddit using Pushshift, python, dbt and BigQuery.☆12Jul 17, 2023Updated 2 years ago
- Fivetran's Jira source dbt package☆14Oct 1, 2025Updated 7 months ago
- Demonstrating the capabilities of DuckDB as a transformation engine for data lakes☆35Oct 8, 2024Updated last year
- Feature Flags in dbt models☆35Apr 9, 2026Updated last month
- ☆16Mar 9, 2026Updated 2 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- On-premises ELT Pipeline☆32Jul 10, 2025Updated 10 months ago
- Pass parameters to Jupyter notebooks via URL arguments☆20Dec 9, 2019Updated 6 years ago
- Run your dbt models efficiently using dbt_smart_run☆16Mar 5, 2025Updated last year
- A fully serverless, event-driven data pipeline that ingests, enriches, validates, and visualizes real-time news data using AWS services. …☆25Aug 10, 2025Updated 9 months ago
- Это репозиторий telegram канала по инженерии данных. Собраны материалы: мысли, кейсы и полезные ссылки.☆18Apr 27, 2025Updated last year
- Instructions and code for the workshop "From Big Data to NLP Insights: Unlocking the Power of PySpark and Spark NLP"☆12May 9, 2023Updated 3 years ago
- JavaCard applet for speaking NDEF☆18Jun 13, 2014Updated 11 years ago
- code for the table-based open domain question answering project, with paper title: "Reasoning over Hybrid Chain for Table-and-Text Open D…☆12Sep 16, 2022Updated 3 years ago
- Code to build models that effectively predict promoter-driven gene expression☆12May 15, 2025Updated 11 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Sample repo for startdataengineering DE 101 free course☆74Jun 24, 2024Updated last year
- Sample http server and repo set up☆40Apr 25, 2026Updated 2 weeks ago
- ☆12Feb 27, 2024Updated 2 years ago
- ZMK firmware for Urchin and Corne 36 keyboard with nice!nano and nice!view☆17Jan 16, 2026Updated 3 months ago
- ☆17Oct 15, 2021Updated 4 years ago
- An implementation of Pregel framework and graph algorithms on top of it with Ibis project DataFrames.☆23Apr 7, 2025Updated last year
- ☆19Jul 27, 2023Updated 2 years ago