This project implements an ELT (Extract, Load, Transform) data pipeline on the Goodreads dataset, using Dagster (orchestration), Spark (computation), and dbt (transformation).
☆43 · Apr 22, 2023 · Updated 2 years ago
Alternatives and similar repositories for goodreads-elt-pipeline
Users that are interested in goodreads-elt-pipeline are comparing it to the libraries listed below.
- ☆44 · Mar 9, 2025 · Updated last year
- ☆17 · Dec 9, 2022 · Updated 3 years ago
- Simple Tab Sorter++ ☆16 · May 28, 2025 · Updated 10 months ago
- ☆15 · May 12, 2023 · Updated 2 years ago
- A simple playground for dbt with the sqlite connector ☆12 · May 22, 2022 · Updated 3 years ago
- Lyrics Generator based on GPT-2 ☆10 · Jun 20, 2023 · Updated 2 years ago
- Add accent for Vietnamese. N-Grams + Beam search, LSTM, Transformer, Evolved Transformer ☆18 · Feb 3, 2021 · Updated 5 years ago
- Create agents in PHP that monitor and act on your behalf. A Laravel based Huginn port. ☆13 · Jan 4, 2023 · Updated 3 years ago
- An end-2-end project about Son Tung M-TP ☆27 · Sep 25, 2025 · Updated 6 months ago
- ☆12 · Nov 18, 2022 · Updated 3 years ago
- ☆49 · Aug 14, 2024 · Updated last year
- Ensemble PhoBERT with FastText Embedding to improve performance on Vietnamese Sentiment Analysis tasks. ☆16 · Jun 29, 2023 · Updated 2 years ago
- Docktor is a Web App that deploys an easy-to-use kit of analysis and scanning tools. ☆13 · Nov 1, 2023 · Updated 2 years ago
- An example of a Dagster project with a possible folder structure to organize the assets, jobs, repositories, schedules, and ops. Also has… ☆102 · Nov 3, 2024 · Updated last year
- Telegram bot using GPT4 API ☆15 · Aug 26, 2024 · Updated last year
- End to end data pipeline to extract and analyze submissions from any subreddit using Pushshift, python, dbt and BigQuery. ☆12 · Jul 17, 2023 · Updated 2 years ago
- The AI models used for my personal purposes and their usage (Gemini, Copilot, Dialogflow,...) ☆19 · Apr 5, 2024 · Updated last year
- We provide benchmark datasets for evaluating Vietnamese processing models: UIT-ViQuAD, ViNewsQA, UIT-VSFC, UIT-ViIC, UIT-ViNames, UIT-VSM… ☆21 · Jun 19, 2021 · Updated 4 years ago
- A Python Snowpark CLI for loading the TPC-DI dataset into Snowflake. Additional dbt models for building the data warehouse. ☆10 · Sep 4, 2025 · Updated 6 months ago
- On-premises ELT Pipeline ☆31 · Jul 10, 2025 · Updated 8 months ago
- ☆16 · Mar 9, 2026 · Updated 3 weeks ago
- ☆11 · Dec 28, 2020 · Updated 5 years ago
- ☆13 · May 1, 2024 · Updated last year
- A fully serverless, event-driven data pipeline that ingests, enriches, validates, and visualizes real-time news data using AWS services. … ☆25 · Aug 10, 2025 · Updated 7 months ago
- Scalable Realtime Credit Card Fraud Detection (CCFD) system ☆76 · Oct 29, 2025 · Updated 5 months ago
- This repository contains a Docker Compose configuration for running ScyllaDB, a highly scalable NoSQL database for learning and testing. ☆13 · Sep 19, 2024 · Updated last year
- Gem mysql2 agent for huginn ☆14 · Jun 5, 2017 · Updated 8 years ago
- An example dbt project using AutomateDV to create a Data Vault 2.0 Data Warehouse based on the Snowflake TPC-H dataset. ☆58 · Mar 22, 2024 · Updated 2 years ago
- [NeurIPS 2025@FoRLM] R1-Compress: Long Chain-of-Thought Compression via Chunk Compression and Search ☆17 · Jan 24, 2026 · Updated 2 months ago
- code for the table-based open domain question answering project, with paper title: "Reasoning over Hybrid Chain for Table-and-Text Open D… ☆12 · Sep 16, 2022 · Updated 3 years ago
- Code to build models that effectively predict promoter-driven gene expression ☆11 · May 15, 2025 · Updated 10 months ago
- ☆12 · Oct 10, 2021 · Updated 4 years ago
- ☆11 · Feb 27, 2024 · Updated 2 years ago
- ZMK firmware for Urchin and Corne 36 keyboard with nice!nano and nice!view ☆16 · Jan 16, 2026 · Updated 2 months ago
- ☆16 · Oct 15, 2021 · Updated 4 years ago
- Renamed training source for "Thach Thuc" Academic Contest ☆24 · Mar 23, 2024 · Updated 2 years ago
- Data and code for ACL 2023 paper "RobuT: A Systematic Study of Table QA Robustness Against Human-Annotated Adversarial Perturbations" ☆15 · Feb 8, 2024 · Updated 2 years ago
- An implementation of Pregel framework and graph algorithms on top of it with Ibis project DataFrames. ☆23 · Apr 7, 2025 · Updated 11 months ago
- [ICML'25] MELON: Provable Defense Against Indirect Prompt Injection Attacks in AI Agents ☆24 · Jul 31, 2025 · Updated 7 months ago