gitgud / cs6515_public
☆16 Updated 4 months ago
Alternatives and similar repositories for cs6515_public:
Users that are interested in cs6515_public are comparing it to the libraries listed below
- Dagster University courses ☆74 Updated last week
- A curated list of resources about Snowflake ☆236 Updated last year
- Simple stream processing pipeline ☆100 Updated 10 months ago
- A book describing how to set up and maintain Data Engineering infrastructure using Google Cloud Platform. ☆123 Updated 4 years ago
- A real-time Reddit data streaming pipeline for sentiment analysis of various subreddits ☆124 Updated last year
- An end-to-end ELT pipeline to store simulated heart rate data inside a data warehouse; uses Kafka for real-time processing, Airbyte for d… ☆14 Updated 10 months ago
- Sample files, code snippets and downloads for Snowflake labs and tutorials. ☆181 Updated this week
- Engineering Management Leadership handbook ☆31 Updated last year
- Code for blog at: https://www.startdataengineering.com/post/docker-for-de/ ☆36 Updated 11 months ago
- Django-based course management platform for Zoomcamps ☆64 Updated 3 weeks ago
- Sample project to demonstrate data engineering best practices ☆185 Updated last year
- Open Source LeetCode for PySpark, Spark, Pandas and DBT/Snowflake ☆168 Updated 2 months ago
- Data pipeline that scrapes Rust cheater Steam profiles ☆52 Updated 3 years ago
- ☆181 Updated 4 years ago
- Serverless ETL using cloud functions https://fivetran.com/docs/functions ☆57 Updated last year
- Educational project on how to build an ETL (Extract, Transform, Load) data pipeline, orchestrated with Airflow. ☆317 Updated 3 years ago
- Tracking and measuring neighborhood and district-level eviction rates in the city of San Francisco. ☆139 Updated 4 years ago
- Project for "Data pipeline design patterns" blog. ☆45 Updated 8 months ago
- ☆54 Updated 3 months ago
- This repo contains a spark standalone cluster on docker for anyone who wants to play with PySpark by submitting their applications. ☆34 Updated last year
- Udacity Data Engineering Nano Degree (DEND) ☆184 Updated 5 years ago
- A complete pipeline to pull data from Scryfall's "Magic: The Gathering"-API, via Prefect orchestration and dbt transformation. ☆40 Updated last year
- Event data simulator. Generates a stream of pseudo-random events from a set of users, designed to simulate web traffic. ☆85 Updated last year
- With everything I learned from DEZoomcamp from datatalks.club, this project performs a batch processing on AWS for the cycling dataset wh… ☆13 Updated 2 years ago
- ☆21 Updated 4 months ago
- This repo contains DAGs demonstrating a variety of ELT patterns using Airflow along with dbt. ☆11 Updated 2 years ago
- System Design, Solution Architecture, Data Systems Practice ☆44 Updated 2 weeks ago
- Code for dbt tutorial ☆156 Updated 10 months ago
- This repo contains details related to Data Engineering tech stacks in GCP ☆55 Updated 3 months ago
- Notes and Materials for Machine Learning for Trading CS7646 (Fall 2020) ☆15 Updated 4 years ago