cal-data-eng / sp21-materialsLinks
Public facing repository for Data Engineering Spring 2021
☆15Updated 4 years ago
Alternatives and similar repositories for sp21-materials
Users that are interested in sp21-materials are comparing it to the libraries listed below
Sorting:
- Command-line interface to quickly generate fake CSV and JSON data☆72Updated 10 months ago
- Sample project to demonstrate data engineering best practices☆191Updated last year
- Pipeline that extracts data from Crinacle's Headphone and InEarMonitor databases and finalizes data for a Metabase Dashboard. The dashboa…☆230Updated 2 years ago
- Public facing work samples for technical hiring assessment☆20Updated last year
- Surfalytics projces on Data Engineering and Analytics☆105Updated 3 weeks ago
- Template for Data Engineering and Data Pipeline projects☆112Updated 2 years ago
- Awesome list of resources for analytics engineers☆26Updated 3 years ago
- A template repository to create a data project with IAC, CI/CD, Data migrations, & testing☆262Updated 10 months ago
- Tracking and measuring neighborhood and district-level eviction rates in the city of San Francisco.☆139Updated 4 years ago
- Educational project on how to build an ETL (Extract, Transform, Load) data pipeline, orchestrated with Airflow.☆324Updated 3 years ago
- My notes of the Data Engineering Zoomcamp by DataTalksClub☆38Updated 2 years ago
- Simple stream processing pipeline☆103Updated 11 months ago
- Code for "Advanced data transformations in SQL" free live workshop☆81Updated last month
- Code for "Efficient Data Processing in Spark" Course☆313Updated 2 weeks ago
- Example Repo to have full end to end pyspark testing via docker-compose☆32Updated 2 years ago
- Code for my "Efficient Data Processing in SQL" book.☆56Updated 10 months ago
- An example of an ETL pipeline that lays out generic DE processes. This is now out of date but still provides useful information☆26Updated 3 years ago
- Step by step instructions to create a production-ready data pipeline☆50Updated 5 months ago
- A Data Engineering project. Repository for backend infrastructure and Streamlit app files for a Premier League Dashboard.☆237Updated last year
- Code for blog at https://www.startdataengineering.com/post/python-for-de/☆77Updated last year
- Code for dbt tutorial☆157Updated last year
- Data Engineering examples for Airflow, Prefect; dbt for BigQuery, Redshift, ClickHouse, Postgres, DuckDB; PySpark for Batch processing; K…☆65Updated last week
- ☆56Updated last year
- A complete pipeline to pull data from Scryfall's "Magic: The Gathering"-API, via Prefect orchestration and dbt transformation.☆40Updated 2 years ago
- System Design, Solution Architecture, Data Systems Practice☆47Updated last month
- Repo for CDC with debezium blog post☆28Updated 8 months ago
- A real-time reddit data streaming pipeline for sentiment analysis of various subreddits☆126Updated last year
- Pipeline that extracts data from the Spotify API to build a more detailed version of Spotify Wrapped☆35Updated last year
- This repo contains "Databricks Certified Data Engineer Professional" Questions and related docs.☆74Updated 9 months ago
- Data pipeline that scrapes Rust cheater Steam profiles☆51Updated 3 years ago