Built a stream processing data pipeline to get data from disparate systems into a dashboard using Kafka as an intermediary.
☆29Aug 14, 2023Updated 2 years ago
Alternatives and similar repositories for udacity-data-eng-proj3
Users that are interested in udacity-data-eng-proj3 are comparing it to the libraries listed below
Sorting:
- A repo to track data engineering projects☆13Nov 11, 2022Updated 3 years ago
- Developed an ETL pipeline for a Data Lake that extracts data from S3, processes the data using Spark, and loads the data back into S3 as …☆17Oct 1, 2019Updated 6 years ago
- Sample Faust project to process tweets in real-time☆13Mar 29, 2021Updated 4 years ago
- Developed a data pipeline to automate data warehouse ETL by building custom airflow operators that handle the extraction, transformation,…☆89Nov 22, 2021Updated 4 years ago
- Python Rest Client to interact against Schema Registry confluent server☆179Nov 24, 2025Updated 3 months ago
- Stock Market Data Fetching Project(US/China)☆10Mar 10, 2021Updated 4 years ago
- ☆15Aug 18, 2021Updated 4 years ago
- This repo demonstrates how to load a sample Parquet formatted file from an AWS S3 Bucket. A python job will then be submitted to a Apach…☆19Jun 23, 2016Updated 9 years ago
- A Cookiecutter template for creating Faust projects quickly.☆70Dec 1, 2022Updated 3 years ago
- End-to-end ELT data engineering project☆22Dec 24, 2022Updated 3 years ago
- Tweepy Stream Example☆19Apr 23, 2019Updated 6 years ago
- DataTalks.Club's Data Engineering Zoomcamp Project☆24Jul 14, 2022Updated 3 years ago
- This repo has some proposed agenda for Azure Machine Learning related hands-on workshops.☆11Feb 2, 2021Updated 5 years ago
- Databricks Certified Associate Spark Developer preparation toolkit to setup single node Standalone Spark Cluster along with material in t…☆30Apr 16, 2024Updated last year
- Simple ETL pipeline using Python☆29May 22, 2023Updated 2 years ago
- Portfolio of projects and studies conducted in data engineering.☆34Feb 22, 2025Updated last year
- Spark data pipeline that processes movie ratings data.☆31Mar 1, 2026Updated last week
- [ACL 2023] TeAST: Temporal Knowledge Graph Embedding via Archimedean Spiral Timeline☆12Mar 4, 2024Updated 2 years ago
- Data-Science-Projects-in-Python☆11Jul 25, 2018Updated 7 years ago
- Flask based Movie Recommendation System☆12May 1, 2023Updated 2 years ago
- exemplar code to download all option chains for a symbol using pyetrade (V1 Etrade API)☆10Sep 28, 2021Updated 4 years ago
- (2018) Churn Management & Customer Retention Project for my Ryerson Capstone using Tableau, R, AWS and SQL☆10Sep 13, 2018Updated 7 years ago
- movie-recommendation-system-GUI☆10Aug 15, 2020Updated 5 years ago
- Some example projects for Data Engineers to build, end-to-end.☆38Nov 8, 2023Updated 2 years ago
- 简称"GTC",是一款非常适合小型团队快速将代码通过构建后发布到K8S的部署工具。不需要学习专业的k8s、容器镜像知识也能轻松上手搭建一套自己的CI/CD服务平台。☆12Jun 5, 2023Updated 2 years ago
- An application that takes your current location, address, or latitude/longitude and returns a map showing crimes that have occurred near …☆10Dec 6, 2020Updated 5 years ago
- Technology Demonstration for the IBM BPM Analytics solution based on Elasticsearch and Kibana☆14Dec 15, 2017Updated 8 years ago
- Source code repository for the AISTAT 2023 paper Transport Reversible Jump Proposals.☆10Mar 3, 2023Updated 3 years ago
- Python3, NetworkX, Java, MLlib, Spark, Cassandra, Neo4j 3.0, Gephi, Docker☆11Jul 18, 2017Updated 8 years ago
- A model to predict river water temperature (RWT) using air temperature and discharge☆10Dec 11, 2017Updated 8 years ago
- hxyFrame-base-boot☆12Dec 16, 2023Updated 2 years ago
- Multi-node monitor / manager for Pocket Network Validator nodes☆10Dec 9, 2020Updated 5 years ago
- Source code for NeurIPS 2019 paper "Learning Latent Processes from High-Dimensional Event Sequences via Efficient Sampling""☆10Mar 20, 2021Updated 4 years ago
- This is a real-life, high throughput streaming ELT data pipeline for ecommerce☆15May 22, 2023Updated 2 years ago
- A primer on using the 'synthpop' package for the biobehavioral sciences☆11Mar 31, 2020Updated 5 years ago
- Python for multiobjective cash management☆12Sep 21, 2017Updated 8 years ago
- ☆12Dec 8, 2022Updated 3 years ago
- ☆17May 26, 2023Updated 2 years ago
- Generates a tree of an S3 bucket contents☆10Sep 18, 2020Updated 5 years ago