shravan-kuchkula/dataEngineering

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/shravan-kuchkula/dataEngineering)

shravan-kuchkula / dataEngineering

A repo to track data engineering projects

☆14

Alternatives and similar repositories for dataEngineering

Users that are interested in dataEngineering are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

shravan-kuchkula / udacity-data-eng-proj4
View on GitHub
Developed an ETL pipeline for a Data Lake that extracts data from S3, processes the data using Spark, and loads the data back into S3 as …
☆17Oct 1, 2019Updated 6 years ago
shravan-kuchkula / udacity-data-eng-proj3
View on GitHub
Built a stream processing data pipeline to get data from disparate systems into a dashboard using Kafka as an intermediary.
☆29Aug 14, 2023Updated 2 years ago
shravan-kuchkula / udacity-data-eng-proj-1
View on GitHub
Developed a data pipeline to automate data warehouse ETL by building custom airflow operators that handle the extraction, transformation,…
☆89Nov 22, 2021Updated 4 years ago
angelddaz / de-challenges
View on GitHub
Project based learning for Data Engineering fundamentals.
☆13Jan 15, 2021Updated 5 years ago
AuFeld / Data_Engineering_Projects
View on GitHub
A collection of data engineering projects: data modeling, ETL pipelines, data lakes, infrastructure configuration on AWS, data warehousin…
☆15Apr 29, 2021Updated 5 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
supratim94336 / DataEngineeringCapstoneProject
View on GitHub
😈Complete End to End ETL Pipeline with Spark, Airflow, & AWS
☆51Aug 23, 2019Updated 6 years ago
vsouza / spark-kinesis-redshift
View on GitHub
Example project for consuming AWS Kinesis streamming and save data on Amazon Redshift using Apache Spark
☆11May 22, 2018Updated 8 years ago
alero-awani / batch-data-engineering-project
View on GitHub
A batch Data Pipeline that retrieves data from a user purchase table and a movie review table and is transformed to form a user behaviour…
☆18Aug 14, 2025Updated 11 months ago
danielbeach / DataEngineeringProjects
View on GitHub
Some example projects for Data Engineers to build, end-to-end.
☆39Nov 8, 2023Updated 2 years ago
CICIFLY / Data_Engineering_Project_Portfolio
View on GitHub
Data Engineering, Data Warehouse, Data Mart, Cloud Data, AWS, SAS, Redshift, S3
☆33Feb 2, 2021Updated 5 years ago
GrzegorzGatkowski / Air_Pollution_Pipeline
View on GitHub
Data Engineering Project in GCP
☆22Mar 29, 2023Updated 3 years ago
GuruCharan94 / az-podcast-transcriber
View on GitHub
A podcast transcription service built on Azure that transcribes any new episode of your podcast and displays synchronized transcripts alo…
☆10Dec 10, 2022Updated 3 years ago
ybangaru / wallstreetbets-sentiment-analysis
View on GitHub
☆10May 24, 2021Updated 5 years ago
VicenteYago / steam-data-engineering
View on GitHub
A data engineering project with Airflow, dbt, Terrafrom, GCP and much more!
☆27Nov 8, 2022Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
bpatters / graphiql-feen
View on GitHub
Chrome Extension for Development/Testing/Exploring GraphQL Servers
☆14Oct 1, 2018Updated 7 years ago
ArpiteshSrivastava / spotify-data-engineering-project
View on GitHub
In this project, we will build and ETL(Extract,Transform,Load) pipeline using the Spotify API on AWS. The pipeline will retrieve data fro…
☆25May 6, 2023Updated 3 years ago
MartyC-137 / Data-Engineering
View on GitHub
A project portfolio to accompany my resume
☆30Sep 5, 2023Updated 2 years ago
chuqiaoshen / Git-Influencer
View on GitHub
Insight Data Engineering project: A platform built in HDFS, Spark and Airflow to help you to find social influencers from GitHub Net…
☆16May 21, 2024Updated 2 years ago
LaurentRisser / DS_project_ETL_with_AWS_Twitter
View on GitHub
Setup an ETL from Twitter API to S3
☆10Nov 20, 2020Updated 5 years ago
akashsethi24 / Machine-Learning
View on GitHub
Examples of all Machine Learning Algorithm in Apache Spark
☆15Nov 2, 2017Updated 8 years ago
tkh5044 / portfolio
View on GitHub
My professional portfolio with some of my best data science projects.
☆11Jun 22, 2017Updated 9 years ago
liquidtelecom / Golang-Training-Examples
View on GitHub
This repo contains example code used for golang training
☆10Feb 19, 2023Updated 3 years ago
vijaykothareddy / Data-Engineering
View on GitHub
Code for my blogs on Data Engineering
☆15Nov 9, 2020Updated 5 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
jess197 / football_statistics_etl_project
View on GitHub
☆13Dec 28, 2023Updated 2 years ago
manceps / fashion-mnist-kfp-lab
View on GitHub
A notebook showing how to easily convert a current notebook you have to a notebook that can be run on Kubeflow Pipelines.
☆15Jul 15, 2020Updated 6 years ago
aws-samples / aws-sagemaker-ml-blog-predictive-campaigns
View on GitHub
Deliver Pinpoint Campaigns Driven by Machine Learning on AWS SageMaker
☆18Feb 10, 2019Updated 7 years ago
javalite / javalite-examples
View on GitHub
A selection of projects that shows how to use various parts of JavaLite
☆16Feb 21, 2024Updated 2 years ago
anbento0490 / tutorials
View on GitHub
☆21Jan 21, 2023Updated 3 years ago
huynhsamha / simple-go-ethereum
View on GitHub
Interact to smart contract on Ropsten Test Network Ethereum using Golang
☆12May 31, 2019Updated 7 years ago
prs98 / Backorders_Supply_Chain_Analysis
View on GitHub
Understanding the supply chain process data and implementing different algorithms, building a machine learning model that can predict whe…
☆13Sep 7, 2022Updated 3 years ago
aimlcommunity / Breast-Cancer-Detection-using-Machine-Learning
View on GitHub
This is a guided certification project, as a part of Data Science for Social Good initiative
☆18Mar 9, 2020Updated 6 years ago
angellyao / formulaandcode
View on GitHub
鲁伟《机器学习公式推导与代码实现》。整体对算法的分类是亮点。算法原理和代码实现也相对简单，可以和《机器学习实战》对比起来看。
☆10Oct 19, 2022Updated 3 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
datoujinggzj / WhaleDataAnalysisProject
View on GitHub
14天完成数据分析实战项目
☆10Sep 7, 2022Updated 3 years ago
Apress / beg-dbs-w-postgresql
View on GitHub
Source code for 'Beginning Databases with PostgreSQL' by Richard Stones and Neil Matthew
☆18Mar 30, 2017Updated 9 years ago
SAP-archive / data-warehouse-cloud-modeling
View on GitHub
This repository aims to onboard new users into Modeling in SAP Data Warehouse Cloud in the most practical manner. For that you will build…
☆18Feb 2, 2024Updated 2 years ago
jonathanhayes / Tweepy-Twitter-Stream-Example
View on GitHub
Tweepy Stream Example
☆19Apr 23, 2019Updated 7 years ago
alanchn31 / Loan-Default-Prediction
View on GitHub
Loan Default Prediction using PySpark, with jobs scheduled by Apache Airflow and Integration with Spark using Apache Livy
☆22Dec 26, 2020Updated 5 years ago
Snowboard-Software / dbt_airbyte_shopify_facebook_paypal_fedex_gls_ecommerce_profitability
View on GitHub
This repository is a production dbt pipeline example that model the profitability of an e-commerce business. Data is extracted and loaded…
☆30Jun 14, 2024Updated 2 years ago
hhimanshu / scala-fundamentals
View on GitHub
Learn Scala Fundamentals by creating a working bank!
☆15Jan 22, 2019Updated 7 years ago