How to build an awesome data engineering team
☆101Sep 11, 2019Updated 6 years ago
Alternatives and similar repositories for data-engineering
Users who are interested in data-engineering are comparing it to the libraries listed below.
- Learning from multiple companies in Silicon Valley. Netflix, Facebook, Google, Startups☆898May 8, 2022Updated 4 years ago
- ☆32Aug 13, 2018Updated 7 years ago
- A list of useful resources to learn Data Engineering from scratch☆3,993Jun 19, 2024Updated last year
- ☆13Oct 6, 2019Updated 6 years ago
- Sharing interesting and noteworthy Data Engineering content☆69Oct 21, 2016Updated 9 years ago
- ☆27Dec 17, 2025Updated 5 months ago
- A few projects related to Data Engineering, including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake developme…☆1,907Aug 26, 2022Updated 3 years ago
- A curated list of data engineering tools for software developers☆8,650Updated this week
- Keras implementation of the "Show, Attend and Tell" paper☆26Apr 21, 2019Updated 7 years ago
- All Machine Learning Algorithms☆27Oct 17, 2020Updated 5 years ago
- This project contains the code to translate between Apache Spark and SFrame.☆20Jul 13, 2016Updated 9 years ago
- A boilerplate project for Azure Big Data PaaS services☆14Dec 7, 2022Updated 3 years ago
- Defund the Police.☆19Jun 14, 2020Updated 5 years ago
- This is an example api which covers some topics about api creation with dotnet-core2☆10Dec 8, 2022Updated 3 years ago
- ☆196Feb 25, 2022Updated 4 years ago
- ☆10Aug 18, 2021Updated 4 years ago
- Amazon Redshift offers a common query interface against data stored in fast, local storage as well as data from high-capacity, inexpensiv…☆13Nov 26, 2018Updated 7 years ago
- RAG applications repo for Uplimit course☆10Jul 20, 2025Updated 10 months ago
- Explain & predict! 📈☆15Sep 30, 2022Updated 3 years ago
- Repo to migrate old wiki to, esp for devs and code examples☆183Oct 18, 2016Updated 9 years ago
- Projects done in the Data Engineer Nanodegree Program by Udacity.com☆169Dec 8, 2022Updated 3 years ago
- Accompanying repository to Timecampus's Official Series #NeverStop☆12Aug 17, 2020Updated 5 years ago
- ☆45Apr 21, 2022Updated 4 years ago
- An Awesome List of Open-Source Data Engineering Projects☆3,181Oct 4, 2024Updated last year
- In this workshop you will launch an Amazon Redshift cluster in your AWS account and load sample data ~ 100GB using TPCH dataset. You will…☆24Nov 28, 2018Updated 7 years ago
- This is the repository of a Telegram channel on data engineering. It collects materials: thoughts, case studies, and useful links.☆18Apr 27, 2025Updated last year
- scripts for personal reference☆19Dec 26, 2022Updated 3 years ago
- Found a data engineering challenge or participated in a selection process? Share with us!☆66Oct 17, 2022Updated 3 years ago
- Poverty Prediction by Combination of Satellite Imagery☆43Nov 23, 2020Updated 5 years ago
- Simple system monitor for Ubuntu and Nginx. It uses Node.js and internal commands to retrieve the information.☆10Sep 24, 2015Updated 10 years ago
- ☆14Aug 22, 2025Updated 8 months ago
- Data Brewery is an ETL (Extract-Transform-Load) program that connects to many data sources (cloud services, databases, ...) and manage dat…☆16Jan 21, 2021Updated 5 years ago
- Reference Architectures for Datalakes on AWS☆78May 13, 2020Updated 6 years ago
- ☆24Dec 20, 2025Updated 5 months ago
- ☆27Feb 4, 2016Updated 10 years ago
- Personal homepage: https://twistedw.github.io ☆11Aug 23, 2021Updated 4 years ago
- Spark Custom Stream Source and Sink☆12Jan 19, 2019Updated 7 years ago
- Example end to end data engineering project.☆1,410Dec 8, 2022Updated 3 years ago
- ☆22Sep 2, 2021Updated 4 years ago