How to build an awesome data engineering team
☆101Sep 11, 2019Updated 6 years ago
Alternatives and similar repositories for data-engineering
Users that are interested in data-engineering are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Learning from multiple companies in Silicon Valley. Netflix, Facebook, Google, Startups☆899May 8, 2022Updated 3 years ago
- ☆32Aug 13, 2018Updated 7 years ago
- A list of useful resources to learn Data Engineering from scratch☆3,987Jun 19, 2024Updated last year
- ☆13Oct 6, 2019Updated 6 years ago
- Sharing interesting and noteworthy Data Engineering content☆70Oct 21, 2016Updated 9 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Few projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake developme…☆1,903Aug 26, 2022Updated 3 years ago
- A curated list of data engineering tools for software developers☆8,564Apr 5, 2026Updated 3 weeks ago
- This project contains the code to translate between Apache Spark and SFrame.☆20Jul 13, 2016Updated 9 years ago
- Selected resources for SRE/DevOps professionals covering various Computer Science areas: Software Engineering & Architecture, Operations,…☆28Jan 4, 2018Updated 8 years ago
- ☆12Apr 16, 2016Updated 10 years ago
- A boilerplate project for Azure Big Data PaaS services☆14Dec 7, 2022Updated 3 years ago
- Coursera, Big Data Essentials: HDFS, MapReduce and Spark RDD☆12Jun 18, 2019Updated 6 years ago
- Spark cloud integration: tests, cloud committers and more☆20Jan 30, 2025Updated last year
- ☆197Feb 25, 2022Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆10Aug 18, 2021Updated 4 years ago
- ☆16Jun 25, 2019Updated 6 years ago
- Vietnamese Named Entity Recognition☆31Oct 12, 2020Updated 5 years ago
- Big Data Engineering practice project, including ETL with Airflow and Spark using AWS S3 and EMR☆91Jul 17, 2019Updated 6 years ago
- ETL Pipeline using Luigi☆10Nov 15, 2017Updated 8 years ago
- Creates an example AWS DMS for replicating an (on-prem) Oracle database to a cloud-based Postgres database☆13Oct 24, 2017Updated 8 years ago
- RAG applications repo for Uplimit course☆10Jul 20, 2025Updated 9 months ago
- Explain & predict! 📈☆15Sep 30, 2022Updated 3 years ago
- Projects done in the Data Engineer Nanodegree Program by Udacity.com☆169Dec 8, 2022Updated 3 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Repo to migrate old wiki to, esp for devs and code examples☆183Oct 18, 2016Updated 9 years ago
- ☆44Apr 21, 2022Updated 4 years ago
- An Awesome List of Open-Source Data Engineering Projects☆3,155Oct 4, 2024Updated last year
- A simple Spark-powered ETL framework that just works 🍺☆186Oct 2, 2025Updated 6 months ago
- In this workshop you will launch an Amazon Redshift cluster in your AWS account and load sample data ~ 100GB using TPCH dataset. You will…☆24Nov 28, 2018Updated 7 years ago
- Это репозиторий telegram канала по инженерии данных. Собраны материалы: мысли, кейсы и полезные ссылки.☆18Apr 27, 2025Updated last year
- scripts for personal reference☆19Dec 26, 2022Updated 3 years ago
- Found a data engineering challenge or participated in a selection process ? Share with us!☆67Oct 17, 2022Updated 3 years ago
- Software mention extraction and linking from scientific articles☆14Sep 2, 2022Updated 3 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- ☆12Feb 20, 2020Updated 6 years ago
- ☆17Aug 31, 2021Updated 4 years ago
- Data Brewery is an ETL (Extract-Transform-Load) program that connect to many data sources (cloud services, databases, ...) and manage dat…☆16Jan 21, 2021Updated 5 years ago
- Reference Architectures for Datalakes on AWS☆78May 13, 2020Updated 5 years ago
- 个人小主页https://twistedw.github.io☆11Aug 23, 2021Updated 4 years ago
- Example end to end data engineering project.☆1,412Dec 8, 2022Updated 3 years ago
- Common support code for user-facing front end systems.☆12Apr 21, 2026Updated last week