How to build an awesome data engineering team
☆101 · Sep 11, 2019 · Updated 6 years ago
Alternatives and similar repositories for data-engineering
Users that are interested in data-engineering are comparing it to the libraries listed below.
- Learning from multiple companies in Silicon Valley. Netflix, Facebook, Google, Startups ☆898 · May 8, 2022 · Updated 4 years ago
- A list of useful resources to learn Data Engineering from scratch ☆3,996 · Jun 19, 2024 · Updated 2 years ago
- ☆13 · Oct 6, 2019 · Updated 6 years ago
- LSTM model for Vietnamese Named Entity Recognition ☆17 · Jul 26, 2017 · Updated 8 years ago
- Sharing interesting and noteworthy Data Engineering content ☆68 · Oct 21, 2016 · Updated 9 years ago
- A few projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake developme… ☆1,945 · Aug 26, 2022 · Updated 3 years ago
- A curated list of data engineering tools for software developers ☆8,773 · Jun 22, 2026 · Updated last week
- This project contains the code to translate between Apache Spark and SFrame. ☆20 · Jul 13, 2016 · Updated 9 years ago
- A boilerplate project for Azure Big Data PaaS services ☆14 · Dec 7, 2022 · Updated 3 years ago
- ☆196 · Feb 25, 2022 · Updated 4 years ago
- ☆10 · Aug 18, 2021 · Updated 4 years ago
- ☆16 · Jun 25, 2019 · Updated 7 years ago
- Big Data Engineering practice project, including ETL with Airflow and Spark using AWS S3 and EMR ☆92 · Jul 17, 2019 · Updated 6 years ago
- Projects done in the Data Engineer Nanodegree Program by Udacity.com ☆169 · Dec 8, 2022 · Updated 3 years ago
- Repo to migrate old wiki to, esp for devs and code examples ☆181 · Oct 18, 2016 · Updated 9 years ago
- ☆13 · Jun 14, 2017 · Updated 9 years ago
- ☆45 · Apr 21, 2022 · Updated 4 years ago
- Python client for the Serf orchestration tool ☆22 · Apr 30, 2021 · Updated 5 years ago
- ☆10 · Feb 23, 2017 · Updated 9 years ago
- some helpers to create swagger output from a pecan app ☆10 · Aug 12, 2016 · Updated 9 years ago
- An Awesome List of Open-Source Data Engineering Projects ☆3,224 · Oct 4, 2024 · Updated last year
- A simple Spark-powered ETL framework that just works 🍺 ☆185 · Oct 2, 2025 · Updated 8 months ago
- In this workshop you will launch an Amazon Redshift cluster in your AWS account and load sample data ~ 100GB using TPCH dataset. You will… ☆24 · Nov 28, 2018 · Updated 7 years ago
- scripts for personal reference ☆18 · Dec 26, 2022 · Updated 3 years ago
- Found a data engineering challenge or participated in a selection process? Share with us! ☆67 · Oct 17, 2022 · Updated 3 years ago
- Data Brewery is an ETL (Extract-Transform-Load) program that connects to many data sources (cloud services, databases, ...) and manage dat… ☆16 · Jan 21, 2021 · Updated 5 years ago
- ☆97 · Jul 18, 2014 · Updated 11 years ago
- Tech and Venture Capital Toolkit ☆14 · Jul 3, 2016 · Updated 9 years ago
- Reference Architectures for Datalakes on AWS ☆78 · May 13, 2020 · Updated 6 years ago
- ☆12 · Sep 30, 2020 · Updated 5 years ago
- ☆27 · Feb 4, 2016 · Updated 10 years ago
- Example end to end data engineering project. ☆1,414 · Dec 8, 2022 · Updated 3 years ago
- 🐋 Docker image for AWS Glue Spark/Python ☆23 · Sep 5, 2023 · Updated 2 years ago
- Common support code for user-facing front end systems. ☆12 · Updated this week
- Coder in your OpenShift/Kubernetes Cluster or in Docker ☆10 · Oct 22, 2021 · Updated 4 years ago
- Accumulated knowledge and experience in the field of Data Engineering ☆872 · Nov 22, 2022 · Updated 3 years ago
- Polls sample app for IBM BlueMix PaaS ☆14 · Jun 25, 2014 · Updated 12 years ago
- Notebooks produced throughout Udacity's Nanodegree Data Engineering Course ☆74 · Oct 3, 2020 · Updated 5 years ago
- A sample custom Spark Structured Streaming Datasource with Websockets ☆12 · May 14, 2020 · Updated 6 years ago