Code for blog at: https://www.startdataengineering.com/post/docker-for-de/
☆40Apr 29, 2024Updated 2 years ago
Alternatives and similar repositories for docker_for_data_engineers
Users that are interested in docker_for_data_engineers are comparing it to the libraries listed below.
- ☆16Apr 26, 2024Updated 2 years ago
- Code for data quality with greatexpectations blog☆13Jul 30, 2024Updated last year
- Repository for Data Engineering Interview Series☆37Oct 17, 2024Updated last year
- Code to help generate SQL for stakeholders. Code at https://www.startdataengineering.com/post/data-democratize-llm/☆13May 24, 2024Updated last year
- Primary repository for NYC DCP's Data Engineering team☆39Updated this week
- ☆14Dec 11, 2023Updated 2 years ago
- A custom end-to-end analytics platform for customer churn☆11May 15, 2025Updated 11 months ago
- Companion repository for the "Streamlining AWS Glue CI/CD — A Comprehensive Blueprint" blog post☆11Nov 8, 2024Updated last year
- Code for DE101 book at https://de101.startdataengineering.com/☆99Feb 22, 2026Updated 2 months ago
- Beginner data engineering project - batch edition☆581Apr 13, 2026Updated 3 weeks ago
- Some Windows images for tool images that I had to use in a Windows Environment.☆10Sep 27, 2020Updated 5 years ago
- Sample project to demonstrate data engineering best practices☆217Feb 24, 2024Updated 2 years ago
- Deploy a complete data stack in just a couple of minutes.☆15Mar 6, 2024Updated 2 years ago
- fst: flow state tool | smooth where you want it, friction where you need it when data engineering☆33Jun 13, 2023Updated 2 years ago
- Project for "Data pipeline design patterns" blog.☆51Aug 6, 2024Updated last year
- Example repo to create end to end tests for data pipeline.☆25Jun 14, 2024Updated last year
- Track SLAs☆16May 9, 2024Updated last year
- Decals and Surfaces for Skylines II☆12Sep 14, 2025Updated 7 months ago
- A demonstration of an ELT (Extract, Load, Transform) pipeline☆31Feb 19, 2024Updated 2 years ago
- ☆13Jul 6, 2021Updated 4 years ago
- Companion repository to the ETL & ELT Pipelines with Apache Airflow® eBook☆41Feb 16, 2026Updated 2 months ago
- ☆12Dec 7, 2023Updated 2 years ago
- A command-line client builder that follows Canonical's Guidelines for a Command Line Interface.☆15Apr 26, 2026Updated last week
- Airflow & DBT Cloud Integrated Project Presented at Lagos DBT Community Meetup & DataFestAfrica 23☆13Oct 11, 2023Updated 2 years ago
- Sample example projects referenced for opensource.com articles☆11Dec 19, 2023Updated 2 years ago
- Pi-hole Maintenance PRO MAX for Pi-hole v6.x on Raspberry Pi: automated apt updates, Gravity refresh, logging, backups, health checks.☆20Dec 27, 2025Updated 4 months ago
- End to end data engineering project with kafka, airflow, spark, postgres and docker.☆111Jan 8, 2026Updated 3 months ago
- An open-source Python package that uses AI to predict Nigerian languages, including English, Pidgin, Yoruba, Hausa, and Igbo.☆28Nov 8, 2025Updated 5 months ago
- This repo shows, via a real-world use case, how to launch dbt models from a DAG in Apache Airflow.☆14Apr 22, 2026Updated last week
- Code for dbt tutorial☆174Sep 9, 2025Updated 7 months ago
- Repository for the online course Python Fundamentos (Python Fundamentals)☆20Aug 26, 2016Updated 9 years ago
- Sample repo for startdataengineering DE 101 free course☆74Jun 24, 2024Updated last year
- Code for blog at https://www.startdataengineering.com/post/python-for-de/☆104Jun 7, 2024Updated last year
- Cost Efficient Data Pipelines with DuckDB☆63May 14, 2025Updated 11 months ago
- sci palettes for matplotlib/seaborn☆10Feb 14, 2022Updated 4 years ago
- Near real time ETL to populate a dashboard.☆75Sep 9, 2025Updated 7 months ago
- BitDust user App written in Python using Kivy framework☆14Aug 23, 2025Updated 8 months ago
- Some python scripts for beginners, written for the book Automating The Internet with Python☆13Oct 1, 2018Updated 7 years ago
- This is a sample project that demonstrates an ETL (Extract, Transform, Load) process for data using Python, Polars and AWS Loca…☆15Sep 25, 2023Updated 2 years ago