Code for blog at: https://www.startdataengineering.com/post/docker-for-de/
☆40Apr 29, 2024Updated 2 years ago
Alternatives and similar repositories for docker_for_data_engineers
Users that are interested in docker_for_data_engineers are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆16Apr 26, 2024Updated 2 years ago
- Code for data quality with greatexpectations blog☆13Jul 30, 2024Updated last year
- Code to demonstrate data engineering metadata & logging best practices☆21Mar 12, 2024Updated 2 years ago
- Repository for Data Engineering Interview Series☆38Oct 17, 2024Updated last year
- Primary repository for NYC DCP's Data Engineering team☆40Updated this week
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆14Dec 11, 2023Updated 2 years ago
- A custom end-to-end analytics platform for customer churn☆10May 15, 2025Updated last year
- Trying out the Dataframe Polars library with Delta Lake ... feat Python.☆12Jan 29, 2025Updated last year
- Companion repository for the "Streamlining AWS Glue CI/CD — A Comprehensive Blueprint" blog post☆11Nov 8, 2024Updated last year
- Code for DE101 book at https://de101.startdataengineering.com/☆105Feb 22, 2026Updated 3 months ago
- End-to-End ELT data pipeline with Postgres, Airbyte, dbt, Dagster, Snowflake and Metabase☆11Jul 13, 2023Updated 2 years ago
- Beginner data engineering project - batch edition☆583Apr 13, 2026Updated last month
- Sample project to demonstrate data engineering best practices☆219Feb 24, 2024Updated 2 years ago
- Deploy a complete data stack in just a couple of minutes.☆15Mar 6, 2024Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Project for "Data pipeline design patterns" blog.☆51Aug 6, 2024Updated last year
- Step by step instructions to create a production-ready data pipeline☆60Dec 23, 2024Updated last year
- Example repo to create end to end tests for data pipeline.☆25Jun 14, 2024Updated last year
- Code for "Advanced data transformations in SQL" free live workshop☆92May 5, 2025Updated last year
- Decals and Surfaces for Skylines II☆12Sep 14, 2025Updated 8 months ago
- Repo for CDC with debezium blog post☆29Sep 15, 2024Updated last year
- A demonstration of an ELT (Extract, Load, Transform) pipeline☆31Feb 19, 2024Updated 2 years ago
- ☆13Jul 6, 2021Updated 4 years ago
- Companion repository to the ETL & ELT Pipelines with Apache Airflow® eBook☆41Feb 16, 2026Updated 3 months ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ☆12Dec 7, 2023Updated 2 years ago
- A command line client builder that follows the Canonical's Guidelines for a Command Line Interface.☆14Updated this week
- Airflow & DBT Cloud Integrated Project Presented at Lagos DBT Community Meetup & DataFestAfrica 23☆13Oct 11, 2023Updated 2 years ago
- End to end data engineering project with kafka, airflow, spark, postgres and docker.☆113Jan 8, 2026Updated 4 months ago
- An open-source Python package that uses AI to predict Nigerian languages, including English, Pidgin, Yoruba, Hausa, and Igbo.☆28Nov 8, 2025Updated 6 months ago
- This repo via a real world use case, shows how to launch dbt models from a DAG in Apache Airflow.☆14Apr 22, 2026Updated last month
- Repositório do Curso Online Python Fundamentos☆20Aug 26, 2016Updated 9 years ago
- Code for dbt tutorial☆178Sep 9, 2025Updated 8 months ago
- Sample repo for startdataengineering DE 101 free course☆74Jun 24, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Apache Arrow Guide☆17Oct 10, 2021Updated 4 years ago
- Cost Efficient Data Pipelines with DuckDB☆61May 14, 2025Updated last year
- BitDust user App written in Python using Kivy framework☆14Aug 23, 2025Updated 9 months ago
- Some python scripts for beginners, written for the book Automating The Internet with Python☆13Oct 1, 2018Updated 7 years ago
- Este é um projeto de exemplo que demonstra um processo de ETL (Extração, Transformação e Carga) de dados usando Python, Polars e AWS Loca…☆15Sep 25, 2023Updated 2 years ago
- TypeScript Crash Course - Mar 16-18 2024☆17Mar 16, 2024Updated 2 years ago
- ChartMogul API Python Client☆17Apr 17, 2026Updated last month