Code for blog at: https://www.startdataengineering.com/post/docker-for-de/
☆40Apr 29, 2024Updated 2 years ago
Alternatives and similar repositories for docker_for_data_engineers
Users that are interested in docker_for_data_engineers are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆16Apr 26, 2024Updated 2 years ago
- Code for data quality with greatexpectations blog☆13Jul 30, 2024Updated last year
- A custom end-to-end analytics platform for customer churn☆10May 15, 2025Updated last year
- Trying out the Dataframe Polars library with Delta Lake ... feat Python.☆12Jan 29, 2025Updated last year
- Companion repository for the "Streamlining AWS Glue CI/CD — A Comprehensive Blueprint" blog post☆11Nov 8, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆10Jul 24, 2024Updated last year
- End-to-End ELT data pipeline with Postgres, Airbyte, dbt, Dagster, Snowflake and Metabase☆11Jul 13, 2023Updated 2 years ago
- Code for DE101 book at https://de101.startdataengineering.com/☆108Feb 22, 2026Updated 3 months ago
- Beginner data engineering project - batch edition☆581Apr 13, 2026Updated 2 months ago
- This repository contains the necessary configuration files and DAGs (Directed Acyclic Graphs) for setting up a robust data engineering en…☆25Jan 26, 2024Updated 2 years ago
- Sample project to demonstrate data engineering best practices☆220Feb 24, 2024Updated 2 years ago
- Deploy a complete data stack in just a couple of minutes.☆15Mar 6, 2024Updated 2 years ago
- fst: flow state tool | smooth where you want it, friction where you need it when data engineering☆33Jun 13, 2023Updated 3 years ago
- Project for "Data pipeline design patterns" blog.☆52Aug 6, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Step by step instructions to create a production-ready data pipeline☆61Dec 23, 2024Updated last year
- Track SLAs☆16May 9, 2024Updated 2 years ago
- Code for "Advanced data transformations in SQL" free live workshop☆92May 5, 2025Updated last year
- Repo for CDC with debezium blog post☆29Sep 15, 2024Updated last year
- ☆13Jul 6, 2021Updated 4 years ago
- Companion repository to the ETL & ELT Pipelines with Apache Airflow® eBook☆43Feb 16, 2026Updated 3 months ago
- A command line client builder that follows the Canonical's Guidelines for a Command Line Interface.☆15Jun 6, 2026Updated last week
- ☆12Dec 7, 2023Updated 2 years ago
- Airflow & DBT Cloud Integrated Project Presented at Lagos DBT Community Meetup & DataFestAfrica 23☆13Oct 11, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- This repository will contain the code for Java backend developer bootcamp☆11Feb 6, 2025Updated last year
- A library for generating pseudo-random (but "realistic") data in python. A port of the faker gem to python (making use of its rich locale…☆19Oct 16, 2014Updated 11 years ago
- Demo code for dynamically generating web API clients☆12Jul 18, 2016Updated 9 years ago
- End to end data engineering project with kafka, airflow, spark, postgres and docker.☆114Jan 8, 2026Updated 5 months ago
- An open-source Python package that uses AI to predict Nigerian languages, including English, Pidgin, Yoruba, Hausa, and Igbo.☆28Nov 8, 2025Updated 7 months ago
- This repo via a real world use case, shows how to launch dbt models from a DAG in Apache Airflow.☆14Apr 22, 2026Updated last month
- Repositório do Curso Online Python Fundamentos☆20Aug 26, 2016Updated 9 years ago
- Code for dbt tutorial☆179Jun 4, 2026Updated last week
- Sample repo for startdataengineering DE 101 free course☆74Jun 24, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Code for blog at https://www.startdataengineering.com/post/python-for-de/☆107May 26, 2026Updated 2 weeks ago
- Cost Efficient Data Pipelines with DuckDB☆61May 14, 2025Updated last year
- Near real time ETL to populate a dashboard.☆75Sep 9, 2025Updated 9 months ago
- Este é um projeto de exemplo que demonstra um processo de ETL (Extração, Transformação e Carga) de dados usando Python, Polars e AWS Loca…☆15Sep 25, 2023Updated 2 years ago
- Website contendo a bíblia completa em PHP & MySQL☆10Jan 10, 2025Updated last year
- TypeScript Crash Course - Mar 16-18 2024☆17Mar 16, 2024Updated 2 years ago
- ChartMogul API Python Client☆17Apr 17, 2026Updated last month