☆196Feb 13, 2021Updated 5 years ago
Alternatives and similar repositories for datapipelinesbook
Users that are interested in datapipelinesbook are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Some exercises to learn Spark. Solved in Python.☆21Oct 15, 2024Updated last year
- Serverless ETL and Analytics with AWS Glue, published by Packt☆53Apr 22, 2026Updated last month
- A template repository to create a data project with IAC, CI/CD, Data migrations, & testing☆293Jul 11, 2024Updated last year
- Spark Notebook docker image☆10Dec 29, 2017Updated 8 years ago
- This repository contains the code snippets used in "LLM Prompt Engineering For Developers"☆14Apr 22, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Looker Access is a command line tool to control Looker roles, groups, permission sets and model sets.☆10Apr 20, 2019Updated 7 years ago
- ☆19Mar 27, 2020Updated 6 years ago
- Code Repository for GCP: Complete Google Data Engineer and Cloud Architect Guide(v), Published by Packt☆16Jan 30, 2023Updated 3 years ago
- ☆21May 29, 2024Updated last year
- Data Engineering with AWS, 2nd edition - Published by Packt☆171Oct 31, 2023Updated 2 years ago
- ☆67May 27, 2025Updated 11 months ago
- IBM Data Engineering Professional Certificate☆35May 10, 2025Updated last year
- An e2e pipeline using dlt, dagster, duckdb, and dbt-core☆22Mar 27, 2026Updated last month
- A data generator script for Shopify☆14Sep 5, 2020Updated 5 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆11Aug 22, 2023Updated 2 years ago
- Code snippets for Data Engineering Design Patterns book☆385Feb 16, 2026Updated 3 months ago
- Great Expectations Airflow operator☆173Apr 1, 2026Updated last month
- An open-source Python toolkit for preprocessing and analysis of vessel spatio-temporal trajectories.☆23Aug 21, 2022Updated 3 years ago
- Support files for the O'Reilly book "Behavioral Data Analysis with R and Python" by Florent Buisson☆94Jul 15, 2023Updated 2 years ago
- Price Crawler - Tracking Price Inflation☆205Jun 23, 2020Updated 5 years ago
- Data-aware orchestration with dagster, dbt, and airbyte☆31Jan 20, 2023Updated 3 years ago
- Examples in Efficient MySQL Performance☆44Aug 5, 2022Updated 3 years ago
- ☆16Mar 16, 2024Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Explorations of survival analysis in Python☆50Feb 14, 2023Updated 3 years ago
- Data Engineering with Python, published by Packt☆795Jan 30, 2023Updated 3 years ago
- Automated svn2git mirror of include-what-you-use: link goes to upstream☆13May 27, 2015Updated 10 years ago
- This is a code repository for the LinkedIn Learning course Advance Your SQL Skills for Data Engineering.☆28May 11, 2026Updated last week
- A repository for materials used in Snowflake fundamentals bootcamp at O'Reilly Learning Platform☆18Jun 22, 2025Updated 11 months ago
- ☆16Jul 27, 2025Updated 9 months ago
- Code for Data Pipelines with Apache Airflow☆824Aug 15, 2024Updated last year
- Analytics engineering with dbt - projects and developer environment☆22Sep 27, 2024Updated last year
- Snippets of the basic course from Batch Scripting tutorial☆13Aug 15, 2021Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- 🐰 A curated list of awesome Nabaztag resources☆16Feb 20, 2026Updated 3 months ago
- ☆33Apr 17, 2026Updated last month
- Jupyter Notebook Scientific Python Stack extension for Docker Desktop☆18Mar 26, 2024Updated 2 years ago
- Run greatexpectations.io on ANY SQL Engine using REST API. Supported by FastAPI, Pydantic and SQLAlchemy as best data quality tool☆14Dec 12, 2025Updated 5 months ago
- Sample Airflow DAGs to load data from the CovidTracking API to Snowflake via an AWS S3 intermediary.☆16Jan 6, 2021Updated 5 years ago
- Create a streaming data, transfer it to Kafka, modify it with PySpark, take it to ElasticSearch and MinIO☆65Jul 21, 2023Updated 2 years ago
- This Repo collects the Material of Regression Project at Udemy platform by Eng/ Mohammed Agoor☆12Sep 9, 2022Updated 3 years ago