Data Engineering Handbook for beginners and everyone
☆80Jul 13, 2024Updated last year
Alternatives and similar repositories for data-engineering
Users that are interested in data-engineering are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆21Nov 4, 2023Updated 2 years ago
- Practical guidlines with machine learning lifecycle especially in computer vision domains☆16Jun 16, 2023Updated 2 years ago
- A straightforward starter template for Python packages.☆11Mar 18, 2025Updated last year
- vintage theme for the helix editor☆10Jan 28, 2024Updated 2 years ago
- ☆17Jan 23, 2026Updated 2 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- GDK stands for Go Software Development Kit.☆10Feb 28, 2026Updated 3 weeks ago
- ☆13Mar 29, 2022Updated 3 years ago
- Tableau Prep Cookbook, published by Packt☆16Mar 2, 2026Updated 3 weeks ago
- Simple utilities to manage R environments☆16Apr 19, 2024Updated last year
- Simple and dynamic role-based access control for Rails☆12Aug 13, 2021Updated 4 years ago
- An example repository showing how to leverage Kafka to stream your data☆21May 11, 2024Updated last year
- Sample example projects referenced for opensource.com articles☆11Dec 19, 2023Updated 2 years ago
- Heartbeat Monitoring Service☆12Mar 20, 2026Updated last week
- Data Vault 2.0: Code generation, Vertica, Airflow☆13Nov 20, 2019Updated 6 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- This repo contains "Azure Data Engineer Associate" Questions and related docs.☆14Jan 29, 2024Updated 2 years ago
- BitDust user App written in Python using Kivy framework☆14Aug 23, 2025Updated 7 months ago
- B19415 - The Definitive Guide to Data Integration☆11Apr 15, 2024Updated last year
- dbtVault + Greenplum demo☆11Feb 19, 2024Updated 2 years ago
- Based on our paper "Pneumonia Detection from Lung X-ray Images using Local Search Aided Sine Cosine Algorithm based Deep Feature Selectio…☆11Jun 26, 2022Updated 3 years ago
- Benchmarks and examples for a "Slow Auto, Inconvenient Semi" presentation☆18Mar 26, 2025Updated last year
- Rust parser for Clickhouse SQL dialect.☆24Feb 16, 2022Updated 4 years ago
- Udacity Data Engineering Nanodegree Project 3☆12Jul 14, 2019Updated 6 years ago
- Материалы курса Airflow 101☆15Jun 15, 2020Updated 5 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Data Engineering with Google Cloud Platform - Second Edition, published by Packt☆44May 16, 2024Updated last year
- Materials and code relating to Learning Intelligence 25.☆10Mar 23, 2018Updated 8 years ago
- ☆18Apr 13, 2024Updated last year
- End-to-end data platform: A PoC Data Platform project utilizing modern data stack (Spark, Airflow, DBT, Trino, Lightdash, Hive metastore,…☆48Oct 14, 2024Updated last year
- An implementation of Pregel framework and graph algorithms on top of it with Ibis project DataFrames.☆23Apr 7, 2025Updated 11 months ago
- Minimalistic, standalone alternative fake data generator with no dependencies☆20Mar 19, 2026Updated last week
- Introduction to Modern Data Analytics Tools Docker, Airbyte, DBT, Apache Superset with Brazilian Ecommerce Data & Applying RFM in DBT☆13Sep 8, 2022Updated 3 years ago
- Material for the Berlin Bayesian reading group covering Statistical Rethinking by Richard McElreath☆10May 7, 2020Updated 5 years ago
- ☆14Dec 29, 2021Updated 4 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A Result-like type that can collect multiple Errs☆12Oct 5, 2020Updated 5 years ago
- BAIT509 - Business Applications of Machine Learning☆13Feb 7, 2024Updated 2 years ago
- R Client for BigQuery Storage API☆25Sep 17, 2025Updated 6 months ago
- ☆30Apr 12, 2025Updated 11 months ago
- ☆42Apr 3, 2023Updated 2 years ago
- ☆11Jun 16, 2018Updated 7 years ago
- 🕌 Muslim Board Browser Extension☆166Aug 6, 2025Updated 7 months ago