Data Engineering Handbook for beginners and everyone
☆79Jul 13, 2024Updated last year
Alternatives and similar repositories for data-engineering
Users that are interested in data-engineering are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Fail, Retry, Succeed With Valkryrietry For Golang☆15Feb 10, 2026Updated 4 months ago
- ☆21Nov 4, 2023Updated 2 years ago
- Practical guidlines with machine learning lifecycle especially in computer vision domains☆16Jun 16, 2023Updated 3 years ago
- GB: Построение хранилища данных и основы ETL☆10Mar 27, 2021Updated 5 years ago
- ☆13Sep 23, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Submit your task quickly.☆10Apr 16, 2021Updated 5 years ago
- A simple project to analyse Malaysian airports - an opportunity to play with tools like Luigi, Docker, and Metabase as part of an end-to-…☆13Jul 25, 2023Updated 2 years ago
- Decals and Surfaces for Skylines II☆12Sep 14, 2025Updated 9 months ago
- ☆18Jan 23, 2026Updated 4 months ago
- A guide to creating and using the Yoyo migrations tool with Postgres☆24Jun 6, 2019Updated 7 years ago
- A fanmade JKT48 website to sort your favorite JKT48 members☆27Apr 30, 2026Updated last month
- An example repository showing how to leverage Kafka to stream your data☆21May 11, 2024Updated 2 years ago
- A library for generating pseudo-random (but "realistic") data in python. A port of the faker gem to python (making use of its rich locale…☆19Oct 16, 2014Updated 11 years ago
- Sample example projects referenced for opensource.com articles☆11Dec 19, 2023Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- DuckDB extension for readin PCAP files☆19Aug 31, 2024Updated last year
- ☆10Oct 7, 2024Updated last year
- Data Vault 2.0: Code generation, Vertica, Airflow☆13Nov 20, 2019Updated 6 years ago
- Simple web-app to provide illustration about a take on luck and hard work.☆30Apr 1, 2021Updated 5 years ago
- The codes for the paper of "A particle swarm optimization-based flexible convolutional auto-encoder for image classification" published b…☆10Jul 21, 2020Updated 5 years ago
- ☆21Nov 21, 2023Updated 2 years ago
- Instantly understand and summarize JSON structure through automatic schema inference via a Python CLI☆26Nov 3, 2024Updated last year
- BitDust user App written in Python using Kivy framework☆14Aug 23, 2025Updated 9 months ago
- Some python scripts for beginners, written for the book Automating The Internet with Python☆13Oct 1, 2018Updated 7 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Use Redis to handle your event stream in Laravel☆11Dec 10, 2022Updated 3 years ago
- Based on our paper "Pneumonia Detection from Lung X-ray Images using Local Search Aided Sine Cosine Algorithm based Deep Feature Selectio…☆11Jun 26, 2022Updated 3 years ago
- Udacity Data Engineering Nanodegree Project 3☆12Jul 14, 2019Updated 6 years ago
- Rust parser for Clickhouse SQL dialect.☆24Feb 16, 2022Updated 4 years ago
- Document parameters using comments☆10Aug 6, 2021Updated 4 years ago
- ☆16Dec 14, 2021Updated 4 years ago
- Materials and code relating to Learning Intelligence 25.☆11Mar 23, 2018Updated 8 years ago
- A work-in-progress book on Dask☆12Jul 15, 2023Updated 2 years ago
- End-to-end data platform: A PoC Data Platform project utilizing modern data stack (Spark, Airflow, DBT, Trino, Lightdash, Hive metastore,…☆48Oct 14, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Collection of shell/Bash scripts for various using cases | #SE☆11Jun 8, 2026Updated last week
- Integration of opentelemetry with the tracing crate☆26Jun 8, 2026Updated last week
- Copy of awslabs/autogluon tutorial notebooks☆21Sep 20, 2022Updated 3 years ago
- ☆19Dec 1, 2019Updated 6 years ago
- Generate massive fake datasets for your datalake, fast. By SOMA☆20Apr 17, 2026Updated last month
- Introduction to Modern Data Analytics Tools Docker, Airbyte, DBT, Apache Superset with Brazilian Ecommerce Data & Applying RFM in DBT☆13Sep 8, 2022Updated 3 years ago
- Material for the Berlin Bayesian reading group covering Statistical Rethinking by Richard McElreath☆10May 7, 2020Updated 6 years ago