Code for DE101 book at https://de101.startdataengineering.com/
☆111Feb 22, 2026Updated 4 months ago
Alternatives and similar repositories for data_engineering_for_beginners_code
Users that are interested in data_engineering_for_beginners_code are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official ClickHouse Agentic Data Stack - self-host with ClickHouse, LibreChat, Langfuse, and ClickHouse MCP.☆75Jun 18, 2026Updated last week
- ☆15Mar 29, 2024Updated 2 years ago
- Code for blog at: https://www.startdataengineering.com/post/docker-for-de/☆40Apr 29, 2024Updated 2 years ago
- A framework to manage data, continuously☆35Jan 20, 2025Updated last year
- ☆10Mar 19, 2023Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆10Jul 24, 2024Updated last year
- Sample repo for startdataengineering DE 101 free course☆74Jun 24, 2024Updated 2 years ago
- All the ressources and guide to practice the Patou Tips☆34May 3, 2026Updated last month
- Repository for the D ONE MLOps AWS BlogPost☆10May 5, 2026Updated last month
- ☆26Sep 28, 2023Updated 2 years ago
- ☆27Jan 21, 2026Updated 5 months ago
- A step-by-step learning journey with dltHub: building modern, Python-based data ingestion pipelines from beginner to advanced.☆30Oct 17, 2025Updated 8 months ago
- ☆21Aug 8, 2024Updated last year
- Google Cloud Dataflow Examples☆13May 19, 2016Updated 10 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- ☆12Dec 7, 2023Updated 2 years ago
- Cloud Functions streaming insert to BigQuery (with Cloud Pub/Sub trigger). In this example, the function will make a REST API call to get…☆29Aug 28, 2023Updated 2 years ago
- In this repository we store all materials for dlt workshops, courses, etc.☆260May 27, 2026Updated last month
- An open-source Python package that uses AI to predict Nigerian languages, including English, Pidgin, Yoruba, Hausa, and Igbo.☆28Nov 8, 2025Updated 7 months ago
- Data Engineering Best Practices, published by Packt☆27May 17, 2026Updated last month
- 🔥 Preguntas de entrevista, roadmaps, recursos y más para Data Engineering 🔥☆55Feb 25, 2026Updated 4 months ago
- Face Recognition Using CNN in Real-Time Videos☆21Feb 14, 2025Updated last year
- ☆19Feb 25, 2022Updated 4 years ago
- Data Engineering project using Databricks PySpark & Spark SQL for analysing data from Spotify API and present in form of PowerBI report☆53Nov 26, 2025Updated 7 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Firefox extension that shows parquet schema when going over GCP cloud storage. Use DuckDB WASM☆12Jan 19, 2024Updated 2 years ago
- A wallpaper engine that turns life, year, and goal data into beautiful SVG/PNG wallpapers - built with Cloudflare Pages + Workers + resvg…☆56Mar 2, 2026Updated 3 months ago
- RDF storage and SPARQL processing on top of Apache Spark.☆21Oct 5, 2022Updated 3 years ago
- A script/docker that automatically translates PDFs using the DeepL API☆13Jun 14, 2026Updated 2 weeks ago
- Download closed captions from youtube videos (both manual and automatically generated), python implementation☆17Jan 1, 2022Updated 4 years ago
- A Python package extending pandas with helper functions for simpler exploratory data analysis and data wrangling.☆10Feb 6, 2025Updated last year
- A lightweight and flexible analysis pipeline☆12Jun 18, 2026Updated last week
- Integrating Apache Airflow, dbt, Great Expectations and Apache Superset to develop a modern open source data stack.☆18Jun 19, 2022Updated 4 years ago
- Data Analysis and Image Processing Python Course☆12Nov 4, 2014Updated 11 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Show leading/trailing whitespace☆42Updated this week
- Data pipeline to build a data warehouse on Postgres☆15Aug 11, 2024Updated last year
- Apache Ignite Quick Start Guide, published by Packt☆12Jan 30, 2023Updated 3 years ago
- ☆10Jan 24, 2023Updated 3 years ago
- The "World Data Report" is a Power BI project that offers a detailed overview of global data, covering weather, geographical, demographic…☆15Nov 30, 2025Updated 7 months ago
- Building a Data Pipeline with an Open Source Stack☆59Jun 27, 2025Updated last year
- Parses a valid YAML string into a struct which implements the DeserializeOwned trait from serde☆16Aug 9, 2025Updated 10 months ago