Code base for CDE bootcamp
☆77Jun 2, 2026Updated this week
Alternatives and similar repositories for CDE-BOOTCAMP
Users who are interested in CDE-BOOTCAMP are comparing it to the libraries listed below.
- A step-by-step learning journey with dltHub: building modern, Python-based data ingestion pipelines from beginner to advanced.☆29Oct 17, 2025Updated 7 months ago
- ☆21Aug 8, 2024Updated last year
- ☆20Apr 3, 2024Updated 2 years ago
- Function for automatically detecting Simpson's Paradox☆18Jan 17, 2021Updated 5 years ago
- Code for DE101 book at https://de101.startdataengineering.com/☆108Feb 22, 2026Updated 3 months ago
- A simple and easy to use Data Quality (DQ) tool built with Python.☆51Sep 7, 2023Updated 2 years ago
- How to build and deploy an anonymization API with FastAPI and SpaCy☆70Jul 21, 2021Updated 4 years ago
- In this repository we store all materials for dlt workshops, courses, etc.☆258May 27, 2026Updated last week
- ☆95Sep 14, 2022Updated 3 years ago
- Sample project to demonstrate data engineering best practices☆220Feb 24, 2024Updated 2 years ago
- Data Engineering Zoomcamp is a free 9-week course on building production-ready data pipelines. The next cohort starts in January 2026. Jo…☆42,113May 3, 2026Updated last month
- This is a public repository to go over all the LLM-driven data engineering concepts.☆1,152Oct 26, 2024Updated last year
- A simple implementation of Google's Quick, Draw Project for humans. 🖌️ 🖼️☆235Jun 9, 2025Updated last year
- Projects done in the Data Engineering Nanodegree by Udacity.com☆275Mar 1, 2026Updated 3 months ago
- Source Code for 'Hands-on Time Series Analysis with Python' by B V Vishwas and Ashish Patel☆372Sep 8, 2020Updated 5 years ago
- Getting started with PySpark and MLlib☆299May 7, 2018Updated 8 years ago
- Learn how to create, develop, and maintain a state-of-the-art MLOps code base☆702Apr 27, 2026Updated last month
- ☆395Jan 26, 2025Updated last year
- 📝 Design doc template & examples for machine learning systems (requirements, methodology, implementation, etc.)☆704Mar 16, 2023Updated 3 years ago
- My Insight Data Engineering Fellowship project. I implemented a big data processing pipeline based on lambda architecture, that aggrega…☆505Aug 24, 2022Updated 3 years ago
- Fuzzy string matching, grouping, and evaluation.☆798Jul 10, 2025Updated 10 months ago
- An end-to-end implementation of intent prediction with Metaflow and other cool tools☆876Jun 16, 2023Updated 2 years ago
- This repository is used for one of the projects in Udacity's Front-End Web Developer Nanodegree program. Learn how to become a Front-End …☆1,232Jun 28, 2022Updated 3 years ago
- A data engineering project with Kafka, Spark Streaming, dbt, Docker, Airflow, Terraform, GCP and much more!☆881Apr 16, 2022Updated 4 years ago
- Personal Data Engineering Projects☆1,015Feb 8, 2023Updated 3 years ago
- A comprehensive Python package template to kickstart and standardize your MLOps initiatives and data pipelines.☆1,410Jan 25, 2026Updated 4 months ago
- All things systems design. Resources, interview questions, etc.☆1,040Aug 27, 2021Updated 4 years ago
- Contains files related to content and project of DSND Term 2☆1,093Dec 6, 2022Updated 3 years ago
- Port(ish) of Great Expectations to dbt test macros☆1,227Dec 16, 2024Updated last year
- Starter project code for students taking Udacity ud120☆1,643Apr 23, 2024Updated 2 years ago
- The Data Engineering Cookbook☆15,132Jan 17, 2026Updated 4 months ago
- the portable Python dataframe library☆6,567Updated this week
- More than 2,000 data engineer interview questions.☆1,633Jan 13, 2026Updated 4 months ago
- Basic Utilities for PyTorch Natural Language Processing (NLP)☆2,227Jul 4, 2023Updated 2 years ago
- From the team behind Gatsby, Mastra is a framework for building AI-powered applications and agents with a modern TypeScript stack.☆24,705Updated this week
- Machine Learning University: Accelerated Natural Language Processing Class☆2,432Oct 13, 2024Updated last year
- Free MLOps course from DataTalks.Club☆14,722May 3, 2026Updated last month
- Few projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake developme…☆1,914Aug 26, 2022Updated 3 years ago
- emmet for vim: http://emmet.io/☆6,462Mar 24, 2026Updated 2 months ago