Code base for CDE bootcamp
☆77Jan 17, 2026Updated 4 months ago
Alternatives and similar repositories for CDE-BOOTCAMP
Users that are interested in CDE-BOOTCAMP are comparing it to the libraries listed below.
- Simple NYSE Simulator on MemSQL☆19Nov 5, 2015Updated 10 years ago
- Code for DE101 book at https://de101.startdataengineering.com/☆105Feb 22, 2026Updated 2 months ago
- A dockerised Airflow repository to automate API calls to your bank account (only Monzo currently supported) to store transactions in a Po…☆28Dec 25, 2022Updated 3 years ago
- 2021 - Github companion to "Demand Prediction in Retail: A Practical Guide to Leverage Data and Predictive Analytics" (Springer Series in…☆37Jul 3, 2021Updated 4 years ago
- A simple and easy to use Data Quality (DQ) tool built with Python.☆51Sep 7, 2023Updated 2 years ago
- ☆95Sep 14, 2022Updated 3 years ago
- A real-time reddit data streaming pipeline for sentiment analysis of various subreddits☆146Aug 23, 2023Updated 2 years ago
- ☆171May 20, 2022Updated 4 years ago
- Sample project to demonstrate data engineering best practices☆220Feb 24, 2024Updated 2 years ago
- Data Engineering Zoomcamp is a free 9-week course on building production-ready data pipelines. The next cohort starts in January 2026. Jo…☆40,809May 3, 2026Updated 2 weeks ago
- Code from the book Fighting Churn With Data☆313Aug 2, 2025Updated 9 months ago
- Projects done in the Data Engineering Nanodegree by Udacity.com☆274Mar 1, 2026Updated 2 months ago
- Source Code for 'Hands-on Time Series Analysis with Python' by B V Vishwas and Ashish Patel☆372Sep 8, 2020Updated 5 years ago
- A cross-platform (Windows, MAC, Linux) desktop application to view common bigdata binary format like Parquet, ORC, AVRO, etc. Support loc…☆306Oct 8, 2025Updated 7 months ago
- Delta Lake helper methods in PySpark☆329Jan 19, 2026Updated 4 months ago
- A Challenge To Test My SQL Skills For A Braintree Data Analyst Interview Process☆392Apr 25, 2018Updated 8 years ago
- Includes notes on using Apache Spark, with drill down on Spark for Physics, how to run TPCDS on PySpark, how to create histograms with S…☆460May 6, 2026Updated 2 weeks ago
- The resources of the preparation course for Databricks Data Engineer Associate certification exam☆640Dec 26, 2025Updated 4 months ago
- PySpark Cheat Sheet - example code to help you learn PySpark and develop apps faster☆493Oct 15, 2024Updated last year
- 📝 Design doc template & examples for machine learning systems (requirements, methodology, implementation, etc.)☆695Mar 16, 2023Updated 3 years ago
- My Insight Data Engineering Fellowship project. I implemented a big data processing pipeline based on lambda architecture, that aggrega…☆506Aug 24, 2022Updated 3 years ago
- api versioning for fastapi web applications☆845Jul 25, 2023Updated 2 years ago
- Learn how to design, develop, deploy and iterate on production-grade ML applications.☆3,358Aug 16, 2024Updated last year
- An end-to-end implementation of intent prediction with Metaflow and other cool tools☆874Jun 16, 2023Updated 2 years ago
- A data engineering project with Kafka, Spark Streaming, dbt, Docker, Airflow, Terraform, GCP and much more!☆876Apr 16, 2022Updated 4 years ago
- Personal Data Engineering Projects☆1,014Feb 8, 2023Updated 3 years ago
- A comprehensive Python package template to kickstart and standardize your MLOps initiatives and data pipelines.☆1,407Jan 25, 2026Updated 3 months ago
- The Data Engineering Cookbook☆15,088Jan 17, 2026Updated 4 months ago
- More than 2000+ Data engineer interview questions.☆1,599Jan 13, 2026Updated 4 months ago
- Public repo for DeepLearning.AI MLEP Specialization☆1,964Oct 28, 2024Updated last year
- Feathr – A scalable, unified data and AI engineering platform for enterprise☆1,929Apr 4, 2024Updated 2 years ago
- Asyncer, async and await, focused on developer experience.☆2,431May 12, 2026Updated last week
- A library of sklearn compatible categorical variable encoders☆2,489May 5, 2026Updated 2 weeks ago
- Open Content for self-directed learning in data science☆2,978May 24, 2025Updated 11 months ago
- An Awesome List of Open-Source Data Engineering Projects☆3,181Oct 4, 2024Updated last year
- Async database support for Python. 🗄☆4,005May 21, 2024Updated last year
- This is a repo with links to everything you'd ever want to learn about data engineering☆41,347Apr 2, 2026Updated last month
- Learn AI/ML for beginners with a roadmap and free resources.☆4,362May 9, 2026Updated last week
- Notebooks using the Hugging Face libraries 🤗☆4,553Updated this week