Breaking Into Data Handbook
☆364Jun 29, 2024Updated last year
Alternatives and similar repositories for break-into-data-handbook
Users who are interested in break-into-data-handbook are comparing it to the repositories listed below.
- Break Into Data 30 Day ML Challenge☆43May 11, 2024Updated last year
- ☆10Jan 4, 2019Updated 7 years ago
- This is a repo with links to everything you'd ever want to learn about data engineering☆40,948Apr 2, 2026Updated 2 weeks ago
- Introduction to Data Science (University of Utah) – Lecture Material☆13Apr 18, 2024Updated 2 years ago
- Everything about LLMs in production.☆79Jun 29, 2024Updated last year
- Public data and analytics for our open course☆34Mar 22, 2024Updated 2 years ago
- A guide for technical professionals looking to start consulting☆1,519Jan 3, 2025Updated last year
- ☆12Jun 9, 2025Updated 10 months ago
- velib-v2: An ETL pipeline that employs batch and streaming jobs using Spark, Kafka, Airflow, and other tools, all orchestrated with Docke…☆20Aug 12, 2025Updated 8 months ago
- ☆19Oct 27, 2025Updated 5 months ago
- Data Engineering Practice Problems☆2,625Jan 8, 2025Updated last year
- End-to-end Azure DE project with Australia Health Expenditure dataset. Services used include Azure Data Factory, DataBricks, Data Lake, K…☆13Feb 25, 2024Updated 2 years ago
- ☆18Nov 13, 2024Updated last year
- Example FastAPI app deployed to AWS with CDK.☆16Feb 23, 2023Updated 3 years ago
- Repository for code examples from my youtube channel and medium articles working with data in python on AWS☆29Feb 5, 2024Updated 2 years ago
- Data Engineering Zoomcamp is a free 9-week course on building production-ready data pipelines. The next cohort starts in January 2026. Jo…☆40,011Apr 8, 2026Updated last week
- Learn ML engineering for free in 4 months! Register here 👇🏼☆12,925Dec 27, 2025Updated 3 months ago
- ☆14May 2, 2024Updated last year
- Surfalytics projects on Data Engineering and Analytics☆121Apr 5, 2026Updated 2 weeks ago
- Analytics Engineering best practices and standards used at Hiflylabs☆12Jul 7, 2025Updated 9 months ago
- Curated Data Science resources (Free & Paid) to help aspiring and experienced data scientists learn, grow, and advance their careers.☆1,410Dec 15, 2025Updated 4 months ago
- Discovering deep embedding spaces for Psychiatric imaging☆16Jan 14, 2018Updated 8 years ago
- Compute and store real-time features for crypto trading using Bytewax (stream processing) and Hopsworks (Feature Store)☆146Jun 28, 2023Updated 2 years ago
- AI-ready open dataset of e-commerce coupons, deals & redeem-links curated by Kindred☆17May 2, 2025Updated 11 months ago
- In this repository we store all materials for dlt workshops, courses, etc.☆257Mar 11, 2026Updated last month
- Generate synthetic Spotify music stream dataset to create dashboards. Spotify API generates fake event data emitted to Kafka. Spark consu…☆71Dec 17, 2023Updated 2 years ago
- This is a public repository to go over all the LLM-driven data engineering concepts.☆1,142Oct 26, 2024Updated last year
- Example projects built on MotherDuck☆48Updated this week
- A production-ready iOS automation MCP server built with FastMCP 2.0, featuring clean modular architecture with complete platform segregat…☆31Jul 26, 2025Updated 8 months ago
- A comprehensive Python package template to kickstart and standardize your MLOps initiatives and data pipelines.☆1,406Jan 25, 2026Updated 2 months ago
- Official curricula for the LLMOPs course at Duke University☆104Jun 4, 2024Updated last year
- Personal Data Engineering Projects☆1,005Feb 8, 2023Updated 3 years ago
- Open Data Stack Platform: a collection of projects and pipelines built with open data stack tools for scalable, observable data platform…☆22Mar 29, 2026Updated 3 weeks ago
- Python Essentials for AWS Cloud Developers, published by Packt.☆11Apr 27, 2023Updated 2 years ago
- Data Science Roadmap from A to Z☆22Oct 10, 2023Updated 2 years ago
- This repo is meant to serve as a guide for Machine Learning/AI technical interviews.☆8,039Nov 28, 2025Updated 4 months ago
- A simple and easy to use Data Quality (DQ) tool built with Python.☆51Sep 7, 2023Updated 2 years ago
- ☆25Apr 6, 2026Updated 2 weeks ago
- Tutorials for the Hopsworks Platform☆317Feb 25, 2026Updated last month