Breaking Into Data Handbook
☆364Jun 29, 2024Updated last year
Alternatives and similar repositories for break-into-data-handbook
Users that are interested in break-into-data-handbook are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Break Into Data 30 Day ML Challenge☆43May 11, 2024Updated last year
- ☆10Jan 4, 2019Updated 7 years ago
- This is a repo with links to everything you'd ever want to learn about data engineering☆41,171Apr 2, 2026Updated last month
- Everything about LLMs in production.☆79Jun 29, 2024Updated last year
- Public data and analytics for our open course☆34Mar 22, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A guide for technical professionals looking to start consulting☆1,527Jan 3, 2025Updated last year
- velib-v2: An ETL pipeline that employs batch and streaming jobs using Spark, Kafka, Airflow, and other tools, all orchestrated with Docke…☆20Aug 12, 2025Updated 8 months ago
- Data Engineering Practice Problems☆2,662Jan 8, 2025Updated last year
- End-to-end Azure DE project with Australia Health Expenditure dataset. Services used include Azure Data Factory, DataBricks, Data Lake, K…☆13Feb 25, 2024Updated 2 years ago
- ☆18Nov 13, 2024Updated last year
- Example FastAPI app deployed to AWS with CDK.☆16Feb 23, 2023Updated 3 years ago
- Repository for code examples from my youtube channel and medium articles working with data in python on AWS☆29Feb 5, 2024Updated 2 years ago
- Data Engineering Zoomcamp is a free 9-week course on building production-ready data pipelines. The next cohort starts in January 2026. Jo…☆40,666Updated this week
- Learn ML engineering for free in 4 months! Register here 👇🏼☆13,044Updated this week
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆14May 2, 2024Updated 2 years ago
- Surfalytics projces on Data Engineering and Analytics☆121Apr 5, 2026Updated last month
- Analytics Engineering best practices and standards used at Hiflylabs☆12Jul 7, 2025Updated 10 months ago
- Curated Data Science resources (Free & Paid) to help aspiring and experienced data scientists learn, grow, and advance their careers.☆1,415Dec 15, 2025Updated 4 months ago
- Example Repo for the Udemy Course "Deployment of Machine Learning Models"☆20Mar 6, 2020Updated 6 years ago
- Simple audio AE☆13Nov 10, 2024Updated last year
- A list of useful resources to learn Data Engineering from scratch☆3,990Jun 19, 2024Updated last year
- Compute and store real-time features for crypto trading using Bytwax (stream processing) and Hopsworks (Feature Store)☆146Jun 28, 2023Updated 2 years ago
- A reddit sentiment analysis application. Allows users to search for a subreddit and get a sentiment report of the overall subreddit and p…☆22Sep 2, 2025Updated 8 months ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- ☆11Apr 7, 2022Updated 4 years ago
- Machine Learning Interviews from FAANG, Snapchat, LinkedIn. I have offers from Snapchat, Coupang, Stitchfix etc. Blog: mlengineer.io.☆12,505Aug 31, 2023Updated 2 years ago
- In this repository we store all materials for dlt workshops, courses, etc.☆258Mar 11, 2026Updated last month
- Generate synthetic Spotify music stream dataset to create dashboards. Spotify API generates fake event data emitted to Kafka. Spark consu…☆72Dec 17, 2023Updated 2 years ago
- Engineering Management Leadership handbook☆35Feb 14, 2024Updated 2 years ago
- This is a public repository to go over all the LLM-driven data engineering concepts.☆1,148Oct 26, 2024Updated last year
- Example projects built on MotherDuck☆49Apr 26, 2026Updated 2 weeks ago
- A comprehensive Python package template to kickstart and standardize your MLOps initiatives and data pipelines.☆1,407Jan 25, 2026Updated 3 months ago
- Official curricula for the LLMOPs course at Duke University☆104Jun 4, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Biwhitened PCA☆15Mar 18, 2026Updated last month
- Personal Data Engineering Projects☆1,011Feb 8, 2023Updated 3 years ago
- The Data Study Hall is a repo used for storing and distributing references, reviews, and learning activities designed by myself to teach …☆11Jul 31, 2022Updated 3 years ago
- ☆14Aug 10, 2021Updated 4 years ago
- Open Data Stack Platform: a collection of projects and pipelines built with open data stack tools for scalable, observable data platform…☆22Mar 29, 2026Updated last month
- Fork ou laboratório técnico usado para estudo, testes e referência.☆22Oct 10, 2023Updated 2 years ago
- ☆10Jul 12, 2023Updated 2 years ago