GokuMohandas / data-engineeringLinks
Construct a modern data stack and orchestration the workflows to create high quality data for analytics and ML applications.
☆217Updated 2 years ago
Alternatives and similar repositories for data-engineering
Users that are interested in data-engineering are comparing it to the libraries listed below
Sorting:
- Curriculum and roadmap from 0 to Mastery for MLOps. Adding value to your machine learning model by deploying it for people to use it to s…☆183Updated 3 years ago
- ML Zoomcamp fall 2021 homework and stuff☆66Updated 3 years ago
- Develop and deploy a real-time feature pipeline in Python, using Bytewax 🐝 and Hopsworks Feature Store.☆135Updated last year
- Learn by doing: DIY project groups at DataTalks.Club☆408Updated last year
- Practical Deep Learning at Scale with MLFlow, published by Packt☆160Updated last year
- Demo for CI/CD in a machine learning project☆106Updated last year
- An end-to-end project on customer segmentation☆81Updated 2 years ago
- 🧪 Simple data science experimentation & tracking with jupyter, papermill, and mlflow.☆183Updated 11 months ago
- Using a feature store to connect the DataOps and MLOps workflows to enable collaborative teams to develop efficiently.☆56Updated 2 years ago
- Learn how to create reliable ML systems by testing code, data and models.☆87Updated 2 years ago
- A MLOps platform using prefect, mlflow, FastAPI, Prometheus/Grafana und streamlit☆85Updated 2 years ago
- 🛠 Python project template with unit tests, code coverage, linting, type checking, Makefile wrapper, and GitHub Actions.☆147Updated last year
- Learning paths for data roles☆129Updated 4 years ago
- This repo contains all the material developed during the 9-week bootcamp provided by DPhi in colaboration with DataTalks Club☆21Updated 2 years ago
- A project from the ml_ops Zoomcamp (DataTalks) using Semiconductor data☆22Updated 2 years ago
- ☆35Updated 2 years ago
- A book of subtle code tricks and gem resources for all things data, machine learning and deep learning.☆166Updated 9 months ago
- ☆32Updated 2 years ago
- Data analytics interview questions and answers☆62Updated 4 years ago
- A quick reference guide to the most commonly used patterns and functions in PySpark SQL☆55Updated 3 years ago
- Free Open-source ML observability course for data scientists and ML engineers. Learn how to monitor and debug your ML models in productio…☆86Updated last year
- Fetch, transform and plot real-time OHLC data from Coinbase using Bytewax, Bokeh and Streamlit☆129Updated last year
- Code for the "Build Your Own Search Engine" workshop☆100Updated 3 weeks ago
- Learn how to monitor ML systems to identify and mitigate sources of drift before model performance decay.☆86Updated 2 years ago
- Official code repo for the O'Reilly Book - Machine Learning for High-Risk Applications☆103Updated 2 years ago
- A list of awesome data podcasts☆377Updated 2 years ago
- ☆109Updated 2 years ago
- Enrolled in DataTalks Zoomcamp https://github.com/DataTalksClub/mlops-zoomcamp☆21Updated 2 years ago
- LLM-powered RAG Question Answering Slack bot for DataTalksClub Zoomcamps☆56Updated 2 weeks ago
- Exercises performed as part of the ML Zoomcamp course☆30Updated 3 years ago