GokuMohandas / data-engineering
Construct a modern data stack and orchestrate the workflows to create high-quality data for analytics and ML applications.
☆213Updated 2 years ago
Alternatives and similar repositories for data-engineering:
Users who are interested in data-engineering are comparing it to the libraries listed below
- Using a feature store to connect the DataOps and MLOps workflows to enable collaborative teams to develop efficiently.☆56Updated 2 years ago
- Practical Deep Learning at Scale with MLFlow, published by Packt☆159Updated last year
- Curriculum and roadmap from 0 to Mastery for MLOps. Adding value to your machine learning model by deploying it for people to use it to s…☆182Updated 3 years ago
- Learn by doing: DIY project groups at DataTalks.Club☆399Updated 9 months ago
- ☆33Updated last year
- Demo for CI/CD in a machine learning project☆104Updated last year
- LLM-powered RAG Question Answering Slack bot for DataTalksClub Zoomcamps☆54Updated last month
- ML Zoomcamp fall 2021 homework and stuff☆63Updated 3 years ago
- An MLOps platform using prefect, mlflow, FastAPI, Prometheus/Grafana and streamlit☆81Updated 2 years ago
- Develop and deploy a real-time feature pipeline in Python, using Bytewax 🐝 and Hopsworks Feature Store.☆133Updated last year
- Learn how to create reliable ML systems by testing code, data and models.☆86Updated 2 years ago
- This repo contains all the material developed during the 9-week bootcamp provided by DPhi in collaboration with DataTalks Club☆21Updated 2 years ago
- An end-to-end project on customer segmentation☆81Updated 2 years ago
- A book of subtle code tricks and gem resources for all things data, machine learning and deep learning.☆165Updated 6 months ago
- A stripped-down version of MLOps Zoomcamp (1.5-hour workshop)☆36Updated last year
- Data analytics interview questions and answers☆61Updated 4 years ago
- MLOps maturity assessment☆60Updated last year
- Learning paths for data roles☆128Updated 4 years ago
- Data pipeline for extracting, transforming, and visualising Covid-19 data☆14Updated last year
- Code for the "Build Your Own Search Engine" workshop☆80Updated 8 months ago
- A project from the ml_ops Zoomcamp (DataTalks) using Semiconductor data☆22Updated 2 years ago
- ☆108Updated 2 years ago
- Materials for my 2021 NYU class on NLP and ML Systems (Master of Engineering).☆96Updated 2 years ago
- A simple guide to MLOps through ZenML and its various integrations.☆186Updated last year
- ☆14Updated last year
- Reference code base for ML Engineering, Manning Publications☆126Updated 3 years ago
- A curated list of awesome open source tools and commercial products for monitoring data quality, monitoring model performance, and profil…☆73Updated 10 months ago
- 🧪 Simple data science experimentation & tracking with jupyter, papermill, and mlflow.☆179Updated 8 months ago
- 🛠 Python project template with unit tests, code coverage, linting, type checking, Makefile wrapper, and GitHub Actions.☆145Updated 11 months ago
- Serverless Machine Learning Course for building AI-enabled Prediction Services from models and features☆560Updated 5 months ago