GokuMohandas / data-engineering
Construct a modern data stack and orchestration the workflows to create high quality data for analytics and ML applications.
☆213Updated 2 years ago
Alternatives and similar repositories for data-engineering:
Users that are interested in data-engineering are comparing it to the libraries listed below
- Curriculum and roadmap from 0 to Mastery for MLOps. Adding value to your machine learning model by deploying it for people to use it to s…☆183Updated 3 years ago
- This repo contains all the material developed during the 9-week bootcamp provided by DPhi in colaboration with DataTalks Club☆21Updated 2 years ago
- Learn by doing: DIY project groups at DataTalks.Club☆401Updated 10 months ago
- ML Zoomcamp fall 2021 homework and stuff☆64Updated 3 years ago
- Using a feature store to connect the DataOps and MLOps workflows to enable collaborative teams to develop efficiently.☆56Updated 2 years ago
- An end-to-end project on customer segmentation☆81Updated 2 years ago
- Demo for CI/CD in a machine learning project☆104Updated last year
- Learning paths for data roles☆128Updated 4 years ago
- Develop and deploy a real-time feature pipeline in Python, using Bytewax 🐝 and Hopsworks Feature Store.☆134Updated last year
- Learn how to create reliable ML systems by testing code, data and models.☆86Updated 2 years ago
- Data analytics interview questions and answers☆60Updated 4 years ago
- Practical Deep Learning at Scale with MLFlow, published by Packt☆159Updated last year
- ☆33Updated last year
- A MLOps platform using prefect, mlflow, FastAPI, Prometheus/Grafana und streamlit☆82Updated 2 years ago
- Data pipeline for extracting, transforming, and visualising Covid-19 data☆14Updated last year
- A book of subtle code tricks and gem resources for all things data, machine learning and deep learning.☆165Updated 7 months ago
- A list of awesome data podcasts☆373Updated last year
- The web page for DataTalks.Club☆206Updated this week
- A project from the ml_ops Zoomcamp (DataTalks) using Semiconductor data☆22Updated 2 years ago
- 🛠 Python project template with unit tests, code coverage, linting, type checking, Makefile wrapper, and GitHub Actions.☆145Updated last year
- 🧪 Simple data science experimentation & tracking with jupyter, papermill, and mlflow.☆182Updated 9 months ago
- A quick reference guide to the most commonly used patterns and functions in PySpark SQL☆54Updated 3 years ago
- Fetch, transform and plot real-time OHLC data from Coinbase using Bytewax, Bokeh and Streamlit☆127Updated 11 months ago
- Classwork projects and home works done through Udacity data engineering nano degree☆74Updated last year
- Udacity Data Engineering Nanodegree Program☆52Updated 4 years ago
- Awesome list of resources for analytics engineers☆25Updated 3 years ago
- Example project with a complete MLOps cycle: versioning data, generating reports on pull requests and deploying the model on releases wit…☆48Updated 3 years ago
- Public data and analytics for our open course☆32Updated last year
- Code for the "Build Your Own Search Engine" workshop☆83Updated 9 months ago
- LLM-powered RAG Question Answering Slack bot for DataTalksClub Zoomcamps☆56Updated 2 months ago