jonathandinu / spark-ray-data-science
Supporting content (slides and exercises) for the Pearson video series covering best practices for developing scalable applications with Spark and Ray in the context of a data scientist's standard workflow.
☆48Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for spark-ray-data-science
- Introduction to Ray Core Design Patterns and APIs.☆61Updated 10 months ago
- The project completed for MLops Engineering Lab #1 by Team #1. See our wiki for more info☆16Updated 3 years ago
- Deploy A/B testing infrastructure in a containerized microservice architecture for Machine Learning applications.☆39Updated last year
- Best practices for engineering ML pipelines.☆37Updated 2 years ago
- O'Reilly Katacoda☆56Updated 2 years ago
- ☆18Updated 3 years ago
- Projects developed by Domino's R&D team☆76Updated 2 years ago
- Instant search for and access to many datasets in Pyspark.☆34Updated 2 years ago
- ☆26Updated 2 years ago
- An example MLFlow project☆48Updated 2 years ago
- A PaaS End-to-End ML Setup with Metaflow, Serverless and SageMaker.☆37Updated 3 years ago
- Scaling Python Machine Learning☆44Updated last year
- Using a feature store to connect the DataOps and MLOps workflows to enable collaborative teams to develop efficiently.☆54Updated 2 years ago
- Powerful rapid automatic EDA and feature engineering library with a very easy to use API 🌟☆53Updated 2 years ago
- Feast AWS guide using Redshift / Spectrum / DynamoDB to build a credit scoring model☆61Updated 3 years ago
- A tutorial that helps Big Data Engineers ramp up faster by getting familiar with PySpark dataframes and functions. It also covers topics …☆18Updated 3 years ago
- This is a repository for the Duke University Cloud Computing course project on Serveless Data Engineering Pipeline. For this project, I r…☆19Updated 3 years ago
- ☆27Updated last year
- 🐍 Material for PyData Global 2021 Presentation: Effective Testing for Machine Learning Projects☆81Updated 2 years ago
- This project is created to promote and advocate the use of FOSS machine learning.☆44Updated 2 months ago
- ☆11Updated 2 years ago
- Materials for my 2021 NYU class on NLP and ML Systems (Master of Engineering).☆96Updated last year
- A MLOps platform using prefect, mlflow, FastAPI, Prometheus/Grafana und streamlit☆73Updated 2 years ago
- Operations Research Algorithms☆17Updated 8 months ago
- real-time data + ML pipeline☆54Updated this week
- ☆42Updated last year
- ☆25Updated 2 years ago
- A curated list of awesome open source tools and commercial products for monitoring data quality, monitoring model performance, and profil…☆64Updated 6 months ago
- ForML - A development framework and MLOps platform for the lifecycle management of data science projects☆104Updated last year
- Code examples for the Introduction to Kubeflow course☆13Updated 3 years ago