logicalclocks / hopsworks
Hopsworks - Data-Intensive AI platform with a Feature Store
☆1,169Updated 2 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for hopsworks
- ☆704Updated 2 years ago
- Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet f…☆1,800Updated 11 months ago
- Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.☆3,312Updated last month
- A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rew…☆2,012Updated 2 months ago
- MLeap: Deploy ML Pipelines to Production☆1,504Updated last week
- Jupyter magics and kernels for working with remote Spark clusters☆1,330Updated this week
- MLRun is an open source MLOps platform for quickly building and managing continuous ML applications across their lifecycle. MLRun integra…☆1,447Updated this week
- Python API for Deequ☆731Updated last month
- Nessie: Transactional Catalog for Data Lakes with Git-like semantics☆1,042Updated this week
- An open protocol for secure data sharing☆771Updated last week
- For recording and retrieving metadata associated with ML developer and data scientist workflows.☆626Updated 3 weeks ago
- Compare MLOps Platforms. Breakdowns of SageMaker, VertexAI, AzureML, Dataiku, Databricks, h2o, kubeflow, mlflow...☆383Updated 2 years ago
- RayDP provides simple APIs for running Spark on Ray and integrating Spark with AI libraries.☆317Updated this week
- Kubeflow’s superfood for Data Scientists☆632Updated last year
- Feathr – A scalable, unified data and AI engineering platform for enterprise☆1,986Updated 7 months ago
- Joblib Apache Spark Backend☆242Updated 3 months ago
- The Open Source Feature Store for Machine Learning☆5,615Updated this week
- An inference server for your machine learning models, including support for multiple frameworks, multi-model serving and more☆724Updated this week
- Open Source ML Model Versioning, Metadata, and Experiment Management☆1,702Updated 3 months ago
- Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io☆1,915Updated last week
- Library for exploring and validating machine learning data☆765Updated this week
- Monitor the stability of a Pandas or Spark dataframe ⚙︎☆497Updated 2 months ago
- Spark RAPIDS plugin - accelerate Apache Spark with GPUs☆822Updated this week
- A tool for building feature stores.☆283Updated last month
- Distributed SQL Engine in Python using Dask☆397Updated 2 months ago
- 🐶 A tool to package, serve, and deploy any ML model on any platform. Archived to be resurrected one day🤞☆717Updated last year
- Python - Java/Scala API for the Hopsworks feature store☆53Updated this week
- This is the development repository for sparkMeasure, a tool and library designed for efficient analysis and troubleshooting of Apache Spa…☆708Updated 3 months ago
- An Open Standard for lineage metadata collection☆1,773Updated this week