feathr-ai / feathr
Feathr – A scalable, unified data and AI engineering platform for enterprise
☆1,986Updated 7 months ago
Related projects ⓘ
Alternatives and complementary repositories for feathr
- Hopsworks - Data-Intensive AI platform with a Feature Store☆1,166Updated 2 weeks ago
- The Open Source Feature Store for Machine Learning☆5,613Updated this week
- The Virtual Feature Store. Turn your existing data infrastructure into a feature store.☆1,818Updated this week
- Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet f…☆1,801Updated 11 months ago
- A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rew…☆2,013Updated last month
- Compare MLOps Platforms. Breakdowns of SageMaker, VertexAI, AzureML, Dataiku, Databricks, h2o, kubeflow, mlflow...☆383Updated 2 years ago
- An open-source data logging library for machine learning models and data pipelines. 📚 Provides visibility into data quality & model perf…☆2,653Updated this week
- Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io☆1,913Updated last week
- 🐶 A tool to package, serve, and deploy any ML model on any platform. Archived to be resurrected one day🤞☆717Updated last year
- A model-agnostic visual debugging tool for machine learning☆1,651Updated last year
- MLRun is an open source MLOps platform for quickly building and managing continuous ML applications across their lifecycle. MLRun integra…☆1,446Updated this week
- 🏬 modelstore is a Python library that allows you to version, export, and save a machine learning model to your filesystem or a cloud sto…☆375Updated 6 months ago
- For recording and retrieving metadata associated with ML developer and data scientist workflows.☆626Updated 3 weeks ago
- MLeap: Deploy ML Pipelines to Production☆1,504Updated last week
- ☆704Updated 2 years ago
- Tuplex is a parallel big data processing framework that runs data science pipelines written in Python at the speed of compiled code. Tupl…☆810Updated 7 months ago
- An open protocol for secure data sharing☆770Updated last week
- An end-to-end implementation of intent prediction with Metaflow and other cool tools☆847Updated last year
- ML pipeline orchestration and model deployments on Kubernetes.☆435Updated last year
- Nessie: Transactional Catalog for Data Lakes with Git-like semantics☆1,040Updated this week
- Chronon is a data platform for serving for AI/ML applications.☆743Updated this week
- An Open Standard for lineage metadata collection☆1,772Updated this week
- Scalable identity resolution, entity resolution, data mastering and deduplication using ML☆957Updated this week
- Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, v…☆3,964Updated this week
- Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows, that encode lineage/tracing and metada…☆1,874Updated this week
- An MLOps framework to package, deploy, monitor and manage thousands of production machine learning models☆4,386Updated this week
- Open Source ML Model Versioning, Metadata, and Experiment Management☆1,702Updated 3 months ago
- High performance model preprocessing library on PyTorch☆649Updated 7 months ago
- TFX is an end-to-end platform for deploying production ML pipelines☆2,114Updated this week