google / space
Unified storage framework for the entire machine learning lifecycle
☆156Updated last year
Alternatives and similar repositories for space:
Users that are interested in space are comparing it to the libraries listed below
- Ray - A curated list of resources: https://github.com/ray-project/ray☆59Updated 3 months ago
- Drift detection module for machine learning pipelines.☆25Updated last year
- experiments with inference on llama☆104Updated 11 months ago
- TorchFix - a linter for PyTorch-using code with autofix support☆140Updated 3 months ago
- Merlin Systems provides tools for combining recommendation models with other elements of production recommender systems (like feature sto…☆91Updated 11 months ago
- Parquet-based ML data format optimized for working with unstructured data☆140Updated 2 years ago
- FIL backend for the Triton Inference Server☆77Updated last week
- WIP. Veloce is a low-code Ray-based parallelization library that makes machine learning computation novel, efficient, and heterogeneous.☆18Updated 2 years ago
- Ray-based Apache Beam runner☆42Updated last year
- A performant, memory-efficient checkpointing library for PyTorch applications, designed with large, complex distributed workloads in mind…☆157Updated 5 months ago
- cuVS - a library for vector search and clustering on the GPU☆394Updated this week
- Tracking Ray Enhancement Proposals☆53Updated last month
- A lightweight wrapper for PyTorch that provides a simple declarative API for context switching between devices, distributed modes, mixed-…☆67Updated last year
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o…☆131Updated 4 months ago
- PyTorch per step fault tolerance (actively under development)☆293Updated this week
- TorchX is a universal job launcher for PyTorch applications. TorchX is designed to have fast iteration time for training/research and sup…☆361Updated this week
- A portable Pythonic Data Lakehouse powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to …☆210Updated this week
- UniSim is a package for efficient similarity computation, fuzzy matching, and clustering of data.☆135Updated last month
- Python SDK for XetHub☆50Updated 6 months ago
- MLFlow Deployment Plugin for Ray Serve☆44Updated 3 years ago
- Python API for https://vespa.ai, the open big data serving engine☆122Updated this week
- Mobius is an AI infrastructure platform for distributed online learning, including online sample processing, training and serving.☆97Updated 10 months ago
- Code repository for the paper - "AdANNS: A Framework for Adaptive Semantic Search"☆64Updated last year
- IbisML is a library for building scalable ML pipelines using Ibis.☆108Updated 4 months ago
- Fine-tuning LLMs on Flyte and Union Cloud☆28Updated last year
- Distributed skorch on Ray Train☆57Updated 2 years ago
- 🤝 Trade any tensors over the network☆30Updated last year
- 🚀 Stream inferences of real-time ML models in production to any data lake (Experimental)☆80Updated 2 years ago
- Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and te…☆42Updated last year
- ☆178Updated this week