google / space
Unified storage framework for the entire machine learning lifecycle
☆156Updated 10 months ago
Alternatives and similar repositories for space:
Users that are interested in space are comparing it to the libraries listed below
- Ray - A curated list of resources: https://github.com/ray-project/ray☆48Updated this week
- Drift detection module for machine learning pipelines.☆21Updated last year
- UniSim is a package for efficient similarity computation, fuzzy matching, and clustering of data.☆124Updated last month
- PyTorch per step fault tolerance (actively under development)☆226Updated this week
- Transform datasets at scale. Optimize datasets for fast AI model training.☆406Updated last week
- experiments with inference on llama☆104Updated 7 months ago
- FIL backend for the Triton Inference Server☆76Updated 3 weeks ago
- IbisML is a library for building scalable ML pipelines using Ibis.☆96Updated last month
- A performant, memory-efficient checkpointing library for PyTorch applications, designed with large, complex distributed workloads in mind…☆153Updated last month
- WIP. Veloce is a low-code Ray-based parallelization library that makes machine learning computation novel, efficient, and heterogeneous.☆18Updated 2 years ago
- Check for data drift between two OpenAI multi-turn chat jsonl files.☆37Updated 9 months ago
- Python API for https://vespa.ai, the open big data serving engine☆113Updated this week
- Machine Learning Serving focused on GenAI with simplicity as the top priority.☆58Updated 3 weeks ago
- Multi-threaded matrix multiplication and cosine similarity calculations for dense and sparse matrices. Appropriate for calculating the K …☆79Updated last month
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o…☆119Updated last month
- Vector Database with support for late interaction and token level embeddings.☆51Updated 4 months ago
- Scalable and Performant Data Loading☆211Updated this week
- ☆58Updated 10 months ago
- Benchmark for machine learning model online serving (LLM, embedding, Stable-Diffusion, Whisper)☆28Updated last year
- cuVS - a library for vector search and clustering on the GPU☆278Updated this week
- Slides and recordings of talks hosted by our community☆19Updated 7 months ago
- A library for building and serving multi-node distributed faiss indices.☆261Updated last year
- End-to-End LLM Guide☆99Updated 6 months ago
- Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and te…☆42Updated last year
- Self-host LLMs with vLLM and BentoML☆79Updated 2 weeks ago
- Serverless Python with Ray☆54Updated 2 years ago
- A stand-alone implementation of several NumPy dtype extensions used in machine learning.☆240Updated this week
- Squirrel dataset hub☆42Updated last year
- PyTorch centric eager mode debugger☆44Updated last month