google / space
Unified storage framework for the entire machine learning lifecycle
☆146Updated 6 months ago
Related projects: ⓘ
- cuVS - a library for vector search and clustering on the GPU☆170Updated this week
- Ray-based Apache Beam runner☆41Updated last year
- Multi-threaded matrix multiplication and cosine similarity calculations for dense and sparse matrices. Appropriate for calculating the K …☆73Updated 2 months ago
- LOTUS: The semantic query engine - process data with LMs as easily as writing pandas code☆164Updated 3 weeks ago
- A portable Pythonic Data Catalog API powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture t…☆147Updated this week
- TorchFix - a linter for PyTorch-using code with autofix support☆73Updated last week
- Slides and recordings of talks hosted by our community☆18Updated 3 months ago
- Tune efficiently any LLM model from HuggingFace using distributed training (multiple GPU) and DeepSpeed. Uses Ray AIR to orchestrate the …☆50Updated last year
- IbisML is a library for building scalable ML pipelines using Ibis.☆81Updated this week
- Serverless Python with Ray☆52Updated last year
- Drift detection module for machine learning pipelines.☆20Updated last year
- experiments with inference on llama☆106Updated 3 months ago
- UniSim is a package for efficient similarity computation, fuzzy matching, and clustering of data.☆109Updated last month
- ☆26Updated last year
- Tracking Ray Enhancement Proposals☆48Updated 3 weeks ago
- Merlin Systems provides tools for combining recommendation models with other elements of production recommender systems (like feature sto…☆87Updated 3 months ago
- A performant, memory-efficient checkpointing library for PyTorch applications, designed with large, complex distributed workloads in mind…☆145Updated this week
- Code repository for the paper - "AdANNS: A Framework for Adaptive Semantic Search"☆57Updated 11 months ago
- ☆58Updated 3 weeks ago
- Feature Engine for real-time AI/ML☆36Updated this week
- Three examples of recommendation system pipelines with NVIDIA Merlin and Redis☆56Updated last year
- Parquet-based ML data format optimized for working with unstructured data☆138Updated last year
- 🚀 Stream inferences of real-time ML models in production to any data lake (Experimental)☆78Updated 2 years ago
- Pretrain, finetune and serve LLMs on Intel platforms with Ray☆95Updated this week
- Distributed XGBoost on Ray☆137Updated 2 months ago
- ☆54Updated 8 months ago
- WIP. Veloce is a low-code Ray-based parallelization library that makes machine learning computation novel, efficient, and heterogeneous.☆18Updated 2 years ago
- ☆55Updated 10 months ago
- Vector Database with support for late interaction and token level embeddings.☆51Updated last week
- A stand-alone implementation of several NumPy dtype extensions used in machine learning.☆194Updated this week