google / space
Unified storage framework for the entire machine learning lifecycle
☆155Updated last year
Alternatives and similar repositories for space
Users that are interested in space are comparing it to the libraries listed below
- Ray - A curated list of resources: https://github.com/ray-project/ray ☆60Updated 4 months ago
- Ray-based Apache Beam runner☆42Updated last year
- ☆58Updated last year
- MLFlow Deployment Plugin for Ray Serve☆45Updated 3 years ago
- UniSim is a package for efficient similarity computation, fuzzy matching, and clustering of data.☆135Updated 2 months ago
- TorchX is a universal job launcher for PyTorch applications. TorchX is designed to have fast iteration time for training/research and sup…☆366Updated 2 weeks ago
- Tracking Ray Enhancement Proposals☆54Updated 2 months ago
- A portable Pythonic Data Lakehouse powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to …☆218Updated this week
- vLLM adapter for a TGIS-compatible gRPC server.☆30Updated this week
- IbisML is a library for building scalable ML pipelines using Ibis.☆109Updated 5 months ago
- WIP. Veloce is a low-code Ray-based parallelization library that makes machine learning computation novel, efficient, and heterogeneous.☆18Updated 2 years ago
- PyTorch per step fault tolerance (actively under development)☆302Updated this week
- Chassis turns machine learning models into portable container images that can run just about anywhere.☆86Updated last year
- Multi-threaded matrix multiplication and cosine similarity calculations for dense and sparse matrices. Appropriate for calculating the K …☆81Updated 5 months ago
- Benchmark for machine learning model online serving (LLM, embedding, Stable-Diffusion, Whisper)☆28Updated last year
- ML/DL Math and Method notes☆61Updated last year
- Tune efficiently any LLM model from HuggingFace using distributed training (multiple GPU) and DeepSpeed. Uses Ray AIR to orchestrate the …☆58Updated last year
- Check for data drift between two OpenAI multi-turn chat jsonl files.☆37Updated last year
- XTR/WARP (SIGIR'25) is an extremely fast and accurate retrieval engine based on Stanford's ColBERTv2/PLAID and Google DeepMind's XTR.☆130Updated last month
- High performance model preprocessing library on PyTorch☆650Updated last year
- Merlin Systems provides tools for combining recommendation models with other elements of production recommender systems (like feature sto…☆93Updated 11 months ago
- High-Performance Engine for Multi-Vector Search☆80Updated this week
- ClearML - Model-Serving Orchestration and Repository Solution☆150Updated 4 months ago
- Drift detection module for machine learning pipelines.☆25Updated last year
- Distributed XGBoost on Ray☆148Updated 11 months ago
- 🤝 Trade any tensors over the network☆30Updated last year
- A performant, memory-efficient checkpointing library for PyTorch applications, designed with large, complex distributed workloads in mind…☆157Updated 5 months ago
- Ibis Substrait Compiler☆102Updated this week
- ArcticTraining is a framework designed to simplify and accelerate the post-training process for large language models (LLMs)☆105Updated this week
- Components that I have created for Kubeflow Pipelines. Try them in https://cloud-pipelines.net/pipeline-editor/☆14Updated 3 weeks ago