antgroup / ant-rayLinks
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads. AntRay is forked from ray, offering incremental new features on top of the community version.
☆162Updated this week
Alternatives and similar repositories for ant-ray
Users that are interested in ant-ray are comparing it to the libraries listed below
Sorting:
- ☆222Updated 2 years ago
- Mobius is an AI infrastructure platform for distributed online learning, including online sample processing, training and serving.☆99Updated last year
- KV cache store for distributed LLM inference☆378Updated last month
- Efficient and easy multi-instance LLM serving☆520Updated 4 months ago
- OpenEmbedding is an open source framework for Tensorflow distributed training acceleration.☆33Updated 2 years ago
- TePDist (TEnsor Program DISTributed) is an HLO-level automatic distributed system for DL models.☆99Updated 2 years ago
- vsag is a vector indexing library used for similarity search.☆443Updated this week
- GLake: optimizing GPU memory management and IO transmission.☆494Updated 9 months ago
- Easy Parallel Library (EPL) is a general and efficient deep learning framework for distributed model training.☆271Updated 2 years ago
- Open Model Engine (OME) — Kubernetes operator for LLM serving, GPU scheduling, and model lifecycle management. Works with SGLang, vLLM, T…☆352Updated last week
- Some resources about Ray Forward Meetup☆28Updated 2 weeks ago
- The DGL Operator makes it easy to run Deep Graph Library (DGL) graph neural network training on Kubernetes☆44Updated 4 years ago
- Puck is a high-performance ANN search engine☆366Updated 7 months ago
- The driver for LMCache core to run in vLLM☆59Updated 11 months ago
- Tracking Ray Enhancement Proposals☆61Updated 3 weeks ago
- Fault-tolerant for DL frameworks☆70Updated 2 years ago
- Virtualized Elastic KV Cache for Dynamic GPU Sharing and Beyond☆735Updated last month
- A workload for deploying LLM inference services on Kubernetes☆153Updated 2 weeks ago
- alibabacloud-jindodata☆200Updated last month
- A high-performance RL training-inference weight synchronization framework, designed to enable second-level parameter updates from trainin…☆123Updated 2 weeks ago
- A high-performance framework for training wide-and-deep recommender systems on heterogeneous cluster☆159Updated last year
- Automatic tuning for ML model deployment on Kubernetes☆81Updated last year
- RayDP provides simple APIs for running Spark on Ray and integrating Spark with AI libraries.☆357Updated 2 weeks ago
- ☆518Updated last month
- ☆138Updated this week
- Elastic Deep Learning for deep learning framework on Kubernetes☆175Updated 2 years ago
- Vector search engine inside Milvus, integrating FAISS, HNSW, DiskANN.☆310Updated this week
- Pretrain, finetune and serve LLMs on Intel platforms with Ray☆131Updated 3 months ago
- A modular acceleration toolkit for big data analytic engines☆67Updated last year
- Material for Ray Connect 2024 Conference☆12Updated last year