ray-project / enhancements
Tracking Ray Enhancement Proposals
☆50 Updated 3 weeks ago
Alternatives and similar repositories for enhancements:
Users who are interested in enhancements are comparing it to the libraries listed below.
- Mobius is an AI infrastructure platform for distributed online learning, including online sample processing, training and serving. ☆94 Updated 8 months ago
- Pygloo provides Python bindings for Gloo. ☆21 Updated 2 weeks ago
- Ray-based Apache Beam runner ☆43 Updated last year
- ☆31 Updated 2 years ago
- A portable Pythonic Data Lakehouse powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to … ☆195 Updated this week
- A performant, memory-efficient checkpointing library for PyTorch applications, designed with large, complex distributed workloads in mind… ☆155 Updated 3 months ago
- WIP. Veloce is a low-code Ray-based parallelization library that makes machine learning computation novel, efficient, and heterogeneous. ☆18 Updated 2 years ago
- Distributed ML Optimizer ☆30 Updated 3 years ago
- A minimal shared memory object store design ☆50 Updated 8 years ago
- Exoshuffle-CloudSort ☆24 Updated 2 years ago
- Home for OctoML PyTorch Profiler ☆108 Updated last year
- Deadline-based hyperparameter tuning on RayTune. ☆31 Updated 5 years ago
- A tensor-aware point-to-point communication primitive for machine learning ☆255 Updated 2 years ago
- Python bindings for UCX ☆126 Updated this week
- Ray - A curated list of resources: https://github.com/ray-project/ray ☆52 Updated last month
- A library that translates Python and NumPy to optimized distributed systems code. ☆132 Updated 2 years ago
- HierarchicalKV is a part of NVIDIA Merlin and provides hierarchical key-value storage to meet RecSys requirements. The key capability of… ☆140 Updated 2 weeks ago
- Resource-adaptive cluster scheduler for deep learning training. ☆436 Updated 2 years ago
- RAPIDS GPU-BDB ☆108 Updated last year
- Simple Distributed Deep Learning on TensorFlow ☆134 Updated 2 years ago
- Rayvens makes it possible for data scientists to access hundreds of data services within Ray with little effort. ☆50 Updated 2 years ago
- TorchX is a universal job launcher for PyTorch applications. TorchX is designed to have fast iteration time for training/research and sup… ☆351 Updated this week
- Fine-grained GPU sharing primitives ☆141 Updated 5 years ago
- A Ray-based data loader with per-epoch shuffling and configurable pipelining, for shuffling and loading training data for distributed tra… ☆18 Updated 2 years ago
- distributed-embeddings is a library for building large embedding-based models in TensorFlow 2. ☆43 Updated last year
- NVIDIA Resiliency Extension is a Python package for framework developers and users to implement fault-tolerant features. It improves the … ☆102 Updated this week
- Lightning In-Memory Object Store ☆45 Updated 3 years ago
- NCCL Fast Socket is a transport layer plugin to improve NCCL collective communication performance on Google Cloud. ☆116 Updated last year
- A schedule language for large model training ☆145 Updated 9 months ago