ray-project / enhancements
Tracking Ray Enhancement Proposals
☆50Updated 2 weeks ago
Alternatives and similar repositories for enhancements:
Users that are interested in enhancements are comparing it to the libraries listed below
- Mobius is an AI infrastructure platform for distributed online learning, including online sample processing, training and serving.☆94Updated 8 months ago
- Ray-based Apache Beam runner☆43Updated last year
- Pygloo provides Python bindings for Gloo.☆21Updated 2 weeks ago
- ☆31Updated 2 years ago
- A tensor-aware point-to-point communication primitive for machine learning☆253Updated 2 years ago
- RayDP provides simple APIs for running Spark on Ray and integrating Spark with AI libraries.☆327Updated last week
- A minimal shared memory object store design☆50Updated 8 years ago
- Exoshuffle-CloudSort☆24Updated 2 years ago
- Distributed ML Optimizer☆30Updated 3 years ago
- A performant, memory-efficient checkpointing library for PyTorch applications, designed with large, complex distributed workloads in mind…☆154Updated 3 months ago
- Deadline-based hyperparameter tuning on RayTune.☆31Updated 5 years ago
- A portable Pythonic Data Lakehouse powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to …☆195Updated this week
- distributed-embeddings is a library for building large embedding based models in Tensorflow 2.☆43Updated last year
- HierarchicalKV is a part of NVIDIA Merlin and provides hierarchical key-value storage to meet RecSys requirements. The key capability of…☆140Updated last week
- TorchX is a universal job launcher for PyTorch applications. TorchX is designed to have fast iteration time for training/research and sup…☆350Updated this week
- Python bindings for UCX☆126Updated 2 weeks ago
- Ray - A curated list of resources: https://github.com/ray-project/ray☆52Updated last month
- Simple Distributed Deep Learning on TensorFlow☆134Updated 2 years ago
- NCCL Fast Socket is a transport layer plugin to improve NCCL collective communication performance on Google Cloud.☆116Updated last year
- Home for OctoML PyTorch Profiler☆107Updated last year
- WIP. Veloce is a low-code Ray-based parallelization library that makes machine learning computation novel, efficient, and heterogeneous.☆18Updated 2 years ago
- Lightning In-Memory Object Store☆45Updated 3 years ago
- A Ray-based data loader with per-epoch shuffling and configurable pipelining, for shuffling and loading training data for distributed tra…☆18Updated 2 years ago
- Resource-adaptive cluster scheduler for deep learning training.☆435Updated 2 years ago
- ☆15Updated last year
- PArametrized Recommendation and Ai Model benchmark is a repository for development of numerous uBenchmarks as well as end to end nets for…☆130Updated this week
- RAPIDS GPU-BDB☆108Updated last year
- A library that translates Python and NumPy to optimized distributed systems code.☆132Updated 2 years ago
- Fairring (FAIR + Herring) is a plug-in for PyTorch that provides a process group for distributed training that outperforms NCCL at large …☆63Updated 2 years ago
- CUDA checkpoint and restore utility☆305Updated last month