feature-store / ralf
☆31Updated 2 years ago
Alternatives and similar repositories for ralf:
Users that are interested in ralf are comparing it to the libraries listed below
- A resilient distributed training framework☆89Updated 11 months ago
- Tracking Ray Enhancement Proposals☆50Updated 3 weeks ago
- Stateful LLM Serving☆48Updated last week
- FTPipe and related pipeline model parallelism research.☆41Updated last year
- Modyn is a research-platform for training ML models on growing datasets.☆46Updated this week
- Distributed ML Optimizer☆30Updated 3 years ago
- ☆43Updated 3 years ago
- ☆44Updated last year
- Exoshuffle-CloudSort☆24Updated 2 years ago
- Deadline-based hyperparameter tuning on RayTune.☆31Updated 5 years ago
- SpotServe: Serving Generative Large Language Models on Preemptible Instances☆112Updated last year
- AlpaServe: Statistical Multiplexing with Model Parallelism for Deep Learning Serving (OSDI 23)☆82Updated last year
- Simple Distributed Deep Learning on TensorFlow☆134Updated 2 years ago
- Bamboo is a system for running large pipeline-parallel DNNs affordably, reliably, and efficiently using spot instances.☆49Updated 2 years ago
- ☆93Updated 2 years ago
- Python package for rematerialization-aware gradient checkpointing☆24Updated last year
- ☆15Updated last year
- sensAI: ConvNets Decomposition via Class Parallelism for Fast Inference on Live Data☆64Updated 7 months ago
- LLM Serving Performance Evaluation Harness☆70Updated 3 weeks ago
- ML Input Data Processing as a Service. This repository contains the source code for Cachew (built on top of TensorFlow).☆37Updated 6 months ago
- PipeSwitch: Fast Pipelined Context Switching for Deep Learning Applications☆127Updated 2 years ago
- ☆45Updated 8 months ago
- ☆23Updated last year
- Boost hardware utilization for ML training workloads via Inter-model Horizontal Fusion☆32Updated 10 months ago
- Releasing the spot availability traces used in "Can't Be Late" paper.☆18Updated 11 months ago
- Model-less Inference Serving☆85Updated last year
- Research and development for optimizing transformers☆125Updated 4 years ago
- Microsoft Collective Communication Library☆60Updated 3 months ago
- Ultra | Ultimate | Unified CCL☆50Updated last month
- ☆53Updated 9 months ago