amogkam / batch-inference-benchmarksLinks
☆19Updated 2 years ago
Alternatives and similar repositories for batch-inference-benchmarks
Users that are interested in batch-inference-benchmarks are comparing it to the libraries listed below
Sorting:
- Mobius is an AI infrastructure platform for distributed online learning, including online sample processing, training and serving.☆100Updated last year
- Exoshuffle-CloudSort☆29Updated 2 years ago
- A Ray-based data loader with per-epoch shuffling and configurable pipelining, for shuffling and loading training data for distributed tra…☆18Updated 3 years ago
- ☆16Updated 2 years ago
- Pretrain, finetune and serve LLMs on Intel platforms with Ray☆130Updated 4 months ago
- Lightning In-Memory Object Store☆47Updated 4 years ago
- MLFlow Deployment Plugin for Ray Serve☆46Updated 3 years ago
- A minimal shared memory object store design☆60Updated 9 years ago
- Distributed ML Optimizer☆35Updated 4 years ago
- Serverless ML Framework☆107Updated 3 years ago
- Some microbenchmarks and design docs before commencement☆12Updated 5 years ago
- Trisk on Flink☆16Updated 3 years ago
- ForestFlow is a policy-driven Machine Learning Model Server. It is an LF AI Foundation incubation project.☆73Updated last year
- Code for Ernest☆34Updated 2 years ago
- Three examples of recommendation system pipelines with NVIDIA Merlin and Redis☆70Updated 9 months ago
- SnailTrail implementation☆40Updated 6 years ago
- Machine Learning Inference Graph Spec☆21Updated 6 years ago
- Elastic Deep Learning for deep learning framework on Kubernetes☆175Updated 2 years ago
- ☆13Updated 7 years ago
- WIP. Veloce is a low-code Ray-based parallelization library that makes machine learning computation novel, efficient, and heterogeneous.☆17Updated 3 years ago
- This repository contains statistics about the AI Infrastructure products.☆17Updated 11 months ago
- The specification of the LDBC Financial Benchmark☆19Updated last month
- Parameter Server implementation in Apache Flink.☆14Updated 7 years ago
- Distributed XGBoost on Ray☆152Updated last year
- Wukong: A scalable and locality-enhanced serverless parallel framework (ACM SoCC'20)☆76Updated last year
- ☆30Updated 3 years ago
- FlorDB 🌻☆158Updated 3 months ago
- Fast I/O plugins for Spark☆41Updated 5 years ago
- Exploiting Cloud Services for Cost-Effective, SLO-Aware Machine Learning Inference Serving☆37Updated 6 years ago
- Tune efficiently any LLM model from HuggingFace using distributed training (multiple GPU) and DeepSpeed. Uses Ray AIR to orchestrate the …☆60Updated 2 years ago