amogkam / batch-inference-benchmarks
☆16Updated last year
Alternatives and similar repositories for batch-inference-benchmarks:
Users that are interested in batch-inference-benchmarks are comparing it to the libraries listed below
- Some microbenchmarks and design docs before commencement☆12Updated 4 years ago
- WIP. Veloce is a low-code Ray-based parallelization library that makes machine learning computation novel, efficient, and heterogeneous.☆18Updated 2 years ago
- ☆15Updated last year
- Deadline-based hyperparameter tuning on RayTune.☆31Updated 5 years ago
- Ray-based Apache Beam runner☆43Updated last year
- A Ray-based data loader with per-epoch shuffling and configurable pipelining, for shuffling and loading training data for distributed tra…☆18Updated 2 years ago
- Machine Learning Inference Graph Spec☆21Updated 5 years ago
- Condor allows for the specification of synopsis-based streaming jobs on top of general dataflow systems. Condor provides a collection of …☆13Updated 7 months ago
- Mobius is an AI infrastructure platform for distributed online learning, including online sample processing, training and serving.☆93Updated 7 months ago
- Distributed ML Optimizer☆30Updated 3 years ago
- Pretrain, finetune and serve LLMs on Intel platforms with Ray☆110Updated 2 months ago
- Lightning In-Memory Object Store☆44Updated 3 years ago
- Exoshuffle-CloudSort☆24Updated last year
- Repository to go along with the paper "Plumber: Diagnosing and Removing Performance Bottlenecks in Machine Learning Data Pipelines"☆9Updated 2 years ago
- Benchmark for machine learning model online serving (LLM, embedding, Stable-Diffusion, Whisper)☆28Updated last year
- Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on sing…☆23Updated 4 months ago
- Python library to run ML/data pipelines on stateless compute infrastructure (that may be ephemeral or serverless). Please see the documen…☆18Updated last year
- Documentation for Hopsworks and Hops☆11Updated 3 years ago
- A minimal shared memory object store design☆49Updated 8 years ago
- ForestFlow is a policy-driven Machine Learning Model Server. It is an LF AI Foundation incubation project.☆72Updated 11 months ago
- Parameter Server implementation in Apache Flink.☆14Updated 6 years ago
- Dremio Flight connector. Access Dremio using Arrow flight☆40Updated 4 years ago
- The driver for LMCache core to run in vLLM☆20Updated 2 weeks ago
- MLFlow Deployment Plugin for Ray Serve☆43Updated 2 years ago
- Examples for using Amazon SageMaker components in Kubeflow Pipelines☆22Updated 4 years ago
- Rayvens makes it possible for data scientists to access hundreds of data services within Ray with little effort.☆48Updated 2 years ago
- The DGL Operator makes it easy to run Deep Graph Library (DGL) graph neural network training on Kubernetes☆44Updated 3 years ago
- A composable framework for fast and scalable data analytics☆57Updated 2 years ago
- Documentation and resources for deploying JupyterHub on Hadoop☆18Updated 5 years ago
- Spark Shuffle Optimization with RDMA+AEP☆30Updated last year