project-codeflare / zero-copy-model-loading
In-depth code associated with my Medium blog post, "How to Load PyTorch Models 340 Times Faster with Ray"
☆26, updated 2 years ago
Alternatives and similar repositories for zero-copy-model-loading:
Users who are interested in zero-copy-model-loading are comparing it to the libraries listed below:
- Simple dependency injection framework for Python (☆20, updated 11 months ago)
- Pygloo provides Python bindings for Gloo. (☆22, updated last month)
- Python bindings for the Hora Approximate Nearest Neighbor Search Algorithm library (☆72, updated 3 years ago)
- Plugin for deploying MLflow models to TorchServe (☆108, updated 2 years ago)
- The Triton backend for the PyTorch TorchScript models. (☆146, updated this week)
- (☆30, updated last week)
- Module, Model, and Tensor Serialization/Deserialization (☆223, updated 2 months ago)
- experiments with inference on llama (☆104, updated 10 months ago)
- Serialize JAX, Flax, Haiku, or Objax model params with 🤗 `safetensors` (☆44, updated 10 months ago)
- A file utility for accessing both local and remote files through a unified interface. (☆40, updated last week)
- TorchFix - a linter for PyTorch-using code with autofix support (☆138, updated 2 months ago)
- Sentence Embedding as a Service (☆15, updated last year)
- (☆39, updated 2 years ago)
- (☆12, updated last year)
- A user-friendly tool chain that enables the seamless execution of ONNX models using JAX as the backend. (☆109, updated last week)
- A collection of reproducible inference engine benchmarks (☆24, updated this week)
- A performant, memory-efficient checkpointing library for PyTorch applications, designed with large, complex distributed workloads in mind… (☆156, updated 4 months ago)
- Distributed skorch on Ray Train (☆57, updated 2 years ago)
- Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and te… (☆42, updated last year)
- MLFlow Deployment Plugin for Ray Serve (☆44, updated 3 years ago)
- PyTorch centric eager mode debugger (☆47, updated 4 months ago)
- A lightweight wrapper for PyTorch that provides a simple declarative API for context switching between devices, distributed modes, mixed-… (☆67, updated last year)
- A top-like tool for monitoring GPUs in a cluster (☆86, updated last year)
- Home for OctoML PyTorch Profiler (☆112, updated 2 years ago)
- vLLM adapter for a TGIS-compatible gRPC server. (☆26, updated this week)
- The Triton backend for the ONNX Runtime. (☆140, updated last week)
- Productionize machine learning predictions, with ONNX or without (☆65, updated last year)
- benchmarking some transformer deployments (☆26, updated 2 years ago)
- Cortex-compatible model server for Python and TensorFlow (☆17, updated 2 years ago)
- ClearML - Model-Serving Orchestration and Repository Solution (☆149, updated 3 months ago)