project-codeflare / zero-copy-model-loading
In-depth code associated with my Medium blog post, "How to Load PyTorch Models 340 Times Faster with Ray"
☆26Updated 2 years ago
Alternatives and similar repositories for zero-copy-model-loading
Users that are interested in zero-copy-model-loading are comparing it to the libraries listed below
- Simple dependency injection framework for Python☆21Updated last year
- Productionize machine learning predictions, with ONNX or without☆65Updated last year
- Cortex-compatible model server for Python and TensorFlow☆17Updated 2 years ago
- benchmarking some transformer deployments☆26Updated 2 years ago
- MLFlow Deployment Plugin for Ray Serve☆45Updated 3 years ago
- Serialize JAX, Flax, Haiku, or Objax model params with 🤗`safetensors`☆44Updated last year
- 🐍 Python bindings for the Hora Approximate Nearest Neighbor Search Algorithm library☆72Updated 3 years ago
- Model compression for ONNX☆96Updated 6 months ago
- Pygloo provides Python bindings for Gloo.☆22Updated 3 months ago
- Notes and artifacts from the ONNX steering committee☆26Updated this week
- ☆39Updated 2 years ago
- A collection of reproducible inference engine benchmarks☆31Updated last month
- Some microbenchmarks and design docs before commencement☆12Updated 4 years ago
- An unofficial Python client library for Lambda Labs' Cloud Computing Platform☆12Updated 2 years ago
- A user-friendly tool chain that enables the seamless execution of ONNX models using JAX as the backend.☆110Updated 3 weeks ago
- 🏙 Interactive performance profiling and debugging tool for PyTorch neural networks.☆61Updated 4 months ago
- wasm bindings for huggingface tokenizers library☆34Updated 2 years ago
- ☆13Updated 2 years ago
- Sentence Embedding as a Service☆15Updated last year
- WIP. Veloce is a low-code Ray-based parallelization library that makes machine learning computation novel, efficient, and heterogeneous.☆18Updated 2 years ago
- A file utility for accessing both local and remote files through a unified interface.☆42Updated 3 weeks ago
- PyTorch Single Controller☆16Updated this week
- A lightweight wrapper for PyTorch that provides a simple declarative API for context switching between devices, distributed modes, mixed-…☆67Updated last year
- Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and te…☆42Updated last year
- Module, Model, and Tensor Serialization/Deserialization☆234Updated last week
- ☆34Updated 2 weeks ago
- Plugin for deploying MLflow models to TorchServe☆109Updated 2 years ago
- Demo of the unit_scaling library, showing how a model can be easily adapted to train in FP8.☆44Updated 10 months ago
- Article about deploying machine learning models using gRPC, PyTorch, and asyncio☆28Updated 2 years ago
- High-performance safetensors model loader☆36Updated this week