microsoft / nxs
Neural Network Execution Service
☆11 Updated 2 years ago
Alternatives and similar repositories for nxs
Users who are interested in nxs are comparing it to the libraries listed below.
- WIP. Veloce is a low-code Ray-based parallelization library that makes machine learning computation novel, efficient, and heterogeneous. ☆17 Updated 3 years ago
- Sentence Embedding as a Service ☆15 Updated 7 months ago
- Cortex-compatible model server for Python and TensorFlow ☆18 Updated 3 years ago
- benchmarking some transformer deployments ☆26 Updated last month
- A lightweight wrapper for PyTorch that provides a simple declarative API for context switching between devices, distributed modes, mixed-… ☆66 Updated 2 years ago
- Home for OctoML PyTorch Profiler ☆113 Updated 2 years ago
- Tutorial on how to convert machine learned models into ONNX ☆16 Updated 2 years ago
- MLFlow Deployment Plugin for Ray Serve ☆46 Updated 3 years ago
- Triton Server Component for lightning.ai ☆14 Updated 2 years ago
- ☆16 Updated 2 months ago
- Codes for paper "KNAS: Green Neural Architecture Search" ☆93 Updated 4 years ago
- Machine learning utilities for model conversion, serialization, loading etc ☆27 Updated 3 years ago
- Python client for RedisAI ☆89 Updated 2 years ago
- ForestFlow is a policy-driven Machine Learning Model Server. It is an LF AI Foundation incubation project. ☆73 Updated last year
- Torch Distributed Experimental ☆117 Updated last year
- A Ray-based data loader with per-epoch shuffling and configurable pipelining, for shuffling and loading training data for distributed tra… ☆18 Updated 3 years ago
- Unified storage framework for the entire machine learning lifecycle ☆155 Updated last year
- The Triton backend for the PyTorch TorchScript models. ☆173 Updated this week
- Module, Model, and Tensor Serialization/Deserialization ☆286 Updated 5 months ago
- ☆14 Updated 3 years ago
- AskIt: Unified programming interface for programming with LLMs (GPT-3.5, GPT-4, Gemini, Claude, Cohere, Llama 2) ☆80 Updated last year
- A top-like tool for monitoring GPUs in a cluster ☆84 Updated last year
- A high performance data access library for machine learning tasks ☆74 Updated 2 years ago
- MLCube® is a project that reduces friction for machine learning by ensuring that models are easily portable and reproducible. ☆158 Updated 2 months ago
- A performant, memory-efficient checkpointing library for PyTorch applications, designed with large, complex distributed workloads in mind… ☆164 Updated 3 weeks ago
- torch::deploy (multipy for non-torch uses) is a system that lets you get around the GIL problem by running multiple Python interpreters i… ☆182 Updated last month
- ☆28 Updated 2 years ago
- Nod.ai 🦈 version of 👻 . You probably want to start at https://github.com/nod-ai/shark for the product and the upstream IREE repository … ☆107 Updated last month
- ☆22 Updated 3 weeks ago
- Open sourced backend for Martian's LLM Inference Provider Leaderboard ☆20 Updated last year