microsoft / nxs
Neural Network Execution Service
☆11 Updated last year
Alternatives and similar repositories for nxs:
Users who are interested in nxs are comparing it to the libraries listed below.
- T5Patches is a set of tools for fast and targeted editing of generative language models built with T5X. ☆12 Updated 7 months ago
- Notes and artifacts from the ONNX steering committee ☆25 Updated last week
- Renee: End-to-end training of extreme classification models ☆21 Updated last year
- Scoreboard for ONNX Backend Compatibility ☆27 Updated this week
- ☆12 Updated 3 years ago
- Lightweight Deep Learning Model Training library based on PyTorch ☆32 Updated 2 years ago
- graspologic-native is a library of Rust components to add additional capability to graspologic, a Python library for intelligently buildin… ☆12 Updated this week
- struct2tensor is a library for parsing and manipulating structured data inside of TensorFlow. ☆34 Updated last month
- Scripts supporting the development and serving the Roots Search Tool - https://hf.co/spaces/bigscience-data/roots-search ☆10 Updated last year
- Composable metric reporters in Python. ☆13 Updated 7 months ago
- ☆35 Updated last year
- benchmarking some transformer deployments ☆26 Updated last year
- This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he… ☆31 Updated last year
- Code for paper: "Privately generating tabular data using language models". ☆14 Updated last year
- MozoLM: A language model (LM) serving library ☆44 Updated 2 months ago
- ☆57 Updated 7 months ago
- Imageinary is a reproducible mechanism which is used to generate large image datasets at various resolutions. The tool supports multiple … ☆26 Updated last year
- Home for OctoML PyTorch Profiler ☆107 Updated last year
- Implementation of a TensorFlow XLA rematerialization pass ☆15 Updated 5 years ago
- Tutorial on how to convert machine learned models into ONNX ☆16 Updated last year
- ONNX Runtime Web benchmark tool ☆8 Updated last year
- Cortex-compatible model server for Python and TensorFlow ☆17 Updated 2 years ago
- ☆21 Updated this week
- Open sourced backend for Martian's LLM Inference Provider Leaderboard ☆17 Updated 5 months ago
- Deploy your HPC cluster on AWS in 20 minutes with just one click. ☆53 Updated 4 months ago
- Open deep learning compiler stack for CPU, GPU and specialized accelerators ☆34 Updated 2 years ago
- ☆23 Updated 2 years ago
- GPU Environment Management for Visual Studio Code ☆37 Updated last year