microsoft / nxs
Neural Network Execution Service
☆12 Updated last year
Related projects
Alternatives and complementary repositories for nxs
- T5Patches is a set of tools for fast and targeted editing of generative language models built with T5X. ☆12 Updated 5 months ago
- Scoreboard for ONNX Backend Compatibility ☆27 Updated this week
- ☆34 Updated last year
- Composable metric reporters in Python. ☆12 Updated 5 months ago
- 3rd party dependencies for DALI project ☆10 Updated last week
- Creating Generative AI Apps which work ☆16 Updated 4 months ago
- ☆20 Updated this week
- Scripts supporting the development and serving the Roots Search Tool - https://hf.co/spaces/bigscience-data/roots-search ☆10 Updated last year
- Cortex-compatible model server for Python and TensorFlow ☆16 Updated last year
- Renee: End-to-end training of extreme classification models ☆21 Updated last year
- Code for paper: "Privately generating tabular data using language models". ☆14 Updated last year
- API serving for your diffusers models ☆10 Updated 10 months ago
- Sentence Embedding as a Service ☆14 Updated last year
- The official repo of our research work "Interactive Editing for Text Summarization". ☆22 Updated last year
- A file utility for accessing both local and remote files through a unified interface. ☆36 Updated 3 months ago
- Open sourced backend for Martian's LLM Inference Provider Leaderboard ☆17 Updated 3 months ago
- MLFlow Deployment Plugin for Ray Serve ☆42 Updated 2 years ago
- Evaluate Transformers from the Hub 🔥 ☆13 Updated 11 months ago
- graspologic-native is a library of rust components to add additional capability to graspologic a python library for intelligently buildin… ☆9 Updated 9 months ago
- struct2tensor is a library for parsing and manipulating structured data inside of tensorflow. ☆34 Updated last month
- Deploy your HPC Cluster on AWS in 20min. with just 1-Click. ☆52 Updated 2 months ago
- Notes and artifacts from the ONNX steering committee ☆25 Updated last week
- Create a source of truth for ML model results and browse it on Papers with Code ☆26 Updated 3 years ago
- This repository contains statistics about the AI Infrastructure products. ☆18 Updated 4 months ago
- Hugging Face and Pyserini interoperability ☆19 Updated last year
- PyTorch/XLA integration with JetStream (https://github.com/google/JetStream) for LLM inference ☆40 Updated last week
- ☆14 Updated last year
- This repo contains data and code for the paper "Reasoning over Public and Private Data in Retrieval-Based Systems." ☆46 Updated 4 months ago
- Triton Server Component for lightning.ai ☆14 Updated last year