huggingface / optimum-onnxLinks
π€ Optimum ONNX: Export your model to ONNX and run inference with ONNX Runtime
β95Updated last week
Alternatives and similar repositories for optimum-onnx
Users that are interested in optimum-onnx are comparing it to the libraries listed below
Sorting:
- π· Build compute kernelsβ192Updated this week
- A small rust-based data loaderβ33Updated 3 weeks ago
- Rust crate for some audio utilitiesβ25Updated 9 months ago
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.β32Updated 2 months ago
- β43Updated last month
- python bindings for symphonia/opus - read various audio formats from python and write opus filesβ70Updated 4 months ago
- Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models.β88Updated 3 weeks ago
- Hugging Face Jobsβ19Updated 4 months ago
- Google TPU optimizations for transformers modelsβ124Updated 10 months ago
- Datamodels for hugging face tokenizersβ86Updated last week
- Optimus is a flexible and scalable framework built to train language models efficiently across diverse hardware configurations, includingβ¦β67Updated this week
- vLLM adapter for a TGIS-compatible gRPC server.β45Updated this week
- β86Updated 5 months ago
- Simple high-throughput inference libraryβ150Updated 6 months ago
- Thin wrapper around GGML to make life easierβ40Updated last month
- πΉοΈ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models.β139Updated last year
- High-throughput tensor loading for PyTorchβ209Updated this week
- Proof of concept for running moshi/hibiki using webrtcβ19Updated 9 months ago
- Lightweight toolkit package to train and fine-tune 1.58bit Language modelsβ100Updated 6 months ago
- Efficient non-uniform quantization with GPTQ for GGUFβ53Updated 2 months ago
- Use safetensors with ONNX π€β76Updated 2 months ago
- β13Updated 10 months ago
- Load compute kernels from the Hubβ348Updated this week
- Fast, Modern, and Low Precision PyTorch Optimizersβ116Updated 3 months ago
- implement llava using candleβ15Updated last year
- FRP Forkβ177Updated 8 months ago
- β124Updated last year
- β53Updated 9 months ago
- β97Updated 6 months ago
- π€ Trade any tensors over the networkβ30Updated 2 years ago