huggingface / optimum-onnxLinks
π€ Optimum ONNX: Export your model to ONNX and run inference with ONNX Runtime
β112Updated 3 weeks ago
Alternatives and similar repositories for optimum-onnx
Users that are interested in optimum-onnx are comparing it to the libraries listed below
Sorting:
- A small rust-based data loaderβ34Updated 2 months ago
- Rust crate for some audio utilitiesβ26Updated 10 months ago
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.β32Updated 4 months ago
- π€ Trade any tensors over the networkβ30Updated 2 years ago
- Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models.β89Updated last week
- Datamodels for hugging face tokenizersβ86Updated 2 weeks ago
- Hugging Face Jobsβ19Updated 6 months ago
- vLLM adapter for a TGIS-compatible gRPC server.β47Updated this week
- python bindings for symphonia/opus - read various audio formats from python and write opus filesβ75Updated last week
- FRP Forkβ177Updated 9 months ago
- β90Updated 6 months ago
- Optimus is a flexible and scalable framework built to train language models efficiently across diverse hardware configurations, includingβ¦β68Updated last month
- Use safetensors with ONNX π€β81Updated last week
- π· Build compute kernelsβ213Updated this week
- β46Updated 3 months ago
- Fast, Modern, and Low Precision PyTorch Optimizersβ120Updated 3 weeks ago
- Google TPU optimizations for transformers modelsβ132Updated last month
- Proof of concept for running moshi/hibiki using webrtcβ19Updated 10 months ago
- PyLate efficient inference engineβ69Updated last week
- β53Updated 11 months ago
- Simple high-throughput inference libraryβ155Updated 8 months ago
- β102Updated 7 months ago
- β59Updated last year
- β27Updated last year
- Train LLM on Hugging Face infraβ67Updated 2 months ago
- Python bindings for ggmlβ146Updated last year
- QLoRA with Enhanced Multi GPU Supportβ37Updated 2 years ago
- A client library in Rust for Nvidia Triton.β30Updated 2 years ago
- β125Updated last year
- Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients"β103Updated last year