huggingface / optimum-onnxLinks
π€ Optimum ONNX: Export your model to ONNX and run inference with ONNX Runtime
β105Updated last week
Alternatives and similar repositories for optimum-onnx
Users that are interested in optimum-onnx are comparing it to the libraries listed below
Sorting:
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.β32Updated 3 months ago
- β90Updated 5 months ago
- A small rust-based data loaderβ34Updated last month
- π€ Trade any tensors over the networkβ30Updated 2 years ago
- Datamodels for hugging face tokenizersβ86Updated last month
- Rust crate for some audio utilitiesβ25Updated 9 months ago
- Hugging Face Jobsβ19Updated 5 months ago
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open dataβ23Updated last year
- β59Updated last year
- Use safetensors with ONNX π€β78Updated 2 months ago
- PyLate efficient inference engineβ68Updated 3 months ago
- Fast, Modern, and Low Precision PyTorch Optimizersβ119Updated this week
- Google TPU optimizations for transformers modelsβ131Updated last week
- π· Build compute kernelsβ195Updated last week
- FRP Forkβ177Updated 8 months ago
- Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models.β88Updated last month
- Optimus is a flexible and scalable framework built to train language models efficiently across diverse hardware configurations, includingβ¦β67Updated 3 weeks ago
- β53Updated 10 months ago
- NLP with Rust for Python π¦πβ70Updated 7 months ago
- A file utility for accessing both local and remote files through a unified interface.β44Updated 2 weeks ago
- ML/DL Math and Method notesβ65Updated 2 years ago
- **ARCHIVED** Filesystem interface to π€ Hubβ58Updated 2 years ago
- QLoRA with Enhanced Multi GPU Supportβ37Updated 2 years ago
- β101Updated 6 months ago
- vLLM adapter for a TGIS-compatible gRPC server.β46Updated this week
- Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and teβ¦β44Updated last year
- python bindings for symphonia/opus - read various audio formats from python and write opus filesβ72Updated 5 months ago
- Repository containing the SPIN experiments on the DIBT 10k ranked promptsβ24Updated last year
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching oβ¦β154Updated 5 months ago
- β13Updated last week