huggingface / optimum-onnx
🤗 Optimum ONNX: Export your model to ONNX and run inference with ONNX Runtime
☆43 · Updated this week
Alternatives and similar repositories for optimum-onnx
Users who are interested in optimum-onnx are comparing it to the libraries listed below.
- Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models. ☆87 · Updated 2 weeks ago
- Google TPU optimizations for transformers models ☆120 · Updated 7 months ago
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers. ☆33 · Updated last week
- vLLM adapter for a TGIS-compatible gRPC server. ☆39 · Updated this week
- ☆42 · Updated last week
- Trade any tensors over the network ☆30 · Updated last year
- A massively multilingual modern encoder language model ☆80 · Updated last week
- Lightweight toolkit package to train and fine-tune 1.58bit Language models ☆88 · Updated 3 months ago
- Python bindings for symphonia/opus - read various audio formats from Python and write opus files ☆68 · Updated last month
- 👷 Build compute kernels ☆143 · Updated this week
- Open-source reproducible benchmarks from Argmax ☆58 · Updated last week
- Optimus is a flexible and scalable framework built to train language models efficiently across diverse hardware configurations, including… ☆67 · Updated 2 months ago
- ☆83 · Updated 3 months ago
- ☆124 · Updated 10 months ago
- FRP Fork ☆176 · Updated 5 months ago
- Experiments with BitNet inference on CPU ☆54 · Updated last year
- Simple high-throughput inference library ☆127 · Updated 4 months ago
- Rust crate for some audio utilities ☆26 · Updated 6 months ago
- Train, tune, and infer Bamba model ☆132 · Updated 3 months ago
- A small Rust-based data loader ☆31 · Updated 3 months ago
- QLoRA with Enhanced Multi GPU Support ☆37 · Updated 2 years ago
- Tiled Flash Linear Attention library for fast and efficient mLSTM Kernels. ☆68 · Updated last month
- **ARCHIVED** Filesystem interface to 🤗 Hub ☆58 · Updated 2 years ago
- ☆69 · Updated 2 months ago
- Implementation of a Light Recurrent Unit in PyTorch ☆48 · Updated 11 months ago
- Load compute kernels from the Hub ☆283 · Updated this week
- Hugging Face Jobs ☆19 · Updated 2 months ago
- Experimental playground for benchmarking language model (LM) architectures, layers, and tricks on smaller datasets. Designed for flexible… ☆78 · Updated last week
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data ☆21 · Updated last year
- ☆59 · Updated last year