huggingface / optimum-onnx
🤗 Optimum ONNX: Export your model to ONNX and run inference with ONNX Runtime
★70 · Updated last week
Alternatives and similar repositories for optimum-onnx
Users who are interested in optimum-onnx are comparing it to the libraries listed below.
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers. ★33 · Updated last month
- FRP Fork ★175 · Updated 6 months ago
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data ★21 · Updated last year
- Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models. ★88 · Updated last month
- python bindings for symphonia/opus - read various audio formats from python and write opus files ★70 · Updated 3 months ago
- ★43 · Updated 2 weeks ago
- Hugging Face Jobs ★19 · Updated 3 months ago
- Proof of concept for running moshi/hibiki using webrtc ★19 · Updated 8 months ago
- A small rust-based data loader ★31 · Updated 4 months ago
- Optimus is a flexible and scalable framework built to train language models efficiently across diverse hardware configurations, including… ★68 · Updated 3 months ago
- Rust crate for some audio utilities ★25 · Updated 7 months ago
- 👷 Build compute kernels ★163 · Updated this week
- 🤗 Trade any tensors over the network ★30 · Updated 2 years ago
- Thin wrapper around GGML to make life easier ★40 · Updated 4 months ago
- Use safetensors with ONNX 🤗 ★73 · Updated 3 weeks ago
- Datamodels for hugging face tokenizers ★85 · Updated last month
- ★79 · Updated 3 months ago
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch ★102 · Updated 10 months ago
- Open-source reproducible benchmarks from Argmax ★65 · Updated last week
- Google TPU optimizations for transformers models ★121 · Updated 9 months ago
- vLLM adapter for a TGIS-compatible gRPC server. ★41 · Updated this week
- ★25 · Updated 10 months ago
- ★59 · Updated last year
- **ARCHIVED** Filesystem interface to 🤗 Hub ★58 · Updated 2 years ago
- ★21 · Updated last year
- A fast RWKV Tokenizer written in Rust ★54 · Updated 2 months ago
- 🕹️ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models. ★140 · Updated last year
- Lightweight package that tracks and summarizes code changes using LLMs (Large Language Models) ★34 · Updated 8 months ago
- Recaption large (Web)Datasets with vllm and save the artifacts. ★52 · Updated 11 months ago
- QLoRA with Enhanced Multi GPU Support ★37 · Updated 2 years ago