microsoft / onnxruntime-web-benchmark
ONNX Runtime Web benchmark tool
☆8 Updated last year
Alternatives and similar repositories for onnxruntime-web-benchmark:
Users who are interested in onnxruntime-web-benchmark are comparing it to the repositories listed below.
- Rust crate for some audio utilities ☆23 Updated 2 months ago
- Proof of concept for running moshi/hibiki using webrtc ☆18 Updated 2 months ago
- ☆26 Updated 4 months ago
- ☆15 Updated last week
- FMS Model Optimizer is a framework for developing reduced precision neural network models. ☆17 Updated last week
- NanoGPT (124M) in 5 minutes ☆9 Updated 2 months ago
- Use safetensors with ONNX 🤗 ☆56 Updated 2 months ago
- TensorRT-LLM server with Structured Outputs (JSON) built with Rust ☆52 Updated 2 weeks ago
- LLama implementations benchmarking framework ☆12 Updated last year
- python bindings for symphonia/opus - read various audio formats from python and write opus files ☆58 Updated last week
- ☆48 Updated 4 years ago
- A small rust-based data loader ☆24 Updated 4 months ago
- Rust bindings for CTranslate2 ☆14 Updated last year
- Sentence Embedding as a Service ☆15 Updated last year
- Inference Llama 2 in one file of zero-dependency, zero-unsafe Rust ☆38 Updated last year
- ANE accelerated embedding models! ☆16 Updated 4 months ago
- Thin wrapper around GGML to make life easier ☆27 Updated last week
- Make triton easier ☆47 Updated 10 months ago
- Experiments with BitNet inference on CPU ☆54 Updated last year
- Profile your CoreML models directly from Python 🐍 ☆27 Updated 6 months ago
- ⚡Delightful WebNN resources, curated list of awesome things around WebNN ecosystem.😎 ☆56 Updated 2 weeks ago
- Tutorial on how to convert machine learned models into ONNX ☆16 Updated 2 years ago
- Accelerated inference of 🤗 models using FuriosaAI NPU chips. ☆26 Updated 10 months ago
- 🤗 Optimum ExecuTorch ☆35 Updated last week
- GPU accelerated client-side embeddings for vector search, RAG etc. ☆66 Updated last year
- ☆53 Updated last year
- ☆39 Updated 2 years ago
- Open-source and reproducible benchmarks for Speaker Diarization ☆23 Updated 3 weeks ago
- wasm bindings for huggingface tokenizers library ☆34 Updated 2 years ago
- llama.cpp gguf file parser for javascript ☆42 Updated 4 months ago