microsoft / onnxruntime-web-benchmarkLinks
ONNX Runtime Web benchmark tool
☆8Updated last year
Alternatives and similar repositories for onnxruntime-web-benchmark
Users that are interested in onnxruntime-web-benchmark are comparing it to the libraries listed below
Sorting:
- ☆26Updated 5 months ago
- Proof of concept for running moshi/hibiki using webrtc☆19Updated 3 months ago
- ☆15Updated last month
- wasm bindings for huggingface tokenizers library☆34Updated 2 years ago
- FMS Model Optimizer is a framework for developing reduced precision neural network models.☆20Updated this week
- Accelerated inference of 🤗 models using FuriosaAI NPU chips.☆26Updated 11 months ago
- GPU accelerated client-side embeddings for vector search, RAG etc.☆66Updated last year
- Inference Llama 2 in one file of zero-dependency, zero-unsafe Rust☆38Updated last year
- GGML implementation of BERT model with Python bindings and quantization.☆55Updated last year
- Experiments with BitNet inference on CPU☆55Updated last year
- Thin wrapper around GGML to make life easier☆34Updated this week
- ANE accelerated embedding models!☆17Updated 5 months ago
- Trying to deconstruct RWKV in understandable terms☆14Updated 2 years ago
- Rust crate for some audio utilities☆23Updated 2 months ago
- Rust bindings for CTranslate2☆14Updated last year
- BlinkDL's RWKV-v4 running in the browser☆46Updated 2 years ago
- LLM inference in C/C++☆77Updated 3 weeks ago
- Tutorial on how to convert machine learned models into ONNX☆16Updated 2 years ago
- ☆39Updated 2 years ago
- This repository `II-Commons` contains tools for managing text and image datasets, including loading, fetching, and embedding large datase…☆12Updated 2 weeks ago
- Fast and versatile tokenizer for language models, compatible with SentencePiece, Tokenizers, Tiktoken and more. Supports BPE, Unigram and…☆25Updated 2 months ago
- LLama implementations benchmarking framework☆12Updated last year
- ☆54Updated last year
- ☆20Updated last year
- Evaluation of bm42 sparse indexing algorithm☆67Updated 10 months ago
- A tracing JIT compiler for PyTorch☆13Updated 3 years ago
- An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.☆13Updated 2 years ago
- Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models☆69Updated last year
- Estimating hardware and cloud costs of LLMs and transformer projects☆16Updated last year
- A converter and basic tester for rwkv onnx☆41Updated last year