microsoft / onnxruntime-web-benchmarkLinks

ONNX Runtime Web benchmark tool

☆8

Alternatives and similar repositories for onnxruntime-web-benchmark

Users that are interested in onnxruntime-web-benchmark are comparing it to the libraries listed below

Sorting:

Narsil / hf-chat
☆26Updated 5 months ago
kyutai-labs / moshi-webrtc
Proof of concept for running moshi/hibiki using webrtc
☆19Updated 3 months ago
huggingface / transformers.js-benchmarking
☆15Updated last month
mithril-security / tokenizers-wasm
wasm bindings for huggingface tokenizers library
☆34Updated 2 years ago
foundation-model-stack / fms-model-optimizer
FMS Model Optimizer is a framework for developing reduced precision neural network models.
☆20Updated this week
huggingface / optimum-furiosa
Accelerated inference of 🤗 models using FuriosaAI NPU chips.
☆26Updated 11 months ago
FL33TW00D / embd
GPU accelerated client-side embeddings for vector search, RAG etc.
☆66Updated last year
leo-du / llama2.rs
Inference Llama 2 in one file of zero-dependency, zero-unsafe Rust
☆38Updated last year
iamlemec / bert.cpp
GGML implementation of BERT model with Python bindings and quantization.
☆55Updated last year
catid / bitnet_cpu
Experiments with BitNet inference on CPU
☆55Updated last year
ngxson / ggml-easy
Thin wrapper around GGML to make life easier
☆34Updated this week
huggingface / ember
ANE accelerated embedding models!
☆17Updated 5 months ago
cwhy / rwkv-decon
Trying to deconstruct RWKV in understandable terms
☆14Updated 2 years ago
kyutai-labs / kaudio
Rust crate for some audio utilities
☆23Updated 2 months ago
jquesnelle / ctranslate2-rs
Rust bindings for CTranslate2
☆14Updated last year
josephrocca / rwkv-v4-web
BlinkDL's RWKV-v4 running in the browser
☆46Updated 2 years ago
unslothai / llama.cpp
LLM inference in C/C++
☆77Updated 3 weeks ago
sdpython / onnxcustom
Tutorial on how to convert machine learned models into ONNX
☆16Updated 2 years ago
Narsil / bloomserver
☆39Updated 2 years ago
Intelligent-Internet / II-Commons
This repository `II-Commons` contains tools for managing text and image datasets, including loading, fetching, and embedding large datase…
☆12Updated 2 weeks ago
Systemcluster / kitoken
Fast and versatile tokenizer for language models, compatible with SentencePiece, Tokenizers, Tiktoken and more. Supports BPE, Unigram and…
☆25Updated 2 months ago
tairov / lamatune
LLama implementations benchmarking framework
☆12Updated last year
character-ai / MuKoe
☆54Updated last year
LLM360 / k2-data-prep
☆20Updated last year
qdrant / bm42_eval
Evaluation of bm42 sparse indexing algorithm
☆67Updated 10 months ago
nunoplopes / torchy
A tracing JIT compiler for PyTorch
☆13Updated 3 years ago
Stability-AI / gpt-neox
An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.
☆13Updated 2 years ago
official-elinas / zeus-llm-trainer
Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models
☆69Updated last year
isEmmanuelOlowe / llm-cost-estimator
Estimating hardware and cloud costs of LLMs and transformer projects
☆16Updated last year
RWKV / rwkv-onnx
A converter and basic tester for rwkv onnx
☆41Updated last year