huggingface / transformers.js-benchmarking
★15 · Updated 2 months ago
Alternatives and similar repositories for transformers.js-benchmarking
Users who are interested in transformers.js-benchmarking are comparing it to the libraries listed below.
- ⚡Delightful WebNN resources, curated list of awesome things around WebNN ecosystem. ★65 · Updated 3 weeks ago
- Browse, search, and visualize ONNX models. ★32 · Updated 2 months ago
- Train text generation model with JavaScript. ★15 · Updated last year
- Add local LLMs to your Web or Electron apps! Powered by Rust + WebGPU ★102 · Updated 2 years ago
- Inference Llama 2 in one file of pure JavaScript (HTML) ★33 · Updated last month
- Simple text to phones converter using eSpeak NG. ★30 · Updated 6 months ago
- JavaScript bindings for the ggml-js library ★43 · Updated 3 months ago
- Thin wrapper around GGML to make life easier ★36 · Updated 3 weeks ago
- A JavaScript implementation of Llama 3 using node-mlx. ★73 · Updated last year
- Find out why your CoreML model isn't running on the Neural Engine! ★25 · Updated last year
- GPU accelerated client-side embeddings for vector search, RAG etc. ★66 · Updated last year
- Profile your CoreML models directly from Python ★28 · Updated 9 months ago
- ★14 · Updated 7 months ago
- ★26 · Updated 7 months ago
- ★46 · Updated 3 months ago
- trying to make WebGPU a bit easier to use ★16 · Updated last year
- Website with current metrics on the fastest AI models. ★41 · Updated 8 months ago
- A polyfill for the WICG Observable ★12 · Updated last week
- Node.js binding of llama.cpp ★12 · Updated last week
- A mono-repo to house the various supported Transport options to be used with Pipecat's client-js package ★26 · Updated last week
- Record and stream WAV audio data in the browser across all platforms ★85 · Updated 8 months ago
- Browser based ML Inference | OpenAI compliant | Run models like DeepSeek, Llama 3.2, NomicEmbed, KokoroTTS, and more ★37 · Updated 4 months ago
- ★60 · Updated last week
- Using headless Chrome on server side environments for true client side browser emulation with NVIDIA T4 GPUs for Web AI model testing or … ★79 · Updated last year
- llama.cpp GGUF file parser for JavaScript ★43 · Updated 7 months ago
- Retrieve large (GBs) AI binary model files from cloud, cache locally as sharded blobs to load faster on 2nd page load, returns stored fil… ★28 · Updated 2 months ago
- GGML implementation of BERT model with Python bindings and quantization. ★26 · Updated last year
- A cog implementation of Nvidia's Triton server ★17 · Updated 8 months ago
- ★84 · Updated this week
- Code for training & inference with FLAN family of models ★17 · Updated 2 years ago